CATTCCGATT ATCATTTCAT ATAAAGAAGG TTTACATATT ATTAAAGATT TAATTGTTGC
GACATTACGA GCAGTTGTGC AATTAATCAT TTTGGGATTT TTGCTGCATT ATATTTTTAA 
AATAAACGAT AAATGGCTGC TTATTTTATG TGTATTGGTC ATTATTATTA ATGCATCATG 
GAATACAATT AGTCGAGCAT CACCAGTGAT GCATCATGTG TTTTGGATAT CATTTCTAGC 
TATCTTCATT GGAACGGCAT TACCGCTTGC AGGTACTATT GCGACAGGGG CCATTCAATT 
TACCGCAAAT GAAGTTATAC CTATCGGCGG CATGCTTGCA AATAATGGCT TGATTGCAAT
TAATTTAGCT TACCAGAATT TAGATCGTGC ATTCGTACAA GATGGTACTA ATATTGAATC 
TAAATTATCA CTTGCAGCTA CACCTAAATT GGCTTCTAAA GGTGCAATAC GTGAAAGTAT
TCGTTTAGCT ATAGTGCCAA CTATTGATTC GGTTAAAACA TATGGGCTTG TGTCGATTCC
TGGTATGATG ACAGGCTTAA TTATTGGTGG CGTACCACCT TTACAAGCGA TTAAATTTCA 
ATTGTTAGTC GTGTTTATTC ATACAACTGC GACCATTATG TCTGCTTTGA TTGCGACATA 
TTTAAGCTAT GGTCAATTTT TCAATGCAAG ACATCAATTA GTAGCACGAA ATACTGATGT 
TAAGAGTGAA TCATGATAGA TTTTACTGCA TCAGATTTAG GCATTAGTTT TAATTGGAAA
TGAAGTGACG CGCACATATA GTATCGCTAT TCATTAGCGC AGCGAAAATA TTCATAAAGG 
CACGCATACT TTGTAGTCAG TTATCTGTTC TGACATATAA AGCGTGCGTG CTTTTTTGGA
GTTATTGTTG AAACTGAAGT AATTATACAT AATTATTAAA TGACATACTT GTGTTAATTT 
TTCAAATACT GAAAAACAAT TTCaATAATT TTCCaATTAA GCACAGAAAA TTAAAGCAAA
ATATTATATA ATAGAACGGT TATATATaAA nATTngTgCA CACATTTTTT AATAAATCGT 
TATTCTAAGG GAAATGAATA TCGGAAATTT TGTTTGAAAG GAGTTTTAAA TTGTCAATCA 
TGCGACTATT TACATTCATT TTAAGTATTT TTATCGTAGG AATGGTTGAA ATGATGGTTG 
CAGGAATTAT GAACTTGATG AGTCAGGACT TACATGTATC AGAMCTGTC GTTGGTCAAT
TAGTGACAAT GTACGCTTTA ACATTTGCGA TATGTGGACC TATTCTGGTT AAATTAACGA
ACCGTTTTTC ATCAAGGCCT GTATTATTAT GGACATTACT TATATTTATC ATTGGTAATG
GCATTATTGC TGTAGCGCCA AATTTTTCaA TATTAGTAGT TGGTAGAATT ATCTCATCTG
CAGCAGCAGC ACTAATTATC GTAAAAGTAT TAGCTATTAC AGCGATGTTA TCAGCACCTA
AAAATCGTGG TAAAATGATT GGACTTGTCT ATACAGGGTT TAGTGGTGCT AATGTTTTTG 
GTGTACCAAT TGGAACGGTT ATCGGCGATT TAGTAGGTTG GCGCTATACA TTTCTATTCT 
TAATTATTGT GAGTATTATT GTTGGCTTCT TGATGATGAT CTATTTACCG AAGGATCAGG 
AAATACAACG AGGCCCTGTG AATCATGAGA CACCATCTCA TGAAAATCAT GTTACTTCGA 
CAAACTCAGT GACATTCGTC TTTATAAATC CACTTATTTT ATCTAATGGT CATGATATGT
CATTCGTTTC ATTAGCACTT CTAGTAAATG GAATCGCTGG CGTTATTGGA ACATCATTAG
GTGGTATATT CTCCGATAAA ATTACAAGTA AGCGTTGGTT AATGATTTCT GTTTCTATTT 
TTATCGTCAT GATGTTACTT ATGAATTTAA TCTTACCTGG TTCAGGTCTA TTGTTAGCAG
GACTATTTAT TTGGAATATC ATGCAATGGA GTACTAATCC AGCAGTGCAA AGCGGTGTGA 
TTCAACATGT TGAAGGCGAC ACAAGCCAAG TAATGAGTTG GAACATGTCT AGTTTAAACG
CTGGTATTGG TGTTGGAGGC ATTATTGGAG GCTTGGTCAT GACACATGTT TCTGTTCAAG 
CTATCACATA TACGAGTGCC ATCATTGGCG CATTAGGATT AATCGTTGTT TTCACATTGA 
AAAATAATCA TTATGCTAAA ACATTTAAAT CATCATAATT CTCATATGAm AAGCACGCCT 
GCTATCAAAT TCAGGTGTGC TTTTTTAGAT GCGATAACGT TATTGATATG TGCGATAATA 
GCGACGTTCA TTATGATACA TCGGCCAAGG CATTTTACCG CTTTTAGCAA AATTAGCTAA
ATCATTTTGC ATTTGTCGAC TTAAAAATTT AAGGTGaGCA GTTGTTGGaT ATgAT 
(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:
CGCAAAGAAG TACAAAAAAT GTTTTTACAA GAAGGTATTA AAACACCTCA ACCAATTATG 60
ACTGCTTATA ATCATAGTGA AAACGgTGTT TAGTAGTTTA TAATACATGG AGGTCATATT 120
TAATGGCGTC AAAATATGGA ATAAATGATA TAGTAGAAAT GAAAAAACAA CATGCGTGTG 180
GAACAAACCG TTTTAAGATT ATTAGAATGG GTGCAGACAT AAGAATTAAA TGTGAAAATT 240
GTCAAAGAAG TATTATGATT CCACGTCAAA CGTTTGATAA AAAACTTAAA AAAATCATCG 300
AATCTCATGA TGATACACAA AGATAGGAGA ATGATTAATG GCTTTAACAG CAGGTATCGT 360
TGGATTGCCA AACGTTGGTA AATCAACATT ATTTAATGCA ATAACAAAAG CAGGTGCTTT 420
AGCAGCGAAC TATCCATTCG CTACGATTGA TCCTAATGTA GGGATAGTAG AAGTGCCAGA 480
TGCTAGATTA CTTAAATTAG AAGAAATGGT TCAACCTAAA AAGACATTGC CGACTACATT 540
TGAATTTACA GATATCGCTG GTATTGTGAA AGGTGCTTCA AAGGGAGAAG GGTTAGGTAA 600
TAAATTCTTA TCACATATTA GAGAAGTAGA TGCGATTTGT CAGGTCGTTC GTGCATTTGA 660
AGCGCAAGAA GACTCTGAAG TGATTGTTAT TAGTGCAAAA ATTGAAGAAG AAATTGCTAC
ATTAGATGAT GAAGATAAAG AAATGTTCTT AGAAGaTTTA GGTATCGaAG AACCAGGATT 
AGATCgrTTA ATTAGGArnCA ctTATGAATT ATTAGGnTTA TCCACCATAA TT 
(2) INFORMATION FOR SEQ ID NO: 69: 



TCTAATCCAG GTAAGTATCG CGGGGAATAC ATTTTTTCTA AACGTTTAAA GTCTTTTGTA 
TCCA^TTAA TAAACGCCCC ATTTATTTTT CTCTATT^ 

GAAAATAAAA TATTGCACTG aTTTTCATTA ATTGATTTAA CCCTTAATTA AGATAGTTTT 
AAATTTTTTA TTAAGTAGAA AACAATTATT ACAGTTGATT TCATTACTGC AAACCACATA 
TAAATTTGTC GATTTTACTA CATAACATAG ATTATCATAG ATTCTTGAAT TTTTAGCAAA 
ATAACTGTTA TTTTCATTAT ATTTTTACAA AAAAAGGTTC GTTTTATATT TTATGCATCT 
TACTGTAACA GAATCATTAA GATATGCTAT TCGAATATAC TTTTTCAAAA TTTATATAAT 
GAATAAATTA ACATGTATTG AAAAAAAAGC GAAATGCAGC CTATCCTCTA ATGTAAACCA 
AACGATATAT CTCGTCAGAC TTTATATTTA AACGCTATGT GTCACTTTTA AAATGAATAT 
TACTAAGATT GTCATATCAA TTATTATTGC ATCGAATTAA TCTTTTAAAT TTCTGTAATA 



780 
840 
900 



TAATATGGAA TTAGTACTAG CGGACTTAGA ATCTGTTGAG AAACGTTTGC CTAGAATTGA 
AAAATTAGCA CGTCAAAAAG ATAAGACTGC TGAAATGGAA GTACGTATTT TAACAACTAT 
TAAAGAAGCT TTAGAAAATG GTAAACCCGC TCGTAGTATT GACTTTAATG AAGAAGATCA

AAAATGGGTG AATCAAGCGC AATTACTGAC TTCTAAAAAA ATGCTTTATA TCGCTAATGT 960 
TGGTGAAGAT GAAATTGGTG ATGATGATAA TGATAAAGTA AAAGCGATTC GTGAATATGC 



1140 
1192 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7494 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

AATATAGCTG CAATAGCATC TCGTTTCATT TGTATAATCA ATTCCGGTTT AAATATCAGT
GTGAACGTAA GCACGACACA GATTAAAAAT AACACTGCCG GAATGAGTCG TTTCAATCGT
CGCTtCCAAA ACTCTAGCAA ATCGATTTTT TGCGTCCGAT AATACTCACT TATCAACAAA
CTTGTTATTA AATAACCTGA AATAACGAAG AATGTATCTA CTCCTAAAAA GCCCCCACTT 
AACCATTGTG CATTCAAGTG ATAAATAATG ATTCCTATAA CTGCGAATGC CCTCAATCCA 



60 
120 
240
300 
ACGGAAGTCA TTATTAGAAT AAAAATACTG TGCACTAATA AATTTATCAA TTGTTCCTAA
ATAAATACCA TCGATATTTT GTTCTTTACA TGTCATTATA ACTTTATCTA AAAGTTTTTT 
ACCTATTTTT AAATTCCTAT AACCTTTATC AACAAACATT TTTTTAAGTG CAGACATATT 
ATTATCTAGT CTAATCAAAC CTATAGTACC AACAATATTT TGaTGATTGT TTATTGCAAG 
Q CCAAAATgCC CTCCATTATT CAAATAGTTA TGTTCGATGT TCTCCAAATC AGGTTGATCA 

TCTCTATCAA TTTTTATATa AATTCATTTT TTTGAATCGA TAAAATAAAC TCGATTAGCT 
CTTCCTTATA AGACCTATTA TATTCAATTA TGTTTATAGC CATTTTTATC TCCTTTTTCA 
>5 TTTAATTTAA TTATAAAATG TGCGTTTAGT TTGTATCTAG TGTACTCAGT ACAGCCTCAA 

ATGAAGTTTC ATTCCACTTG GCACTTAATA AAGACAAGTA TTTTAGCAGT AATACAATAA 
AGTCCAATAA ATTTCCCTAA CTTCAATATC CACTTTTTAA AAAATGTATT TTTAATTAAT 
20 AAAAAAACTC TCCCCAArrr CTATGGGAAG AGCTATATAT TTAATGTCTA AACATTACTT 

TTATTTATTA TGAAGGAATT AGAATCCCCA AGCACCTAAA CCTTGTGCTT TGTATG CTTT 
AACAGCTGCG TTGATTTGTT GGTCAACAGT GTTTGTTGGA CCCCAACCTG GCATAGTTTG 
" GAATAAACCT GAAGCACCTG ATGGGTTGTA AGCATTTACT TGACCATTTG ATTCACGAG C 

GATGATTGCA GCCCATGTAG AAGCTGAAAC ACCAGTACGT TGAGCCATGA TTTGAGCTGC 
^ TGATGAACCA GTAGCACCTG CAGTATTACC ATTGCTTAAT CTCACTGAAC TTGAAGTAGT 

TGAAGTGCTG TAGTTATGGT AAGTTGGAGC TGAAACAGCT TCAACGTt TG AGTTACTTGA 
TTGTGCATTG TAGCTTACTG ATTGTACATT TGAACCTTGG TTGTATGAAG TAGTGTAGTC 
^ TGCACCTGCA ACGTTTGAGA AACCAGCAGT TTGACCATTA GCTGCTTCAT AGCTCCATGA 

CCATGTAGTA CCATTTGAAG TGAAGTTATA TTGGAAACCA TCTTTTACAA AGTGGATGTC 
ATATGCACCA TCTTTGATTG GAGCTGCATT TAATTGATCT TGGTGATTAT GCGCTAAGTC 
- AACTAAGTCT GCTrGATCAA CGTTTACTTC AGCAGCGTGT GCTTGATGTC CTGTACCTGC 

TGCGTAACCT GTTACACCTA ATGCCACTGC TAATGATGAT GCCATAATTG TCTTTTTCAT 
AGTAAAAAAT CCTCCAGTAA TAATTGTnAG TTTATGTTTT TAGTAATTAT AtTTTGaATT 
TGAATGTCGT AGTgCAAGTT TAAATTGTCT TTTATTTCTT TCaACGGTAC TCACTATATC 
ACAaAAAACC AG CCAGTAAA TTACACTTTC TTTACAAAAC ATTACAATAT CAAGTGTTAT 
TTGtAATGTT GAAATATGGC TGTTTTATAC TGTAATGTGA AATATGTGCC CTTTAGAATC 
CAATCAACCC TTGAAATAGT CTTTAACACA TAAGATTTTT ACTATATTTA GCTCAACTAT 
TACAGCTTTC GTAATATTAC AGATTGTATT TTTGTTACAT AGCTGTAATA TATCTGACAT 
TACACATGTA TTGATTGCTA TTATTGTTGT ATATTCAAAG TTTTAAAACA CACATCTTTT
GTGAATTGTC TTATCTTTTA TTAGCGCAAA TAAACTGCAG CTCAATTATA TTGTTCAACT
TCATTCTCGC AATTCACAAT AACATTAAAT AATTTTTGGT CTCATATTTT CAAAAAACAT 
ACTGTTATTA TCCCATGAAT TTAAAAATAT CATTAGTATA TAAACGAAAC ACTTTACGAT 
AAATGATATC TGCAAGCCAA GCTGTTACAA ATGGTACAAC AAAGAACGCT ACTACAATTA 
GTAAGACACT CAACCAAGCA GAATCAACCT CCATAAATTT AAATGCATTA ATCGGTCCTA 
CCATTC CTAT AAAACCAAAT CCAG CTGACT CTTTCGTTCC ATGAATACCT ACTAATGCTG 
ATACCAAACC TGATACAATG GCTGTCGTTA ATATTGGTAA CATAAGAATT GGATATTTCA 
C CAT ATT AGG TATCATCATT TTAACGCCTC CAAAGAAGAC GGATAACGGC ACCCCTAAAC 
GATTCACTTT ACTTGTACCA ATTATCAATA CTGCTTCAGT CGCGGAGATA CCAATTGACG 
CTGATCCAGC TGCTAAACCT GTAATACCTA TCGCAAAGGC AATGG CCACA GTTGATAGTG 
GCGAAATAAT AATAAGACTA AATACCATTG AAATCAAAAT ACT CATGACA ATCGGTTGTA 
ATTCTGTAAA ACCATTAACC ATATTACCGA TGGCTGTTGT AATCATTTTC GTATACGGCA 
ATATTAAAAC ACCAATTGCA CCTGAAATAC CGCCAACAAC TGTTGGGAAT ACAATCAATG 
CCATACTACC TACGCGATGT TGAATAAGTA AAATGAATAA CACTGCAATC GCTGCTGTAA 
TCATTGTATT AATTAAATCA CCAATACCCG TAATCATCCA AGCACCATTT TTAAACTGCG 
CTGCACCGCT T C CT AC AT AT GCTGCACTTG CCACAACAGC AATTGCTAAT GGCGATAGGT 
CAAATTTCAT GGCAACCAAT GCACCAATCA AAGCAGGTAC TGTAAATTGA ATTGCAACGA 
CAACG CCTAA TAACGTTTTA AAAATCGGAT GATAATCCAT AAAGTATTTA AAAATTTCTC 
CAAGTATCGC ATTAGGAACT AAACCCGCAA CAATACCTAT GGCGACACCT GATAAAACTC 

taaStataaa ATCTTCGGGT GTAATTGTTT taattgatgt cataatatca- tccttccatt- 

TATGTATATA CATCTGTATG CAAATAATAA AGAGCCTTAA GTTATAAGCT GCCACTAGCT 
TAAATTCTAA GATGTGCATG CCGATGTTGT TATATTTAGG CTAGCAGTAT CATCTATAAC 
TCAAGACTAT GAAAAATAGT ATATCACAAA ATTCTGAATT TTTAGATAAA TAAATTGGCA 
ATTTTTCAAA CATATTGTTA CAATACACTT TTATTTTATC TTCATTTTTA AAATCCATTA 
ATACAATAGA AGAAAGACAT TCAAATGCTT ACCAAAAAGG TACATTATTT GTTAGGAGCG 
TATCAGCaCT TACATATCAT CAACACAATT GACAATATAA TAGAAGATAC TGATAATAAG 
50 TGTTAAAACA ACAGATGTTA GGTAGTGAAC AAATGATGGA AAGTAAATCC ATAGATCCAA 

GAATCGTTAG AACCAAACAA TTGCTTGTCG ATGCTTTTCT TAAAATTTCT AGAGAAAAGA 
TTTACGCTCA TTTCGCTGAT AAAGAAGACC TCCTAGACTA CACATTATCT GTAACCATTT 4620

TAAAAGACTT GAATGATAAT TTGAGCATTT CTAATGTCAT TAATGAAAAG GTTCTGCGTA 4680

ATATTTTCAT TTCAATTGCG AGTTATATCA AAGATGCTGC AAAGTCTTGC GAATTAAATA 4740

GTGAAGCATT TTGCAACAAA GCACATCAAC GTATTAATAA TGAATTAGAA GATATTTTTG 4800

CGATTATGTT AGAAAACAGC TATCCGGAGC ATCAACGAGA TATCATTGTA AATAGTGCGA 4860

GTTTTTTAGC AGCTGGTATC TCAGGCTTAG CATTACATTG GTTTAACACG AGTCAAGAGA 4920

CAGCCGATGT GTTTATCGAT CGCAACCTTC CATTTTTAAT TCATCATATA GCACATTTTT 4980

AATAAAACTT GGTATTTAGT CATGCATCTT GAAATCACTA TGTGACTTAG GTTCATACTT 5040

GTACACACAA TAAAATTTAA CGTATTACGA TTGATTAGCC GTGTCTAGGA CATAAATCAA 5100

CGTCCTATAC TCTACAATGT CATATTAGCA GTCGTTAACT GAATGAAAAT AAGCTTGTCA 5160

TTAAAACATA TAGATTTTAG TGACAAGCAT TTTTGTTTTT GCGTACTTAA ACAACACTTC 5220

AGGCAATATG TTGTTTAGGC AACAAATGAT ATGTGCGTGT TTATTGGCAA ACGTACGACA 5280

TAGTAGTATA GTATGTCTAA ACAACATATG TTGCATAGTT GATATGCGTT GTTTAAATAC 5340

TAAGATAGGA GGGATTGACG TGAGCGAGAC AGATGAACCT CAGGGGTTTG AACGCACGCA 5400

TAATATATTA AATATTAATC AGAGTAGTCT GGGTGTAGTG ACATACATTA CAAATAAATT 5460

AAAGTCGACG TTGAAGCAAC ACATAATAAT TGCTCGTGGT AAAAAGCGAA TCGACTATCG 5520

ACTGTCGTAT AACTTTTACA TACGTATTAT GATAATGTAG AAATCAAGAA AATCGACTGT 5580

GAATATACCT ATGCTATGCC CATTGCAATT TTAATAAGAC ACACGATGTC ATTCGACAAT 5640

GCTCATTTCT TTGCTCAGTT ACGTCATCCT GTCTTATAAA ACAACATTGC AGACATGTAT 5700
ATCAAACGAC ACTTCAATAA CATCACTTTG CCcATCGTAC TACTAGTAAA ATCGTGTCTC 
AAATCCCTTA TTTTAATTCC AAAAAtCTGC TGGTCAAAAG AGCGAGAAAC TAAAAACATT 
40 ACTTAATGTG TTGATAAATT ACCATATAAA AATAATCTCA AAATATATCA ACACTTGATT 
CTAAGGAGGA TATGACAATA TGAAAATTTT AGATAGAATT AATGAACTTG CAAATAAAGA 
AAAAGTACAA CCACTTACTG TAGCTGAAAA ACAAGAACAA CATGCATTGC GTCAAGAcTA 
CTTAAG C ATG ATCCGAGGAC AAGTATTAAC AACATTTTCC ACAATAAAAG TGGTTGATCC 
AATCGGTcAG GATGTCACAC CAGATAAAGT TTATGATCTT CGCCAACAAT ACGGTTATAT 
TCaAAATTAA tATTTGCTCA CGAGGTATTG CACTTAAGGT GCCAACTGAC CTCATAAACA 
AAGCCCATAC TGATTGAAGA CACTAATGTG tCsaCCATGG TGCACATTAC GCTTCATCTC 
TGTATGGGCT TTTTATTTAT TCTTTTGAGA ATTTCATTTT AGCAGACCAA AAAATTAAAA 
TGAACGACTG TGCCACCCGC TTCTTTCACT TTATTCACCA ACTGGTCAAC TTCTTCATTT 6420
GTGTTCACAC CTAGAGAAAT CATCACTTCA TTTGGTTCAG TATTAAGGCT TTGCTGACTT 6480
ACATTTTGAA AATGCTTGTn TTCTATTAAA ATTACGGkTG tTTGACCTAT tTGAATGCCG 6540
ACCATTTTAT CTAACATTTG TGGGTTTCTA TTTATTTTAA ATCCTAACGC TTTATAAAAC 6600
TGTGCGCTCT TTTCTAAATC TTGCACATGC AAATTAAACC ACATTGATTG AATCATGATT 6660
GCACCCCATT CATTACTTAT TATAGTTTTG GACTTTAAGC CAATCACTTA ATGATAATCT 6720
TGTTGGATTT ATTTCAGCCA TTAATTCAAA GTCTACTTCA TAACCTTTTT CTTCCAACCA 6780
TTGCTTTTCT GCAACACCAC TAACAAATTC TCCTTCTATA ACAGTAGATT TACCTGTCAC 6840
TTCACTAAAA ATTGTTGCTG CTTCACTTAA TGTAACTTCA TCGGAACCAA TCTCTATTGA 6900
TTGATGCGTA AAGCTTTGTG GATGTGCAAA AATATACGAT GCAATTTTAG CTATATCAAT 6960
AGAAGAAATC ATTGTGAATT TTATATTCGG ATTAATAAAT TCTGGTAATG TAATACGTTC 7020
ATCTTCGACT TTAGCAATGC GTAAAAAATT ATCCATAAAG AATGATGGTT TGATAACTGT 7080
TGCATTTATA TTAGATTCCA TTAATCTATT TTCTATTTTT GCTAGTACTT CAAAGTGTGG 7140
GCCAGTTCGA TTTCGATTAA CCCCTCCCGC AGTACTATAC ACAATATGTT GAATATTTTC 7200
TTGCTCAGCT ATTTCAATTA TCTTCATACC TTGTCTTAAT TCTTCGCTAA CATCATCTTT 7260
AACGATTGGC TGAATACTGT ATAAGCCATA CTTACCTTTC ATCGCTGATT GCAAACTAAC 7320
ATTATCACTC AGATCACCTT CArCGATTGA TAAATGCGGA TGTCCTATGT CTGAAAGTTT 7380
ACGATTATnC TTATTTCTAG TTAATGCACT TACATACCAT CCATCCTCTA ACAACTGTTT 7440
TACAACTGCA TTACCTTGCT TCCCTGTTGC GCCTATTACn AAAATATCTT TCAT 7494

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

AATTTATTTC GCCGTCCCAC CCCAACTTGC ATTGTCTGTA GAAATTGGGA ATCCAATTTC 60 

TCTTTGTTGG GGCCCcGCCC CAACTCGCAT TGCCTGTAGA ATTTCTTTTC GAAATTCTCT 120

GTGTTGGGGC CCCTGACTAG AATTGAAAAA AGCTTATTAC AAGCGCATTT TCGTTCAGTC 180 

AATTACTGCC AATATAACTT CGTAGATCAT AGAACATTGA TTTATTTCCC AGCCTATTCT 240
AGCAAAGGTA ATAATGATAT TAATAATGTA CAAAAAATAT AAATCAAATC GACATCCTTA
TAAAACATCA GAACCACTAA AAACAAAAAA GCACAAAATA AAATTAAATT TAAAATAAAC 
GACCACTTTT CAAAAAAATC TCtTTTCaTa TTTCCACCCC TAATTTTAAT AAGCATTATT 
TTATATTCTC TTTTAAGTTT ATTATTCAAA AGGAAAACAG AAATATCTTT CaATATTATT 
ATAAACATTT CAACTACTTT TAAAAACCAA CAAAAAAATA CTTATTTTAA GT AGATG AG C 
ATAAGTGAAC ATAGTTCTTT AGTTATAATA ATTAATTCAA CCAAAAGTCG ATTTGTTTTT 
GCAATTGGTT TTCATTTCCT CTTAAAGATA TTTTCATTAA ATCTGTCAAA TCAATAGACG 
,5 CTATATTTTT CAACTTATCT CTATATTTAT TTTTAGTACG TCTTTCTAAA TTTCCCCATT 

CCTCTTCTTC GTGAGTTAAT AAATGAAGCA TTGCTCGTTC TTGTATATTT TCAATCATTT 
TTAAATTCGG TTTTAAAATA TGCAAATCAT CAAAACAATC TTTCCAACAA TCAACCATAT 
CTCGTTTTAA TTCAATTTCC ACACGCCATA GAAATGTTGA ATCAATTTCA ACATCTGCAT 
TATCTTTACG TTCTTGTTTT TATTATAAAT CCGAATAAAC CTATCACTAT TACGCACACC 
25 AAAATATTTT GTTTCTGGTT TTACATTACG TCCATAAAAT ATAGTTTTCT TTACCGACTT 

ATCTGACAAT G CAT AATAGT CATTTAAATC AAATTCAAAA TCAAAAGCCA AATCTAATCT 
CGTAAAACTA ACATCGTCCA AATAACTGAT GATATTTTGT TTTAACCAAA GCACTTCATC 
„ ATGCGAAAGC TTATTAGGAT TAAATTCAAC GCGCATAtAC GTCTATTCCA AAGAGTTGCT 

TTTATTTTGT CATATTCAAT ATAAACTTTT TCTTTAAGAG CTTTAGCTTT AAAGTTTGTT 
TGTAAAATAT CCCAAAGCCG AATTTCAGGA TTAGTACTCA TAAAATGTGA AAGTCTCTCT 
55 GCGTTAGACA TGCTAAGATT CCCAACAATC GTTATAGCGT CAAAAGACAA TTTTGGAATA 

GCTAGTGACA TCCTATGTCG ATTTAACCGG CTATTACCGG ATATTAGAGT ATCCAGTTTT 
ACAAATGGAT GAAACGAAAT TCAAAACACT AAAAAATATG TTCCACTAAC AGCAAAAAAA 

40 

TACCATTATG TTCCTACTAA AAAACyAAAA ATACTGGAGA ACAAATGTCA GGATATAACT 
TAGGATACTA TGTAATAAAA ATTTACAATA AAAAAACAGG AAAACAAATT TCAAGTAAAA 

45 GmATACCCAT ACAAAGAGGA TAAAATAAAA AACCTCGAAC TGaAATGATG ATCTTTTCAG 
CTCGAGGTTT AAATATTGGT GCCTTATTTA TATAGATTCG TTATATTATA TTCTCTATTT 
TCATTAACmT AATCCTTAAA GAGTTTTAAA TTAATACCTG CTAGATGATT CAAAAATGTT 

so TCATCAACTT TTAAATAATT CAATAATTTT TGTGGTGTCA GTAAATnTCT ATCAAAATAC 
AACTTTAATA AACTATTCAT TTTGACAGGA CGTGACATTT CAATCACGTC GTCTAAAGAT 
AATACTTTCT CGCTTTAnAC AAAnACAAAA ACTTACCCGA TTAAAATCAA GTAAGTTTTA 
TATTTGATAA AAAATCAATA AGTAATTGTG CGCCTTCAAC TTGAATATCT TTTACAACTG 2160

GCGCGTCGAT ATACATATCA TACTGACCAC CGCCTACTGC ACGATAATTA TTTACACAAA 2220 

5 

TTGTATATGT CTGCTTTAAA TCAACTGCGT GACCTTGAAT CATCATATTG CTCACACGTT 2 2 BO 

GTCCCTTTGG TCTTCCAACA TGAATGGTAT AACTTACGCC ACCATATATA TCATAATTAA 2340 

w AGTGTTGTGG TTTGGGTTCA AGGAAGTCTG CGCTCACACT AACTTCATCA TTTTTCACGT 24 00 

CAAAATATTC TGCTGATCGT TCAATGGCTT CTTTAAGTTT GGCACCACTT AC AG CTAAAA 24 60 

CTTTAAATGT ATTTGGAAAT GGGTAATTGT TAATAACATC TCGCATCGTC ACGACTTGCT 2520 

15 TGAAACCACT AGCAGAATCA AACAAAGCTG TACAGGCAAC ATCTGCGTCA CTTTTTTCTA 2580 

ATAAAGCGTA ATTCATAAAA TTTGTAAAAG GATGCGGTGC CACACGTGCC TCAAATGCAT 264 0 

GATTAATCGT CATATCATAT GGCAATGTAG TAATTTCGTA ATCTAACCAG TCCTCTAACT 2700 

20 

GCTTTCGTAA AXGXXGGTCA XCTTCATCAA TAG TAAATGT GGAATCATCT ATAACAGGAA 2760 

GTAATTCACA TGATTCAACG GATAGATTTT CAT ATT CATC AGTACTCAAG ACTACTCTGC 2820 

CTACAGTTGT ACCTCTCGTA CCAGGTTGAA TCACAGCCGT TTGCTTAAAC CTTTCAGCAA 2880 

25 

TTTGTCGATG TTGGTGACCC GTAATAAAGA TATCTATATC TTTAGAAAAC GCTTCTAACA 2 94 0 

TGGCATATCC TTCATTTTCA CCCGTTAATA CTTCGGTCGG CGTACCACTT TCTAAATCCT 3 000 

30 TTTCAAATCC ACCATGGTAA CAAACCACAA TGATATCTGC ATGTCGCTTC ATTTCAGGTA 3060 

AGTATTGTTG AAGTATTTCA AAAGCACTAT GAAACGTArT GnCnTGAATA TGCTCTGGTT 3120 

GTTCCCAATG GGGAATAAAT TGTGTCGTTA AACCTATCAC ACCAACAGTT TGATCTCCAA 3180 

35 CCTGAAAATA CTTCACACCG TTATCAGTCA ATGTACTATC ATTTTCATAT ATATTAGCGC 3240 

ACAAAACTGG ATAATTGAGT CTGCGTAAAG -TGTCTTTTAA GTATGGTAAT CCATAATTAA 3300 

ATTCATGATT ACCAAGCGTA CCAAAGTCGA ATGCCATTCG ATTATAAAAA TCAACTAAAG 3360 

40 

GCTGGCTACT GCCGCTATGC GCGATTAAGT AATTACAAAA TGGTGACCCT TGCAAAAAAT 34 20 

CACCATTATC TATTTTAAAA CTTTGGTCAT ACTGCCTTCT GTsTTGTTCT ATAACATGAT 34 60 

TCGCTAGTAA CAATCCCATA GGTTGATATT GATTTCTACT CGTAAAATCT GTTGGGAAAA 3540 

45 

TATAACCATG TACGTCACTC ACGACATAAA ATGCTATGTT TGACATCCTC ACTCACTCCT 3600 

TCAATCACAA ACATCTTTCT TATTTCTATT ATATATTTAT TTGAAGTCTG TTGTAATCAA 3660 

so GGTTTTGTCA CCGAGTTTTA AACGAATCTT TGAACCTTCC ATACTTTCAA GTACTTTAGC 3720 

ATTGACCTTA ATTGTGACAT TTCCX5TTTTC ATCTGCTTTA ACTGTTGGCA AAGTACTGTA 3780 

ACCTGGTGGG TTATAATCGT TATCTTTACT TGAAAATTGT CCGATTTGAC GTCOGCCTTC 3840 
TATTGTCATT TCAAA^OT cm,*** ^

^CTOA TTTGGTGGTG TATGATCATC „ m GGCTGAGGAT ^ 

TTCGCTGCCA TAACTACCTG C^TAAATGT TCTTGGATCA TACCATTTAT AACCACTCGG < 0 

COOTT^AC CA^GCTCTr ^CAGGCTC AGTT^ACGC I ™ GTC0 „ „ 

CAAC.AGTC TTTGTATCTA ATGTTAGGCT ACTCGCCTTA A^AXTTCC CATCATTATC „ 

-agacatc caagccg™ T a TOTTT aa tag OT accg tot ™„ „ 

ATCATATGTT CTCCATTATC TTCTCTTACA TAT^CO AACTATCTTC „ 

CACAAGTCAT GAATCACCGA TAAATGCTGC mjcmrr 

iraCCTTTr CCAACTTTAO AAATTCCTAC 431 

ATAGGGGCCT TCTCOTT AC CGCCCCCA^ ATAAATACCT TCATCT ACAG CA TCTC a C GA „, 

GCTC ™ ™™ «™ CC™ OT TCTCTGGATT 45 0 
AGTAATTGCT AATGTCGATC CGGCATGCAT AGAGACAGAT ^CACACCT. 

CAAAC^CT TTTGAAGAAA CAATATTGCT GGTATTTAAA TCACCTAGTG CATTATATCG 462 

A«ACGTACG CCAAAG™ TAGATAACCA ATCXGAACXT TTCACACCTT GCAXxgCAGT „. 

AGAACTTTTT TCTTCTGCAT TCATACCTTT CGACAXAXCX TCATATCCTC CACGTCGATA 47 < 

ACCATTCAXT GCCTCCGATG AATCAATACG ATTTAAATTT CGGXCAGCAX TGTAATGATC < 80 , 

XGAAATAAAG ACAACAXXGC CACCTTGTTt CACATAXTTA ACAATTGCTG CCXGXXCXGA „«, 

TTCTTTGAAA GGAATGTTAG CCTCAGGAAT TACAAATATT TTGGAACTTT TCAAACXXGC mi 

™™ TTCGAATGAC CATCAATAGC TTTAACGTCA TAACC^ ^A^A „ t( 

-CCCCATAA TCTGAAAATG CACCATCAOT AACCCAATCT GCAGCACCAG CTGTTTGACC SO<C 

ATGAGAACGA XCGAA.AATA CC^CCCXG XTGCT^XA OG^CGAXX CATGCGTTAT 5100 

AGCXAAAGAX TGOGGTAAAG CACTTAATGA TACCGTTGCA ACAATTGCAG AGACAGTTAA „„ 

T^ACTTATAT ATTTTTTTCA TTTTCTGAGG CXCCXTTTAA AATAAATTTG TTCTTGAATT 52J0 

ATAGGATAAA AATTCGTTGC ATATGAGCAA TTTAACGAAA AATTTACAAA ATCTTATCAA 52 »0 

ACTCTTAAAG AAAG.TA™ AAATTCAXTT TTATAAAATA CH^AACA TTTAAATGTG 5340 

CTACGCTATA AGTGTAATTT CATTGCATAC ATATTACACG ATTAAGAATG TGAAGGGGAC Moe 

AGTTATCAAA TGAAAAATTT XAAGTGTTTA TTTGTATTAA TGTTAGCAGT CA™^ m .„ 

- GCAGCAGCAT G^CXC AAGTTC^A GATAATCAAA AGAACGCTAG TAA™ 5520 

GATTCTAAAT ' CAGGAGGATA CAAACCTAAA GAATTAACCG TTCAATTTGT ACCX^CAA 558 0 

55 C ™»« TAAAGCAAAA CCA^AGAAA AATTACTATC TAAAGAATTA S«„ 
TCTAAAAAAG TTGATGTTGG TTTCTTACCA CCAACGGCAT ACACATTAGC ACATGATCAA 5760

AAAGCAGCTG ATTTATTATT ACAAGCACAA CGTTTCGGTG TAAAAGAAGA TGGTTCAGCA 5820 

AGTAAAGAAC TTGTAGATAG TTATAAATCA GAAATTCTTG TTAAAAAAGA CTCAAAAATT 5880 

AAAAGCTTGA AAGATTTAAA AGGTAAGAAA ATTGCCTTAC AAGATGTAAC ATCAACTGCT 594 0 

GGATATACAT TCCCACTTGC GATGTTAAAA AACGAAGCAG GTATTAATGC AACTAAAGAT 6000 

ATGAAAATTG TGAATGTTAA AGGTCATGAC CAAGCAGTTA TCTCATTATT AAATGGAGAt 6 060 

GTAGATGCTG CGGCTGTATT TAACGATGCA CGTAATACTG TGAAAAAAGA CCAACCAAAT 612 0 

GTATTTAAAG ACACACGAAT TTTAAAATTA ACACAAGCTA TTCCGAATGA CACAATTTCT 6180 

GTAAGACCAG ATATGGATAA AGATTTTCAA GAAAAATTGA AAAAAGCTTT TATAGACATT 624 0 

GCTAAATCAA AAGAAGGTCA CAAAATTATT AGCGAAGTTT ATTCACATGA AGGATACACA 63 00 

GAAACGAAAG ATTCAAATTT CGACATTGTA AGAGAGTACG AAAAATTAGT TAAAGATATG 6360 

AAATAATCAT TATTTAACAA ATGAATCATT AGCGAATTTG GTATTAAAAG CTTTCGTTCA 64 20 

ATAGATATAT T CT AG ATT AA TATTGAAAAG CTAGGCGCTA AACTGAAACA GATATAGAAA 64 80 

GGTGTCGCTG TACATTTGAA ACCATTTGTA CACAGAAACC CAATGTCTAT GATATTTCAG 654 0 

TTTACCTTGG CTTTTCTTTA TTAAAGAAAG GTGTCAAACA TGAGTCAAAT CGAATTTAAA 6600 

AACGTCAGTA AAGTCTATCC TAACGGTCAT GTAGGCTTGA AAAATATTAA CTTAAATATT 6 660 

GAAAAAGGTG AATTTGCAGT TATTGTCGGA CTATCTGGTG CTGGGAAATC CACGTTATTA 6720 

AGATCTGTAA ATCGTTTGCA TGATATCACG TCAGGTGAAA TTTTCATCCA AGGTAAATCA 6780 

ATCACTAAAG CCCATGGTAA AG CATT ATT A GAAATGCGCC GAAATATAGG TATGATTTTC 684 0 

_ CAACATTTTA ATTTAGTTAA. ACGGTCAAGT GTATTACGAA ATGTACTAAG TGGACGTGTA_ 690 °^ 

GGTTATCACC CTACTTGGAA AATGGTATTA GGTTTATTCC CAAAAGAAGA CAAAATTAAG 6960 

GCAATGGATG CACTAGAACG CGTCAATATC TTAGATAAAT ATAATCAACG CTCTGATGAA 7020 

TTATCAGGTG GCCAACAACA ACGTATATCT ATTGCACGTG CGCTATG CCA AGAATCTGAA 7080 

ATTATTCTTG CAGATGAACC AGTTGCTTCA TTAGACCCAT TAACTACGAA ACAGGTTATG 714 0 

GATGATTTAA GAAAAATCAA CCAAGAATTA GGCATCACAA TTTTAATTAA TTTACATTTT 7200 

GTTGACTTGG CAAAAGAATA TGGCACACGC ATCATTGGTT TACGTGATGG TGAAGTTGTC 7260 

TATGATGGTC CTGCATCTGA AGCAACAGAT GACGTATTTA GTGAAATATA TGGACGTACA 7320 

ATTAAAGAAG ATGAAAAGCT AGGAGTGAAC TAACATGCCT TTAGAAATAC CTACAAAGTA 7380 

TGACTCCCTT TTAAAGAAAA AGGTTTCTTT AAAAACGAGT TTTACCTTCA TGTTAATCAT 744 0 






EP 0 786 519 A2 



AATACCTCAA ATAGGTGATC TATTCAAACA AATGATTCCA CCTGATTTCG AGTATTTACA 
ACAAATTACA ACGCCAATGT TAGATACCAT TCGAATGGcT ATCGTAAGTA CAGTATTAGG 
TAGCATCGTT TCAATACCAA TTGCGTTATT ATGTGCTAGC AATATCGTTC ATCAAAAGTG 
GATTTCAATA CCCTCGCGCT TTATTTTAAA TATAGTTCGT ACTATTCCAG ATTTGTTATT 



7560 
7620 
7680 
7740 

AGCAGCAATC TTTGTGGCTG TATTTGGAAT CGGTCAAATT CCAGGGATAT TAG CACTGTT 7800 
TATTTTAACT ATCTGTATTA TTGGAAAATT ATTATATGAA TCATTGGAAA CGATAGATCC 
AGGTCCAATG GAAGCAATGA CGGCTGTTGG CGCTAATAAA ATAAAATGGA TTGTTTTCGG 
TGTTGTACCA CAAGCCATAT CGTCATTTAT GTCATACGTA TTATATG CAT TTGAAGTAAA 
TATACGTGCT TCAGCTGTGC TTGGATTAGT CGGCGCTGGC GGTATTGGAT TGTTTTATGA 
TCAAACACTT GGTTTATTTC AATATCCAAA AACAGCAACG ATTATTTTAT TTACTTTAGT 
TATCGTCGTC GTCATTGATT ACATCAGTAC GAAAGTGAGG GCACATCTCG CATGACACAG 
GAAATAGCAA AATATAATGT TCACACAAAA GCACACAAAC GAAAATTGAT TAAAAGATGG 
CTTATTGCAA TTGTCGTCTT AGCTATTATC ATCTGGG CAT TTGCAGGTGT ACCAAGTTTA 8280 
GAACTTAAAA GTAAATCATT AGAAATCTTA AAAT C CAT AT TCAGCGGATT ATTCCATCCT 
GATATCAGCT ATATCTATAT ACCAGATGGC GAAGACTTAT TACGTGGTTT ACTTGAAACC 
TTTGCGATAG CCGTTGTAGG TACTTTCATC GCCGCAATTA TCTGTATTCC ATTAGCATTT 
CTAGGTGCAA ATAATATGGT AAAGCTACGC CCAGTTTCAG GTGTTAGCAA ATTTATTTTA 
AGTGTTATAC GTGTCTTCCC AGAAATTGTA ATGGCACTTA TATTTATCAA AGCTGTTGGC 
CCAGGTTCAT TTTCAGGTGT ATTAGCTTTA GGTATCCATT CCGTAGtATG CTTGGGAAAC 
TTTTAGCTGA AGATATTGAA GGTCTAGATT TCAGTGCTGT AGAATCATTA AAGGCCAGTG 
GTGCEAATAA GATTAAAACA CTCGTATTTG CAGTCATACC ACAAATTATG CCTGCCTTTC 
TATCACTCAT ACTTTATCG C TTTGAACTAA ACTTACGTTC AGCTTCTATA CTGGGGCTAA 
TTGGGGCTGG TGGTATCGGG ACACCACTCA TATTTGC CAT TCAAACACGT TCTTGGGACC 
GTGTAGGTAT TATATTAATC GGTTTAGTAC TAATGGTCGC AATTGTCGAT TTAATTTCCG 
GTTCAATCCG AAAACGTATT GTTTAACATT AAATCAGGAT ACTCCTAAAT AAGAAGTCCT 
ACCGTCTTAC GTTTCTCTAT TATAATAAAA ACAGCAGTGA AGAAAACTAT TGTTATAGTT 
AACTTCACTG CTGTTTTTAT AATATCTAAA TTTATTCTAT TTCAATTCCT TTAAATAACT 
TTTACCGAAC TCTGGTAATG TTACGTTGAA ATTATCTGCT ATAGTTG CAC CGATAGAACT 
GAATGTAGTA TCACTTTCTA GTGCATGACC ACCTTTAAAT TTCGGACTGT ACATAATTAC 



7860 
7920 
7980 
8040 
8100 
8160 
8220 



8340 
8400 



9060 
9120 
9180 
9240 
TGTAATAATT ACTAAATCGT CTTCTTTTAA GTTGCTAAAC AGTTCTGGCA AGCGATCATC 9360

GAAATCTTTA ATTGCTTGTG CATAACCTGG TTTATCACGA CGATGACCGT ATAATGCATC 9420 

AAAGTCTACT AAGTTTAAGA AG CTAATACC TGTGaAATCT TTCTTAACAA TTTTCATCAA 94 80 

TTGATCCATA CCGTCCATGT TACTCTTCGT ACGAACCGCT TCTGTTACAC CTTCACCATC 954 0 

ATAAATGTCA TTAATTTTAC CGATGGCAAT AACATCATAA CCACCGTCTT TCAAATGATC 96 00 

TAAGACAGTT TTACCAAAAG GTTTTAACGC ATAGTCATGT CGATTAGATG TACGTGTAAA 966 0 

GTTTCCTGGT T CAC CAACAT ATGGACGTGC GATAATACGA CCAATTAAAT ATTTAGGGTC 9720 

TTTTGTCAAC TCACGAACCT TTTCACAAAT ATCATATAAC TCTTCTAATG GGATAATGTC 9780 

TTCATGTGCA GCAATTTGCA ATACTGGGTC TGCACTTGTA TAAACAATTA AGTCACCAGT 984 0 

TTTCATTTGG TGCTCGCCCC ACTCATCGAT AATTTG CGT A CCCGATGCCG GTTTGTTAGC 9900 

AACAACTTTA CGACCTGTCA TTTCTTCAAT TTGTTGAATT AACTCTTCAG GGAATCCATT 9 960 

AGGGTATACT TTAAAAGGTT GCATAATATT TAATCCCATA ATTTCCCAGT GACCAGTCAT 10020 

TGTATCTTTA CCAACTGAAG CTTCACTCAA TTTAGTATAG TATGCTTCTG GTTGTTCAAC 10080 

TGCATTTACT ACTGGTAATT TAT CGATGTT CCCTAGACCT AACTTTTCAA GGTTTGGTAA 1014 0 

AGTTTGATCG AAACCTTCTA AGGTATGTCT TAAAGTATGT GAACCTTCAT CTTTAAAATC 10200 

AGCTGCGTCT GGCGCTT CAC CAATACCTAC TGAATC CATT ACGATTAAAT GTACACGATT 10260 

AAATGGTCTT GTCATAGCTA TCACTCCCAA AATTTATATA TATTAGTAAT CTGAATCTGC 10320 

TTCTAAACCT TGCATAATTT GAACACCTGC GCTCGCACCA ATACGTGTCG CACCTGCTTC 10380 

AACCATTTTA TTGAAATCTT CTAAATTACG TACGCCACCT GATGCTTTTA CTTCTACATC 10440 

AG CAC CTACT GTATCTTTCA TTAATTTAAC GTCTTCTGCA GTCGCACCGC CACCTGCAAA 10500 

ACCTGTTGAA GTTTT^ACGJ^AGTCCGCACC^ ACTCCGCTTTT GTTTU^TTCAC TCGCTTTTAC ~10560~ 

AATTTCGTCA TGGTCCAACA ATACCGTCTC AATAATCACT TTTACTGTGT GACCTTTCGC 10620 

AGCTTTAACC ACTGCTTCAA TGTCTTGTTG TACATCATCA AAACGTCCAT CTTTTAATGC 10680 

GCCGATGTTG ATGACCATGT CAATTT CAT C TGCACCATTT TGAATTGCAT CTTCTGTTTC 10740 

AAATGCTTTC GTTGCAGTTG TCGACGCACC TAATGGGAAT CCTATTACCG TACAAACGAG 10800 

CACCTCTGAA TCAGCTAGTC GCTCTGCTGC ATATTTAACA TGTGTTGGAT TCACACATAC 10860 

AGATTTAAAA TTGTATGctT TCGCTTCATC GATGATTTGA TCGATTTGCG TACGTGTTGA 10920 

CTCAGGCTTC AATAAAGTGT GATCTATATA TTTCTCAAAT TTCATACTTA CTACTCCTCG 10980 

TGTTATATAA TCTCTTTATT TAATTTTACT ATAAATACGA ATATATCTCG CGAATTTATA 11040 
ATACTCATTA AACCTAAAAT AATTAAAATA ATACCGAAAT GTGAACTTAA TGCATCATTG
CCTGGGAAAT TTAATGCTTT AAAATCGATT AGAGCCGCAG CAATCGCAAT ACCTACAGAT 

5 

ACCGCCACAT TAATAATTAA ATTATAAAAA CCAATAGCCA CACCTGTCAT ATTAAGATCT 
ATTGTTTTAA TGGCTTCGTT AAGTAAAGGT GCATACATTA AAGCAAAGCT ACCTGCAAAG 
, 0 AATATCATAG AAATGACGAA GATTGAAATG TG ATT AC CT A CTGCAAATGC AGGTAAAATC 

AAG CTCAGTG CTATTAAAAT AATTGCTGTG ATAATCG CTT GTTTTGAATT CAGATATTCG 
CCGATTTTAC CACTTAGTGC ACCAACAATG ACTGCTACTA TATAACCCGG TACTAATAAC 
,5 AGTGATGTTG TGTCTAGTTG CAGATGATAA ATTTGCTCCA TTATGAATGG GAACGTAAAA 

ATATAACCCA ATTGGATAGC ATACATTACA AATACTATAA ATAAAAATGA AGCATAACGT 
TTATTTTGGA AAAATGATTT ATTTACTAAT GGACGTTGCG CATTTTTAAT ATATAG CGCA 
AAAACGATAA TCGCAATTAA GGCACCAATC ATATATAACC AATTAAAGTT CGTAATAAAC 
AGCATGACTG TTGTAGCAGG GGATCCTCTA GAGTCGAnCC TG 
(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

CTAAAGAAGA TGCGAAACAA GATGTTGATA AACAAGTTCA AGCTTTAATT GACGAAATCG 60

ATCAAAATCC AAATCTAACA GATAAGGAAA AACAAGCACT TAAAGATCGT ATTAATCAAA 120 

TACTTCAACA AGGTCATAAC GACATTAACA ATGCGATGAC AAAAGAAGCA ATTGAACAAG 180 

CAAAAGAACG TTTAGCGCAA gCATTGCAAG ACATCAAAGA TTTAGTGAAA GCTAAAGAAG 240

ATGCGAAAAA TGATATTGAT AAACGTGTAC AAGCTTTAAT TGACGAAATC GATCAAAATC 300 

CAAATCTAAC AGATAAGGAA AAACAAGCAC TTAAAGATCG AATTAATCAA ATACTTCAAC 360 

AAGGTCATAA CGACATTAAC AATGCGCTGA CTAAAGAAGA AATTGAGCAG GCAAAAGCAC 420 

AACTTGCACA AGCATTGCAA GACATCAAAG ATTTAGTGAA AGCTAAAGAA GATGCGAAAA 480 

ATGCAATAAA AGCCTTAGCT AATGCGAAgc GTGATCAAAT CAATTCAAAT CCAGATTTAA 540

CACCTGAGCA AAAAGCAAAA GCGCTCAAAG AAATTGACGA AGCTGAAAAA CGAGCACTAC 600 

AAAACGTTGA GAATGCTCAA ACTATAGATC AATTAAATCG AGGATTAAAC TTAGGTTTAG 660 






TTGAAGCAAC ACCTGAGCAA ATCCTAGTTA ATGGTGAACT CATTGTACAT CGTGATGACA 780 

TCATTACAGA ACAAGATATT CTTGCACACA TAAACTTAAT TGATCAGCTT TCAGCAGAAG 840

TCATCGATAC ACCATCAACT GCAACGATTT CTGATAGCTT AACAGCAAAA GTTGAAGTTA 900

CATTGCTTGA TGGATCAAAA GTGATTGTTA ATGTTCCTGT AAAAGTTGTA GAAAAAGAAT 960

TGTCAGTAGT CAAACAACAG GCAATTGAaT CAATCGAAAA TGCGGCACAA CAAAAGATTA 1020

ATGAAATCAA TAATAGTGTG ACATTAACAC TGGAACAAAA AGAAGCTGCA ATTGCGnAAG 1080

TTAATAAGCT TAAACAACAA GCAATTGGAT CATGTTnAAC AATGGCACCT GGATGTTCCA 1140

TTCAGTTGAA GGAAATTTCA ACAACAAGGA ACAAGCGCCn GATTGGAACA ATTTGA 1196 


(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1519 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

CAATCGTTTC AACGCTATTA TCTTTAGACA ACAATTGTAA GCGTGTATGT GCAGTTTCTA 60 

AACAGTCTAT AATTCGAGTT CTTAATTCAG CTGGATCATC TTTAAAAATA AAATCCATCG 120

CTGCAACTTT GTAGACAAAT GTTAAATAGG TAAGTTCACT GTGACTCGTA ACGAAAATAA 180 

TGTTACCAAC TGGGTCATGC TTACGAATTT CACTGCCTAA TTTGATACCA TTAATATCAG 240

TTGAAAGTTG AATATCTAAA AAGTAACAGC CTATGTCATT CATATTTTTA GCTTGCTCAA 300 

GCACCTCATA AGGATTATCA GTTGCGAGGG CAATTTCCAT AGGCTTTTCT TCTATCATTA 360 

- TAT^TTTTT~AAT AATGGTA ACCATGTTTT CTCTTTGTTT" TGGATCGTCT TCGCAAATGA 420 

AAATTTTCAT ACATTCACAT CCTTATGGCT AGTTGTTAAT AATTTCAACT TTTTGAATAA 480 

AGAAACCATT TTCGATAATT GTATCTAATA AGACATTGTC TGCATTATCA GCAATTTCTT 540 

TTAAAGTTGA TAGACCTAAA CCACGACCTT CACCTTTAGT AGAAAAACTT TCTTGGAACA 600 

ATTCATGAAT GCGTGGTATA TCATCAGCGC ATTTATTCAT AACAATAAAC GTTACTGAAT 660

TTTCACTTTC AATAAATGCA ACGCGAATGA TAGGGTCATC AATTTCAGTT GATGCCTCAA 720 

TTGCATTATC AAGAATAATA CCAATACTGC GACTTAAATC GATCATATTC AAGTTAATGC 780 


TACTTACTTC ATCGGGTATT TCGATACTAA TCGGAATATT CATTTCTTGT GCACGTAAAA 840

TTTTCGCAGT AATTAAGCCT TTAATTTCAC GTACTTTAAG ATTCTCGATA CCATTTAATT 900 

GTAGGCCAGG CATGTCATCT TCTCGAATGT ATTCTGAAAG TGTCGTTAAG ATATTGACAT 1020
AATCATGACG GAACTTGCGC ATTTCGTTGT TGATAGCTTC AATCTTCAAT GTATATTCAT 1080
AATAGGTTTC AATTTCTTCT TGATTACGTT TATATTTCAT CTCTTTAAGG AGAAATTGAG 1140
AAATAACAAA TGTTAATATA CTTAAAAATA TAGTGATACC AATAAAAATA AAAGAATACT 1200
GCCTTATTAC TTTAGCTTCA TCCGAGTTTA TTTGTGAATA AAAGAAAAAT AATGAAAAAG 1260
TAAGCAGTAA GATAGTCGAA ATAACTATTA AAAATCCTTT GTTTAGTATT AGATATGGTG 1320
TGCTAATTTT TTTGAGAACT CTATTTATTA TATATGAGAA TAGTATACTA ATAGTCACAT 1380
AAACTACAAA AAAGCTAGGG AATATTACAA ATATACTATC AGAAATTTTG GTGGATATAT 1440
GCATATATAA CTATATACCT GTAGTTAGCA CnGTnATAGG AATAATCnGG CGAGGTCCAT 1500
AATCCACCAA AATAGAATA 1519


(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5445 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

GTAGGAATCT CTTTGTCTTT TTGGGAGGAC ATTTAATATG AATGTATATT TAGCAGAATT 60

CCTAGGAACT GCAATCTTAA TccrrrrrGG TGGTGGCGTT TGTGCCAATG TCAATTTAAA 120

GAGAAGTGCT GCGAATGGTG CTGATTGGAT TGTCATCACA GCTGGATGGG GATTAGCGGT 180

TACAATGGGT GTGTTTGCTG TCGGTCAATT CTCAGGTGCA CATTTAAACC CAGCGGTGTC 240

TTTAGCTCTT GCATTAGACG GAAGTTTTGA TTGGTCATTA GTTCCTGGTT ATATTGTTGC 300

TCAAATGTTA GGTGCAATTG TCGGAGCAAC AATTGTATGG TTAATGTACT TGCCACATTG 360

GAAAGCGACA GAAGAAGCTG GCGCGAAATT AGGTGTTTTC TCTACAGCAC CGGCTATTAA 420

GAATTACTTT GCCAACTTTT TAAGTGAGAT TATCGGAACA ATGGCATTAA CTTTAGGTAT 480

TTTATTTATC GGTGTAAACA AAATTGCCGA TGGTTTAAAT CCTTTAATTG TCGGAGCATT 540

AATTGTTGCA ATCGGATTAA GTTTAGGCGG TGCTACTGGT TATGCAATCA ACCCAGCACG 600

TGATTTAGGT CCGAGAATTG CACATGCGAT TTTACCAATA GCTGGTAAAG GTGGTTCAAA 660

TTGGTCATAT GCAATCGTTC CTATCTTAGG ACCAATTGCC GGTGGTTTAT TAGGTGCAGT 720

GGTATACGCT GTATTTTATA AACATACATT TAATATTGGT TGTGCAATTG CrATTGTTGT 780
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CGAATCAATT TACTAAAATA AAAAGAAACG T AAAT AG CAT AATTTAACAT GTTTGATTCA 900 
TGGATTATGC TATTTTTTCG CCAAAATTTA ACAGATTTTG TACAATGGGT TAGCGATTAT 960

TTTTTAATAA AGGAGATACT ACTAATGGAA AAATATATTT TATCTATAGA CCAAGGAACA 1020

ACAAGCTCAA GAGCGATTTT ATTCAATCAA AAAGGGGAAA TTGCAGGGGT AGCACAACGT 1080 

GAGTTTAAGC AATATTTTCC ACAATCAGGT TGGGTTGAAC ATGATGCAAA TGAAATTTGG 1140

ACATCTGTGT TAGCTGTAAT GACGGAAGTA ATTAATGAAA ATGATGTTAG AGCTGATCAA 1200

ATTGCAGGTA TCGGTATTAC AAACCAACGT GAAACAACGG TTGTTTGGGA CAAaCATACT 1260 

GGCCGCCCAA TTTATCACGC AATTGTTTGG CAATCACGTC AAACACAATC AATTTGTTCA 1320

GAATTAAAAC AACAAGGATA TGAACAAACA TTTAGAGATA AGACAGGATT ACTTTTAGAT 1380

eCGTATTTTG CAGGTACAAA AGTTAAATGG ATTCTAGACA ATGTTGAAGG TGCACGAGAA 1440

AAAGCAGAAA ATGGCGATCT ATTATTTGGA ACGATTGATA CTTGGTTAGT ATGGAAATTA 1500 

TCaGGaAAAg CtGCGCATAT TACTGATTAT TCaAATGCGA GTCGTACATT AATGTTTAAT 1560 

ATCCATGATT TAGAATGGGA CGATGAGTTA TTAGAACTAt TACAGTACCT AAAAATATGT 1620 

TGCCAGAAGT TAAAGCTTCG AGTGAAGTAT ATGGTAAGAC AATTGATTAC CACTTCTATG 1680

GTCAAGAAGT ACCAATCGCT GGAGTAGCTG GTGATCAACA AGCAGCATTA TTTGGACAAG 1740

CTTGCTTCGA ACGTGGTGAC GTGAAAAACA CATATGGAAC TGGTGGCTTC ATGTTAATGA 1800 

ATACAGGTGA CAAAGCGGTT AAATCTGAAA GTGGTTTATT AACAACAATT GCTTATGGTA 1860

TTGATGGAAA AGTAAATTAT GCGCTTGAAG GTTCCATCTT TGTTTCGGGT TCAGCAATCC 1920
AATGGTTACG TGATGGATTA AGAATGATTA ATTCAGCACC ACAATCAGAA AGTTATGCGA 1980 
CACGAGTTGA CTCTACTGAG GGTGTTTATG TTGTTCCAGC TTTTGTAGGT TTAGGAACAC 2040 
CATMTGGGA^ TTCTGAAGCA CGTGGTGCGA TTTTCGGTTT AACACGTGGA ACTGAAAAAG 2100
AGCACTTTAT CCGTGCAACT TTAGAATCAC TATGTTACCA AACTCGTGAC GTTATGGAAG 2160 
CAATGTCAAA AGACTCTGGT ATTGATGTCC AAAGTTTACG TGTCGATGGT GGTGCAGTTA 2220 
AAAATAACTT TATTATGCAG TTCCAAGCAG ACATTGTTAA TACTTCTGTT GAAAGACCTG 2280 
AAATTCAAGA AACTACAGCT TTAGGTGCTG CATTTTTGGC AGGTTTAGCA GTTGGATTCT 2340
GGGAGAGTAA AGATGATATC GCTAAAAACT GGAAATTAGA AGAAAAATTC GATCCGAAAA 2400

TGGATGAAGG CGAAAGAGAA AAATTATATA GAGGTTGGAA AAAAGCTGTT GAAGCAACAC 2460

AAGTTTTTAA AACAGAATAA ACTTGTAGAT TAGACTTTTG TATAAACATT GTGATACAAT 2520

CAATTTAAGT TAATATTTGA ATCGAGAAGC GAGAGATTTG TTCGAACATG TACAATTGAA 2580 

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GCATTGTCTA CTTTTAAGAG AGAACATATT AAAAAGAATT TAAGAAATGA TGAATATGAT 
TTAGTAATTA TTGGTGGCGG TATTACAGGT GCAGGTATTG CACTAGACGC GAGTGAAAGA 
GGAATGAAAG TTGCATTAGT TGAAATGCAA GACTTTGCAC AAGGAACAAG CTCAAGATCT
ACAAAATTAG TCCATGGTGG TTTGCGTTAC TTAAAACAAT TCCAAATTGG AGTAGTTGCC
GAAACTGGTA AAGAACGTGC GATTGTTTAT GAAAATGGGC CTCATGTTAC GACTCCAGAG 
TGGATGCTTT TACCAATGCA TAAAGGTGGA ACATTTGGTA AATTCTCAAC ATCAATTGGT 
TTAGGAATGT ATGATCGTTT AGCAGGTGTT AAGAAGTCTG AACGTAAAAA AATGTTATCT 
AAAAAAGAAA CTTTAGCTAA AGAACCATTA GTTAAAAAAG AAGGTCTAAA AGGCGGCGGT

TACTATGTTG AATATCGTAC TGACGATGCG CGTTTAACTA TTGAAGTTAT GAAGCGTGCT 
GCTGAAAAAG GCGCAGAAAT TATCAACTAT ACTAAATCTG AACACTTCAC TTATGATAAA 
AATCAACAAG TAAATGGTGT TAAAGTTATA GATAAATTAA CTAATGAAAA TTATACAATT

AAGGCTAAAA AAGTGGTTAA TGCAGCAGGT CCATGGGTTG ATGATGTTAG AAGTGGTGAT 
TATGCACGCA ATAATAAAAA ATTACGTTTA ACTAAAGGTG TACATGTTGT TATTGATCAA 
TCAAAATTCC CATTAGGTCA AGCAGTATAC TTTGATACTG AAAAAGATGG AAGAATGATT

TTTGCAATTC CACGTGAAGG AAAAGCGTAT GTAGGTACTA CAGATACATT CTATGACAAT 
ATCAAATCTT CACCATTAAC TACACAAGAA GACAGAGACT ATTTAATCGA TGCGATTAAT 

TACATGTTCC CTAGTGTTAA TGTTACAGAT GAAGATATTG AATCAACATG GGCAGGAATT 
AGACCATTAA TTTACGAAGA AGGCAAAGAC CCTTCTGAAA TCTCTCGTAA GGATGAAATT 

TGGGAAGGTA AATCAGGTTT ATTAACTATT GCAGGTGGTA AATTAACAGG CTATCGTCAC

ATGGCTCAAG ACATTGTTGA TTTAGTATCT AAACGCTTGA AAAAAGACTA CGGTTTAACA 
TTTASTCCAT GTAATACAAA AGGTCTGGCA ATTTCAGGTG GCGATGTAGG TGGTAGCAAG

AACTTTGATG CGTTTGTAGA GCAAAAAGTA GATGTAGCTA AAGGATTCGG CATTGATGAA

GATGTTGCAA GACGTTTAGC ATCTAAATAT GGTTCAAATG TTGATGAATT GTTCAACATT 
GCGCAAACAT CTCAATACCA TGATAGCAAG TTACCATTAG AAATTTATGT AGAACTTGTT

TATAGTATTC AACAAGAAAT GGTATACAAA CCTAACGATT TCTTAGTTCG TCGTTCTGGT

AAAATGTATT TCAATATTAA AGATGTATTA GATTATAAAG ATGCTGTCAT CGATATTATG 
GCAGATATGC TTGATTACTC TCCAGCTCAA ATTGAAGCAT ATACTGAAGA AGTTGAGCAA 

GCAATTAAAG AAGCGCAACA TGGaAATAAT CAACCAGCAG TTAAAGAATA AtTAATTTGT 
ACAATCATAA ACTGGTGTCC TGTTTTAAGG GCATCAGTTT TTTTATACGA GATACATTAG 
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GTTATTAAAG GTOTGAGATG ATGACTGAAA AACAATTTAA ATTAACTGTA CAAGATAATA 4500 

CGAATATTGA AGTTAAAGTG AATTTTACAG ATGTAGATTC AAAAGGAATT ATTCATATAT 4560

TTCATGGTAT GGCTGAACAT ATGGAACGTT ACGATAAATT AGCACATGCA CTTTCAAAGC 4620 

ATGGCTTCGA TGTGATACGT CATAATCATC GAGGACATGG TATTAATATT GATGAATCAA 4680 

CAAGAGGGCA TTACGATGAT ATGAAACGAG TTATCGGTGA TGCCTTTGAA GTAGCGCAAA 4740

CAGTGAGAGG CAATGTTGAT AAACCATACA TTATAATCGG ACATTCAATG GGATCCGTTA 4800 

TAGCTAGATT GTTTGTAGAA ACATATCCGC AATATGTTGA TGGTCTAATT TTAAGTGGTA 4860

CTGGTATGTA TTCATTATGG AAAGGTTTAC CAACCGTTAA AGTGTTACAA CTGATTACAA 4920

AAATTTATGG TGCTGAGAAA CGAGTTGAAT GGGTTAACCA GTTAGTATCA AATAGTTTTA 4980

ATAAAAnnAT ACGTCCATTA CGTACACAAA GTGATTGGAT TTCTAGTAAT CCAATTGAAG 5040 

TAGATAaCTT TATTAAAGAT CCATATAGTG GaTTTAATGT GTCAAATCAA TTATTATATC 5100

AAACAGCCTA TTATATGCTA CATACATCAC AATTAAAAAA TATGAAAATG TTAAaTCATG 5160

CCATGCCTAT ATTATTAGTT TCAGGATATG ACGATCCTTT AGGTGATTAT GGTAAAGGGA 5220

TTTTAAAATT GGCGAATATA TATAGAAACG CTGGCATnAA AAATGTTAAA GTGAATCTTT 5280

ATCATCATAA ACGTCATGAA GTGTTATTTG AAAAnGATCA TGACnAAATT TGGGAAGACT 5340

TGTTTAAATG GTTGAATCAA TTTTATAAAA AATAAAGAAA GTGGAATTAA ATATGAATAA 5400

AAATAAGCCT TTTATTGTAG TAATTGTGGG GCCAACTGCT TGCAG 5445

(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2569 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:
TGGCTTGAAC TACGCCAATA AGTCCCCCTA GTACAAGAAT GAATACCATG ATATCGACCG 
CTTCTATCGT ACCTTCAACC ATGCTACTTG TTATTTGTTC TGGTCCAGCT GGATGTTGCT 
TTAATCTTrC ATAAGTATTC GGAATTGATA CCGGCTTATT AATTGCACCT GATTTAAATT 180 
GTTCAATCTT AATTTTAACC CCCATTTTGT CTAGTTCCTG TTGCGTACCC GGAACCTTTT 
TCACTTGGTT ATGAGGGTTA ACTATCTTTA GTTCTTGGGA TGAAGGTTCG TAAGAAAGTT 
TAGAATATGC ACCAGCAGGA ATAACCCATG TTGCTATAAC TGCAACAACC GTTAAAATGA
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TAATTGTATT TTCCACGGTT TCATCTCCTT CGACATTTAA CCTAGCATTT CTACCTTAAA 
GATTTTATAA ATATAAATTA AGAAAGTGCA CCCCGCATCA AAATAGAGGC ATTATTTTCA 
GGGGGTGCAC ATAAATAATA AAAATCATGC ATTTGACATA TAGTAATTGA AAAGCGTTTC 
AATTCAATTA CTTTTTAATC ACAGTACCTA CTTTACCCTC TAAGGCAGCA TCTAATTCAT 
TTAATGATGT TATAAGCACA CTTCCTTTTG GATTGTTTTC AATAAATGAT ATGGCTGCTT 
CAATTTTTGG TAACATACTT CCTTTTGCAA ATTGATTTTC GTCTATATAT CGTTTTAATT 
CATCAACATT TGTTGTTTTC AAAGGCTGTT GGTTTTCAGT GTTAAAATTA ATATATACAT 
AATCAATTGC TGTTAAAATA ATCAATTGAT CGCATTGAAT ATTAGCACCC AACAACGCAC

TTGTTTTATC TTTGTCTATA ACTGCATCAA TACCTTTAAA ACCATCATGT TGCTCTCTAA 
TTACTGGTAT ACCTCCACCA CCAGCAGCAA TAACGAGTGT ATCATTTTTA ATAAGTGTTT 
TAATACTCTC TAATTCAATA ATAGAGATGG GTTGTGGTGA AGGAACAACG CGTCTATATC

CTCTTCCAGC ATCTTCAACA AATATAAATC CTTTTTCTTT TTGAATTTGT TCAGCTTCTT 
CTTTGTTGTA AAATAACCCA ATTGGTTTTG AAGGATTGTT AAATGCCGGA TCATTTTCAT 
CAACTTCAAC TTGTGTCACT AGTGTTACCA CTTGTTTATC CATTCCAATA GAATGCAATT 
CATTTTGTAA GCTTTCTTGT AATTGATAGC CGATGTAAGC TTGACTCATT GCGCCACATT 
CAGCAAATGG AAATGCCGGA CCTTGGTTAT GTTCTGCAGC ATAGTTAAGT CCCAAATTAA

TGCTTCCAAC CTGTGGTCCA TTACCATGAC TAATAACAAT CTCATGTCCT TTTGTnATTA 
AyCCTACTAA TGATTtCGCA GTATTTTTAA CAAGCTCGAG TtGgTyCTTG aGGTGATTTn 
CCTAAAGCAT TACCACCTAA TGCTACTACT ATTTTCGCCA TCATATTCAC TTCCTTATAT

CATTTAAAAT TCACCCAATG TAGCAACCAT GaCTGCTTTG ATTGTATGCA TTCTGTTCTC 
AGCTTCTTGG AATACAACTG AAGCTTTACT TTCGAATACT TCATCTGTAA CTTCCATTTC 
TCGAATACCA TATTTTTCAA AAATTTGTTG ACCTATTTTC GTATCAGCAT TATGGAAAGA

TGGTAAGCAA TGCTCAAAAA TAACATTTGG ATTACCAGTT TTATCCATTA TTTCTTTATT 
TACTTGATAT GGTTTCAATA ATTCAAGTCG TTCTTTCCAT ACTTCATCAG GTTCACCCAT 
TGATACCCAA ACATCAGTGT AAATTACATC CGAACCTTTT ACaCCTTGGT CaATATCATC

TGTGATTAAT ATGTTGCCaC CATTTTCaGC GGCAATATTT TTACAGCGAT TTAATAATTC
ATCTGTTGGA TTTAATTCTT TTGGACAAAC TAAATGGAAG TTCATACCCA TAATGGCAGC

ACCTTGCATT AATGCATTTG CAACGTTATT ACGACCATCT CCAACATATG TAAAGTTAAT 
ATCTGCATAA TCTTTTTTTA AGACTTCTTT TGCTGTTAAG AAATCAGCAA GAACTTGAGT 
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TTCTACTGTT CTTTGTGAAA AACCACGGTA TTCAATGCCA TCATACATTC CACCAAGCAC 2280 

ACGTGCAGTA TCTTTAGTTG TTTCTTTTTT ACCCATTTGT GATCCAGTTG GGCCTAAATA 2340

AGTTACATTT GCACCTTGAT CATGCGCTGC AACTTCAAAT GCACATCGCG TTCTTGTAGA 2400

ATCTTTTTCA AATAACAGTG CAATATTTTT ATTTTTTAAC ATAGGCTTTT CAGTGCCAAT 2460

ATATTTAGCA CGTTTTAAAT CCTCGGAGAG TGTTAATAAG GTTCTACCTC TTGTCGTGAA 2520

AAGTCTAATA AAGTTAAAAA ACTTCTGTTT CGTAnATTTT TCATTAAnA 2569

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1273 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
CCTGGAACCA TCCaATCGtG CaAATCtTGa AAGaGAATAC GCAACAACAA TTAAATGTAT 60

TGGAACACTA TATTCCAAAT GACCATCCAG CACTCGTTGA ATTAAAAATA TGGGAACGTT 120

GGTTACATAA ACAAGGTTAC AAAGACATCC ATTTAGATAT TACTGCGCAC CACCTAGATC 180

CTATTACACA GGTTTATTTA TTCAATGTCA TTTTGCTGAA AATGAATCTC GAGTTTTAAC 240

AGGTGGTTAT TACAAAGGAA GCATCGAAGG GTTTGGATTA GGATTAACAC TTTAAGTAAG 300
GGAGTATGCA CAATGTTAAG AATCGCCATA GCCAAAGGAC GTCTAATGGA TAGTTTAATT 360

AACTATTTAG ATGTAATTGA ATATACGACA TTATCAGAAA CATTAAAAAA TAGAGAACGC 420
CAATTATTAT TAAGTGTAGA TAATATTGAA TGCATTTTAG TAAAAGGAAG TGACGTGCCA 480
ATCTATCTGG AACAAGGAAT GGCAGACATA GGCATTGTTG GTAGCGACAT ATTAGATGAG 540

CGCCAATATA ATGTTAATAA TTTGTTGAAT ATGCCTTTTG GAGCATGTCA TTTTGCGGTT 600
GCAGCGAAAC CTGAAACGAC CAATTATCGT AAAATCGCAA CGAGTTATGT TCATACTGCT 660
GAAACATATT TTAAATCAAA AGGTATTGAT GTCGAATTGA TTAAATTGAA TGGCTCTGTT 720
GAATTGGCCT GTGTTGTAGA TATGGTAGAC GGAATTGTCG ACATCGTTCA AACAGGTACT

ACGCTAAAAG CGAACGGACT GGTTGAAAAG CAACATATTA GTGATATCAA TGCAAGATTA
ATAACTAATA AAGCAGCTTA TTTTAAAAAA TCACAATTAA TAGAGCAATT TATTCGCTCT 900
TTGGAGGTGT CTATTGCCAA TGCTTAATGC ACAACAATTT TTAAATCAAT TTTCATTAGA

AGCACCATTA GATGAGTCAT TGTATCCaAT TATTCGCGAT ATTTGTCAGG AAGTTAAAGT
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TTTAGOAATT AGTCATGAmC AAATTAAAGC AGCATTTGAC ACATTAGATG AAAAAACAAA 
ACAAGCATTA CAACAAAGTT ATGAAAGAAT TAnAGCATAT CAaGAAaGTA TtaAACAGaC
GaATCAACAG TTAGAAGaAT CAGTGGaGTG tTrTGaAATA TACCATCCmC taGaAAGTGT 
CGGTATTTAT GTG 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
GTTGATAAAT TAAAAATGTT TTTATCAGAT ATTCAAAGTT ACCAACAATA TAGTAAAGAT 
CATCCGGTGT ATCAGTTAAT TGATAAATTT TATAATGATC ATTATGTTAT TCAATACTTT 
AGTGGACTTA TTGGTGGACG TGGACGACGT GCAAATCTTT ATGGTTTATT TAATAAAGCT 
ATCGAGTTTG AGAATTCAAG TTTTAGAGGT TTATATCAAT TTATTCGTTT TATCGATGAA
TTGATTGAAA GAGGCAAAGA TTTTGGTGAG GAAAATGTAG TTGGTCCAAA CGATAATGTC 
GTTAGAATGA TGACAATTCA TAGTAGTAAA GGTCTAGAGT TTCCATTTGT CATTTATTCT 
GGATTGTCAA AAGATTTTAA TAAACGTGAT TTGAAACAAC CAGTTATTTT AAATCAGCAA 
TTTGGTCTCG GAATGGATTA TTTTGATGTG GATAAAGAAA TGGCATTTCC ATCTTTAGCT
TCGGTTGCAT ATAGAGCTGT TGCCGArAAA GAACTTGTGT CAGAAGAAAT GCGATTAGTC
TATGTAGCAT TAACAAGAGC GAAAGAACAA CTTTATTTAA TTGGTAGAGT GAAAAATGAT 
AAATCATTAC TAGAACTAGA GCAATTGTCT ATTTCTGGTG AGCACATTGC TGTCAATGAA 
CGATTAACTT CACCAAATCC GTTCCATCTT ATTTATAGTA TTTTATCTAA ACATCAATCT 
GCGTCAATTC CAGATGATTT AAAATTTGAA AAAGATATAG CACAAATTGA AGATAGTAGT 
CGTCCGAATG TAAATATTTC AATTGTGTAC TTTGAAGATG TGTCTACAGA AACCATTTTA 
GATAATGATG AATATCGTTC GGTTAATCAA TTAGAAACTA TGCAAAATGG TAATGAAGAT 
GTTAAAGCAC AAATTAAACA CCAACTTGAT TATCGATATC CATATGTAAA TGATACTAAA 
AAGCCCTCAA AACAATCTGT TTCTGAATTG AAAAGACAAT ATGAAACAGA AGAAAGTGGC 

ACAAGTTACG AACGAGTAAG GCAATATCGT ATCGGTTTTT CAACGTATGA ACGACCTAAA 
TTTCTAAGTG AACAAGGTAA ACGAAAAGCG AATGAAATTG GTACGTTAAT GCATACAGTG 
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GATGGATTAA TCGATAAACA TATTATCGAA GCAGATGCGA AAAAAGATAT CCGTATGGAT 1260 
GAAATAATGA CATTTATCAA TAGTGATTAT ATTCGATATT GCTGAAGC 1308 
(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1431 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

GATGCCATTn ATnnGTATGC AAGAAGTTGT TCCGGGTTCA GGTGGATTaC CAGTTGGTAC 60

TGGTGGTAAG ACGTTACTAA TGCTTTCAGG CGGTATAGAC TCACCAGTTG CTGGGATGGA 120

AGTGATGAGA CGTGGCGTAA CAATTGAAGC GATTCATTTC CATAGTCCAC CATTTACAAG 180

TGATCAAGCA AAAGAAAAAG TTATTGAATT GACACGTATT TTAGCTGAAC GTGTTGGACC 240

AATTAAATTG CATATTGTAC CATTTACAGA ATTGCAAAAA CAGGTAAATA AAGTTGTACA 300

TCCAAGATAT ACAATGACTT CAACGAGACG TATGATGATG CGTGTTGCTG ATAAATTAGT 360

ACATCAAATA GGGGCTTTAG CTATTGTAAA TGGTGAAAAC CTAGGGCAGG TAGCCAGTCA 420

AACACTTCAT AGCATGTATG CAATTAATAA TGTAACTTCT ACTCCTGTAT TACGTCCTTT 480

ATTAACTTAC GATAAAGAAG AAATTATTAT TAAATCGAAA GAAATTGGTA CATTTGAAAC 540

ATCTATTCAA CCATTTGAAG ATTGTTGTAC AATTTTCACC CCTAAAAATC CAGTAACCGA 600

ACCAAACTTT GATAAGGTAG TCCAATATGA AAGTGTCTTT GATTTTGAAG AGATGATTAA 660

TCGTGCTGTT GAAAATATTG AAACACTTGA AATAACTAGT GATTATAAAA CTATTAAAGA 720

ACAGCAAACA AACCAATTAA TAAACGACTT TTTATAAATA AAATCCTAGA GTAAATTTAA 780

ACATAAGGGG ATGTTAAACT ATGGATTTGA ACTTAACGAT GATTATAATC ATAATTTTAT 840

TTGGTTTTAT CGCGGCGTTT ATAGATTCGG TTGTAGGGGG TGGCGGTTTA ATTTCTACGC 900

CAGCATTATT AGCAATCGGT CTACCACCAT CTGTGGCTTT AGGTACAAAT AAATTGGCAA 960

GTTCGTTTGG TTCTTTAACT AGTACGATAA AGTTTATAAG GTCCGGTAAA GTGGACTTAT 1020

ATGTTGTTGC CAAATTATTT GGTTTTGTAT TTTTGGCATC TGCATGTGGC GCATATATTG 1080

CAACGATGGT TCCGTCACAA ATATTGAAAC CTTTAATCAT CATTGCACTT TCGTCGGTGT 1140

TTATATTCAC ATTACTTAAA AAAGATTGGG GCAATACACG CACGTTTACT CAATTTACAT 1200

TTAAGAAAGC CATAATATTT GCAGCACTTT TTATATTAAT CGGCTTTTAT GATGGATTTG 1260
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TAAGTGCAGC AGGAAATGCT AAAGTTTTGA ACTTTGCTTC TAATATAGGT GCGCTTGTAT 
TATTTATGGT ATTAGGACAA GTAGATTATG TAATAGGTTT AATTATGGCT A 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

AATATTATTT TAAATTCAAT ATTTATTGGT *JV-AA 1 lAi 11 TAAACTTATT ATTCGCCTTT 60

ACCATTATTT TCATGGAAAG ACGTTCTGCC AATTCTATCT GGGCTTGGTT ACTAGTCTTA 120

GTTTTCTTGC CTTTATTCGG CTTCATTTTA 1 AL. 1AT TAGGACGACA AATTCAACGT 180

GACCAAATTT TCAAAATTGA TAAGGAAGAT TAGAGTTAAT CGTTGATGAG 240

CAATTAGCTG CTTTAAAAAA TGAAAACTTT TCAAATTCCA ATTATCAAAT TGTAAAATTT 300

AAAGAAATGA TTCAAATGTT GTTATATAAT AACGCAGCAT TTTTAACAAC AGACAACGAT 360

TTArrrrtAT ACACAGACGG CCAAGAAAAA TTTGATGACC TAATACAAGA CATCCGTAAT 420

GCTACTGATT ATATTCATTT TCAGTACTAT ATTATTCAAA ATGATGAATT AGGTCGTACC 480

ATTTTAAATG AACTTGGTAA AAAAGCGGAA CAAGGTGTAG AAkjTTAAAAT TCTTTATGAT 540

GACATGGGTT CTCGTGGACT GCGTAAAAAA GGCTTACGCC CGTTTCGCAA TAAAGGTGGA 600

CATGCTGAAG CATTTTTCCC ATCAAAATTA CCTTTAATTA ACTTGCGTAT GAACAATCGA 660

AACCATCGAA AAATTGTTGT AATAGATGGG CAAATTGGAT ATGTTGGTGG TTTTAATGTT 720

GGTGATGAGT ACTTAGGTAA ATCAAAAAAA TTCGGCTATT GGCGAGATAC GCATTTACGA 780

ATTGTCGGGG ATGCAGTGAA TGCATTGCAA TTACGATTTA TTCTAGATTG GAATTCACAA 840

GCCACACGTG ACCACATCTC CTATGATGAT CGTTATTTCC CAGATGTAAA TTCTGGTGGA 900

ACAATTGGCG TTCAAATAGC TTCTAGTGGT CCTGACGAAG AATGGGAACA GATTAAATAC 960

GGCTATTTGA AAATGATTTC ATCTGCTAAA AAATCGATTT ATATTCAATC TCCCTATTTC 1020

ATACCTGATC AAGCCTTTTT AGATTCTATT AAAATTGCGG CATTAGGTGG TGTTGATGTC 1080

AATATCATGA TTCCTAATAA ACCTGACCAT CCGTTTGTTT TTTGGGCTAC TTTAAAAAAT 1140

GCAGCATCCT TATTAGATGC CGGTGTTAAA GTATTTCACT ACGACAATGG CTTTTTACAC 1200

TCAAAAACAC TTGTTATAGA TGATGAAATT GCAAGTGTGG GAACAGCTAA TATGGACCAT 1260
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AAATTAAAAC AAGCTTTTAT AGATGATTTA GCAGTATCTT CTGAATTAAC AAAAGCACGT 1380 

TATGCTAAGC GAAGTCTTTG GATTAAATTT AAAGAAGGTA TTTCACAATT ATTGTCACCT 1440 

ATCTTATAAA ATAGAAATAT GAGGAGTGTA aCTTTAATGC AACAATCAGA CGTCATTAGT 1500

GCTGCCAAAA AATATATGGA ATCTATTCAT CAAAATGATT ATACAGGCCA TGATATTGCG 1560 

CATGTATATC GTGTCACTGC TTTAGCTAAA TCAATCGCTG AAAATGAAGG TGTTAATGAT 1620 

ACTTTAGTCA TTGAACTCGC ATGTTTGCTT CATGATACCG TTGACGAAAA AGTTGTAGAT 1680

GCTAACAAAC AATATGTTGA ATTGAAGTCA TTTTTATCTT CTTTATCACT ATCAACCGAA 1740 
GATCAAGAGC ACATTTTATT TATTATTAAT AATATGAGCT ATCGCAATGG CAAAAATGAT 
CATGTCACTT TATCTTTAGA AGGTCAAATT GTCAGGGATG CAGATCGTCT TGATGCTATA 
GGCGCTATAG GTGTTGCACG AACATTTCAA TTTGCAGGAC ACTTTGGTGA ACCTATGTGG 

ACAGAACATA TGTCACTAGA TAAGATTAAT GATGATTTAG TTGAACAGTT GCCACCATCT 1980 

GCAATTAAAC ATTTCTTTGA AAAATTACTT AAGTTAGAAT CTTTAATGCA TACAGATACG 2040 

GCGAAGATGA TTGCTAAAGA ACGTCACGAC TTTATGATGA TGTACTTGAA ACAGTTTTTT 2100 

ACGGAATGGA ATTGTCACGA CTAGACATTG AAGTTGTAGT ATGATGATGC GATGTAATGG 2160

CGTGTTGTTG TGGAAGCTTG GTGTCATGCC ATGTTACTTT GATGTGTTGT TGTGGGAGCT 2220 

TGGTGACATG TCATGCTACT TTGATGTGCT GGTACCACGA TGCGTCTTGA TGTAGTGCTA 2280

TGATGTGGCA TTGCGGTGTT ATGGTGTTAT AGACAGGTTT GGCGTTGATG CCATGTTACT 2340

TTGATGTGCT GGTACCACGA TGCGACTTGA TGTAGTGCTA TGATGTGGCA TTGCGGTGTT 2400

ATGGTGTTAT AGACCGGTTT GATGTTGATG CCATGTTACT TTGATGTGCT GGTGCTACGA 2460 
TGCGACTTGA TGTAGTGCTA TGATGTGGCG TTGCGCTGTT ATGGTGTTAT AGCCAGGTTT 
GGTGTTGATG TCATGCCGTT ACGATTCTAT GATATGTTGT TGGGACGTTG CAATGTGTAT 
TATGCCGTTG TGACGTTATT ATTTCACACT GTTACATGTA TAAGTGAATT GCTGTGGAAA 

TTTGCGACAT ATACTGCTAC ACTGATGAAT CATTGTGTCA AGATGACATT GCGATGAAGA 2700 

ATGACAACTC TGTTATTAAC CACTTTTTAC ATACTGAAAA CTCGTTAATA TTATTTCAAA 2760 

TAAAAACAGC AGTAGGATGA CTTTCACATT TGAAATCATC TTACTGCTGT TTCTATTTAT 2820

CACATATTGT ATAATGTGAC ACTAAGTTTC GCTATTGAAG CGAAAAATAA TGTGCGCCCT 2880 

ATAAAGTTAA AATTATCTTC AACTTTTAGG GTGCACATTA TTTGGACTTG CTAAGGTTAT 2940 

TTCTTTTTCT TTTTAGACAC AACTTGTGTG TTTTTGCCTT TTTTATTGCt GCCGCCGTTG 3000
TGCTCTCTTT CATACGCTTC AATGAAAGGT TGTACTTCTT TTTTAGCGAC TTTTTCATAA 
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CCAAGTGCTG ATGCTGAGCT TAATGAAATC CAGATAATCA TAATTGGTGA AATGACCATC 
ATCATGTAAC CCA^ACG TTGTTCGTCT GGCATCGTTT TACTTGATAC ATATGCTTGG 
ATAAAGTATA AAACACCGGC AATAATTGTA ATCCAAATAT CAGGACGTCC TAAATCGAAC
CATAAGAAGT GTGGATATTT AAACAAACCA TCTACAAGTT GGTCTTTAAG TACAAAGTAT 
AATCCCATGA TGATTGGTAA TTGGATTAGC ATTGGTAAAC AACCCAACAT ACTCTTAATC 
GGGTTCATGT CATACTTTTT ATATACTTGC ATTAATTCTT GGTTTGCAGC CATTTTTTCT 
TCTTGTGTAC GCGnCaCGTT cACTTTTTCT TGAATTTTTT CAACTTCTGG CTTTGCAACT 
TTCATTTTTT GACGCATCAT ATGACTATTT TTATAGTTTG ACAACATGAA TGGTAATAAA 
ATAATACGAA TTACCAATAC AAGGATAATA ATAGCTAAAC CATAATTGTC GTTTAATAAG
1T ATO CCA ACCAAT CCAA TACATTTTTC ATTGGATCTA CGAATGTATT GTAGAAAAAy 
cwCtACGTTT TTCAGGTTTA GAATAGTCAC AACCAGCCAA AAAGACCATA ATACCTAAAA 
ATAATGGTAG TAACGCTTTT TTCTTCATTT TTCCACCTCT ATCATTATAT TCACATAGGA 
TTTATTCTAT CACATTAATG AGTACGTATG AAACAATAAG TGGAAAAATT TAACTAATTA 
TTAAAAAAAT CTTTGAATCG ATTAACAGTC TTTTCAATAT TTTCACTTTT AGAAATGGCT 
GAAATGACTG AAATTCCATT GGCACCTGCT TCTACAATCG GCGCCACATT ATTAGTATTG 



30 — x.w^.i LiUAAACGT TGCAATCATT 4080 

TCTGGACCTA CTGGTATATG CGCGTCATGC TTCGACGGCG TAGGATAGAT TGGTCCAACA 
CCTATATAAT CmACATGAGT TAAATCAGAT TTTGCATACT CATCTAAATC ACTAATACTA

- AGTCCAATAA ™"™»GT GAAATATTGT G CTATCTCTT TGACTTTCGC ATCATCTTGA 

CCGACATGTA TACCATCCGC GTTAATTTCT TTTGCCAAGG ATACATCATC ATTAACGATA 
AAAGGCACAT CATATTGATG ACAGAGATGC TGTAATTCTT TAGCTAATAC AAGTTTATCG 

TTTCCTTTTA AAGCTGATTC ACC

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1808 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:
TGGAnCCAAT ATTAGAAATG ATTAAAACAT TAACAGGTAT TAATAGTCCT TCAGGAGnCA 
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TAACAAATAA AGGTGCGTTA TTAATAACAG TGCCAGGCAA AAATGATGAA GTACAACGCT 
GTATTACTGC TCATGTTGAT ACTTTAGGTG CaATGGTTAA AGAAATTAAA GAAGATGGTC 
GCTTaGCAAT AGAATTAATT GGAGGATTCA CGTATAACGC GATTGAGGGT GAATATTGCC 
AAATTAAAAC TGATGCTGGT CAAATATATA CAGGAACAAT TTGTCTGCAT GAAACAAGTG 
TTCATGTATA TAGAAATAAT CATGAAATAC CTAGAGATCA AAAGCATATG GAAATAAGAA 
TTGATGAAGT AACTACATCA GAAGAAGATA CAAAGAGTTT AGGTATTTCA GTAGGTGATT 
TTGTTAGCTT TGATCCACGT ACAGTTATCA CGTCATCAGG TTTTATTAAA TCTCGTCATT 
TAGATGATAA AGCTAGCGTA CGgTtGATAC TACAATTACT AAAGAAATTA AAAGAAGAGC 
AAATAATATT ACCACATACA ACGCAATTTT ATATTTCTAA TAACGAAGAA ATAGGTTACG 
GTGCAAATGC ATCAATTGAT TCGAAAATCA AAGAATATAT TGCATTAGAT ATGGGCGCGT 
TGGGAGACGG TCAAGCATCG GATGAATATA CAGTTTCTAT TTGTGCCAAA GATGCTTCAG 
GTCCATATCA TAAGCAATTG AAATCGCACC TAGTTAATCT TTGCAAAATA AATAACATTC 
CATATAAAGT AGACATATAT CCATATTATG GTTCAGATGC TTCAGCAGCT TTACATGCTG 
GTGCGGATAT CAGACATGGT TTATTTGGCG CTGGCATTGA ATCATCTCAT GCAATGGAAC 
GAACACATAT TGATTCTATT AAAGCGACAG AGAAATTACT ATATGCATAT TGCTTATCAC
CAATTGAGTA AACAATTAGT GTTGACAAAT GTGaACGACC TATGTAATAT AATGAACTAT 
AAAAATAATT AGAATTTTCT AAAGAAATAG TAGCAGATAT GAAACGTAGC AAATAGAAAG

CTAATGGGTG ATGGGAATTA GCACGCCATA TCTTGTGAAT TGGACTTTGG AAAACAATTG 
AATGAGTTTT GAAAGTGAAC ATGAATTATG TTAACTAAGG TGGCACCACG GTAACGCGTC 

AAGGTAGCAA CAGAAAGTTA GTGGATGATG TGAACTAACA CCGAGATTAA TGAAATTGGG 
TTTTGTCTGC AACAGAAAAA TTATATATAG TAAAGAGTGA ACTATGAATA TTTCGAATAT 
TCGGTTAATT TAGGTGGTAC CACGCGTCAc nTCCTTTATA TTGATAAGGA TGCTGGCGCT 
TTTTTGAAAG GAGCGTATAG AATGGATATA TTTTATAAAA AAATAAAAGC AAATGTAACG 
CCCGAAGTTT TAGCACAACT TCATTCCAAG AAGaTCATTT TGGAAAGTAC AAATCAACAA 
CAAACTAAAG GTCGCTATTC AGTTGTTATT TTTGATATTT ATGGCACTTT AACTTTAGAT 
AATGATGTAT TATCAGTAAG TACTTTAAAA GAATCGTATC AAATCACTGA AAGACCGTAC 
CATTATTTAA CGACTAAnAT AAATGAAGAC TACCATAATA TTCCAAGATG AGGCAACTTA 
AGTCATTA 
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ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

TGGTCGTCAA TTTCTTGATT ATATCTATAA TCCTCATTTT CAATATTAGA GTCTGTAGAA 60 

TCATCGATAT TATTATCATT CGCATGACTA GAAGCAGAAT CATTATTTTT ATCATTGCTT 120

TCTTCTTTTT TGAAGTCTTT ATTTATCAAG TAAATTTCTT CATCAAAATC AGCTTGTTGA 180

GATGTATCAT CTTTATTTTG ATTAGAAAAA TGTGTAGCCT TTGATCTTTT TCTTTGCCGT 240

CTTTTCTTAG ATGTATTCCT CGTAAATAAT TCTAATTCAT CTTTATCTTC ATTTGATTCT 300

TGTTGATCGT TCTTCGTTTT ATCATCCATC AATACTCACA CCCTTTAATA AGATGGTAAA 360

TGGGCACGGA ATCTTTCAAT AAATTTCTCT CCACGCTCTT CAAAAGTACT ATATTGATCC 420 

CAACTCGCAC AAGCAGGTGA CAATAATACA ACATCATTTG GTTCTATAAT ATCTTGTACT 480 

TTATCAACAG CGTCTTCGAC ATTGTTCGCT TCAATGACCG ATTTCCCTTG ACTATTACCT 540

AGTTTAGCAA ACTTAGCTTT CGTTTGTCCG AATACAACCA TCGCGCGAAC ATTTTCCATA 600 

TAAGGAATGA GTTCGTCAAA TTCATTCCCT CGATCCAAAC CACCACATAA CCAAATGATT 660 

GGTTGATTAA ATGAATTTAA GGCAAACTGT GTTGCTAGCG TGTTTGTTGC TTTGGAATCA 720 

TTATAATATT TATTAGTTCT ATTAGTACCA ACATATTGCA ATCTATGCTC TATTCCTGAA 780

AATGTAGTTA AACTATCAAT AATTGCtTTA ATAGGTACAC CAGCanAATA CAAGCAAGCA 840 

GAGCTGCTAA TATATTTcTA AATTATGTTC ACCAGGCAAT ACTAGAtCTT CAGTGTTAAT 900 

AATaCGAACA CCTTTATaAA CGATAAAACC ATCTTtAATA TAAaTACCAT CArCTtCTTG 960

TTGAGTTGAG AAATACAATG TCTTAGCTTT TAATTCTTCC GACTCTATCA CTTGTCTTTG 1020

ATGATAATTA CAAATCAAAT AATCCTCTTC CGTTTGATTT TTATATATTT GCTTTTTAGC 1080

ATTTTGATAG TTTTCTAAAT TTTCATGGTA ATCTAGATGC GCCGAATAAA TGTTAGTAAT 1140

TATAGCAATG TGTGGTTTAT ACTTTTCGAT TCCAAGTAAC TGGAATGACG ACAACTCTGT 1200

AACTAAATAA TCTGTAGGCT TTACTTCTTG TGCTACTTTA GATGCAACAT AACCAATATT 1260 

GCCGGATAAT CTTCCAGTTA AGCGACTTTT TTTAAACATA TCTCCAATTA GAGAAGTAAC 1320 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4280 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

TTTACACCAA TCAAAAAATC GAACTGATAT AAATAAGTAC AAAGCTTATC TATCAATCCG 60

ATTTAGTTAT AAAACAAAAA AAGCCACAGT AATGTGGCTT TTTGTTATAT TCAGTATCAA 120

AATGGTATCA ATAGCCATTT TCGGAAGTCA AGAATGGCTT AACAACGCGG TTTAAAGCTA 180

TCCAATACTA CCTTCCATTT CGAACTTGAT TAAACGGTTC ATTTCGACCG CGTATTCCAT 240

TGGAAGTTCT TTTGTAAATG GTTCGATGAA TCCCATAACA ATCATTTCTG TCGCTTCTTC 300

TTCAGAAATA CCACGACTCA TTAGATAGAA TAATTGTTCT TCAGAAACTT TTGAAACCTT 360

GGCTTCATGT TCTAATGATA TTTGATCGTT GAATACTTCG TTATATGGAA TTGTATCTGA 420

TGTTGATTCG TTATCTAAGA TTAATGTATC ACATTCAATA TTTGAACGAG CACCTTTTGC 480

TTTACGTCCA AAATGAACAA TACCGCGATA AATAACTTTA CCACCATTTT TAGAAATAGA 540

TTTAGAAACA ATTGTAGAAG ATGTATTAGG TGCTTTATGA ATCATTTTAG CACCGGCATC 600

TTGAACTTGT CCTTTACCAG CAAATGCAAT AGATAATGTA CTACCTTTTG CACCTTCACC 660

TAAAAGAACA CAGTTTGGAT ATTTCATCGT TAACTTAGAA CCTAAGTTAC CATCTACCCA 720

TTCCATATTT CCGTTTTCAT AAACAAAAGT ACGTTTTGTA ACTAAATTGT ATACATTGTT 780

CGCCCAGTTT TGAATCGTAG TATAACGAAC GTGCGCATCT TTATGCACAA TGATTTCCAC 840

AACAGCAGAG TGTAAAGAAC TAGTTGTATA AACTGGTGCA GTACAACCTT CTACGTAATG 900

TACAGAAGCA CCTTCATCAG CAATGATTAA TGTACGTTCA AATTGACCCA TGTTCTCAGA 960

GTTAATACGG AAATAAGCTT GTAGTGGCGT ^TCTAGTTTG ATATTTTTAG GTACATAAAT 1020

GAAGGAACCA CCTGACCATA CTGCTGAGTT TAACGCCGCA AATTTGTTAT CTGCTGCAGG 1080

TACTACAGAA GCAAAGTATT TTTTGAATAA TTCTTCATTT TCTTGTAAAG CACTATCTGT 1140

ATCTTTAAAG ATAATACCTT TTTCTTCAAG TTCTTTTTCC ATATTATGGT AAACAACTTC 1200

AGATTCATAT TGAGCAGAAA CACCAGCTAA ATATTTTTGT TCAGCTTCAG GAATTCCTAA 1260

TTTATCGAAA GTTCTTTTAA TTTCTTCTGG CACTTCATCC CATGAACGTT CAGCTTGTTC 1320

TGAAGGCTTT ACATAGTAAG TAATGTCATC GAAATTCAAT TCTGATAAGT CGCCACCCCA 1380

TTGAGGCATT GGCATTTTAT AAAACAATTT TAATGATTTA AGACGGAAAT CTAACATCCA 1440

TTCCGGCTCA TTTTTCATGT TAGAAATTTC TCTAACGATA TTCTCAGTTA AACCACGTTC 1500

TGATCTGAAA ATGGACACAT CATCGTCGTG GAATCCATAT TTATAATCCC CAACATCAGG 1560
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TTTAATTCAT GATGTAAACC ATATTATAAC AATGACATGA CATCTTATAA AAATTTTTAT i 680 

ACTTTTATAT GTCTAATATC AAAATTATCT ATGATTAACA GCATTCTATT CTTCTTCAGT 1740

CGTACCTTCT GCTTTACCTT CTTTAGCAAC AGTACCTTTT TCCAATGCTT TCCAAGCTAA 1800

TGTGGCACAT TTAATACGAG CTGGGAATTG AGATACACCT TGCAATGCTT CAATATCTCC 1860

CATTTCTTCT GTAATCACAT AGTCTTCACC AAGCATCATT TTCGTAAATT CTTGGCTCAT 1920

TTGCATTGCT TCTCCAAGTG AATGACCTTT AACAGCTTGT GTCATCATCG ATGCACTTGC 1980

CATTGAAATC GAACAACCTT CACCTTCAAA CTTAGCATCT TTTATAATGC CGTCTTCTAT 2040

ATCAAATGTT AGTCGTATAC GGTCACCGCA TGTCGGGTTA TTCATATCTA CTGTCATAGA 2100

CCCGTTATCT AATACACCTT TATTTCTAGG ATTTTTATAA TGATCCATAA TGACAGATCT 2160

ATATAATTGA TCTAGATTAT TAAAATTCAT AAGAGAAAAA CTCCTTCGTT TGTTTCAAGG 2220

CATTTATTAA CTGATCAACG TCTTCTTTCG TGTTGTATAT ATAAAAACTC GCTCTAGCTG 2280

TTGAAGACAC ATTTAACCAT TTCATTAACG GTTGCGCACA ATGATGCCCA GCTCTAACCG 2340

CTACACCTTC TGTATCTACG GCTGTAGCAA CATCGTGTGG ATGTACATCT TGTAAATTAA 2400

ACGTTATTAC ACCTGCACGA CGATCCTTTG GCGGGCCATA AATTTCAATT CCTTCAATTG 2460

CAGACATTTG CTCATAAGCA TATATCGTTA ATTCTTGTTC ATATTTATGA ATTGCATCAA 2520

AACCTATGCG TTCTAAATAG CGAATAGCTT CTGCAAGCCC AATTGCTTGA GCAATTAATG 2580

GAGTACCCGC CTCAAATTTA GTAGGTAAAT CAGCCCATGT TGCATCATAC TTACTTACAA 2640

AATCAATCAT GTCGCCACCG AACTCAATCG GTTCCATTTT TTGTAGTAAC TCACGTTTAC 2700

CAAATAATAC GCCAATACCT GTTGGTCCAA GCATTTTATG ACCACTAAAA CTATAAAAAT 2760

CAGCATTCAT TTCTTGCATA TCAAGTTTCA TATGTGGTGC TGctTGCGCC CCATCAACAC 2820

TGATfiATTGC ACCATGTTGA TGAGCTATTT CTGCAATGGT TTTAACATCA TTAATTGTAC 2880

CGAGCACATT AGATATATGT GCAATAGCAA CGATCTTTGT TTTATCATTA ATCGTTTGCT 2940

TAATATCCTC GATGTTTAAT TCACCGTCAG CTGTCATTGG TATAAATTTC AATGTCGCAT 3000

TTTTACGCTT TGCTAACTGT TGCCAAGGAA CAATATTGGC ATGATGTTCC ATTTCAGTGA 3060

CAACAATTTC ATCGCCCTCT TCAACATTTG CATCACCATA GCTATGTGCT ACAAGGTTAA 3120

TCGACGCAGT TGTTCCGCGT GTAAAAATGA TTTCTTCAAA ATACTTCGCA TTAATAAAAC 3180

GACGAACGGT TTCACGGGCA TTTTCATAAC CATCAGTTGC CAATGATCCT AATGTATGAA 3240

CACCACGATG AACGTTTGAA TTATAACGCT TGTAGTAATC TTCTAAAACA TTTAACACTT 3300

GCACAGGCGT TTGACTTGTC GCTGTTGAAT CAAGATATGC TAAACGTTTG CCATTGACTT 3360
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CTTCATTCAC GACCTTTCTT AAATAAAAAT CCTAATCATT TAAATACTGA CGTTGTATTA 34 80 

GTCTTATACC AATATCGACA GTCTATATCT ATTACAAACT TTTATTTTCA AAATATTATT 3540

TAGAAACTTT GCGTTCAATT ACTTCTCTCA ATTGACGTTT AACGTCTTCG ATAGGTAATT 3600
CACGTACTAC TGGATCTAAG AAACCATGTA TAACAAGACG TTCCGCTTCT CTTTGAGAAA 3660
TACCACGACT CATTAAATAG TAAAGTTGAT CTGGATCAAC ACGACCTACT GATGCAGCAT 3720

GACCAGCTTG TACATCATCT TCATCAATTA ATAAAATAGG ATTCGCGTCA CCACGAGCAT 3780

GTTCAGATAA CATTAATACA CGTGATTCCT GATTAGCAAT TGATTTAGTT CCACCATGCT 3840

TAATGTAGCC GATACCATTA AATACAGACG ATGCATGTTC TTTCATAACA CCATGTTTAA 3900

GGATATAACC ATCTGTTTCT TTACCATATT GTACGATTTT AGATGTTAGA TTAATTTTTT 3960

GTTCGCCTGT ACCTACAACT ACTGATTTAA GTGAACTTGT TGAACGATCA CCAAATAAAT 4020

TTGTTGTATT ATCAATAATT TGGCTACCCT CATTCATTAA ACCTAGTGCC CAATTAATTG 4080

AGGCATCCGC TTCAGTAATA CCACGTCGAA TGATATGACC TGTAAAGCCT TTATCCATAT 4140

AGTCCACTGA GCCATATGTG ATATTTGAAT TTGCACCAGC AATCACTTCA GAAATAATAT 4200

TtAATTGATT TCCTTCACCA GATGCATTTG mTAAGTAATT TTCAACATAT GTGACTTCGG 4260

CGCTTTCTTC AGTAACGATG 4280

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15598 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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- (Xi) SEQUENCE" DESCRIPTION T SEQ ID NO: 82 : -■ 

TCnGACTCGA ACGGTGmAAC TAttCCGTTG TaATTCCgGA GgAAsCAAGG TATGCCCATC 60 

TGCaAAGAAA gaATGsAATG AACTTTTTGG AAATGTAGAA GTGGTAAATA AAGATAAAGG 120 

ATATTACATT CTGAGAAGTA TAAAAGCTTG AAATGAAATG GATATTCTGT TATAGTTATA 180 

TAATGTAAAA ATTTATGTTC AATAAGTGTG TACTTTTACG TTAAATAGAT AAGTTAATTA 240

AGAATAAATA TAGAATCGAA AATGGTGTCA TCATTAGTGT TGCCGTTTTC TTTTTGTCTT 300 

TTTATTAATA TGCTTATGGT ATTTAGCTAA AAGCGGATCA CATAATTTTT GAGGGGTGAA 360 

TCTGTTTGGC AGGTCAAGTT GTCCAATATG GAAGACATCG TAAACGTAGA AACTACGCGA 420

GAATTTCAGA AGTATTAGAA TTACCAAACT TAATAGAAAT TCAAACTAAA TCTTACGAGT 480 
CTGGTAATTT GTCATTAGAG TTTGTGGATT ACCGTTTAGG AGAACCAAAA TATGATTTAG 600
AAGAATCTAA AAACCGTGAC GCTACTTATG CTGCACCTCT TCGTGTAAAA GTGCGTCTAA 660
TCATTAAAGA AACAGGAGAA GTTAAAGAAC AAGAAGTCTT TATGGGTGAT TTCCCATTAA 720
TGACTGATAC AGGTACGTTC GTTATCAATG GTGCAGAACG TGTAATCGTA TCTCAATTAG 780
TTCGTTCACC ATCCGTTTAT TTCAATGAAA AAATCGACAA AAATGGTCGT GAAAACTATG 840
ATGCAACAAT TATTCCAAAC CGTGGTGCAT GGTTAGAATA TGAAACAGAT GCTAAAGATG 900
TTGTATACGT ACGTATTGAT AGAACACGTA AACTACCATT AACAGTATTG TTACGTGCAT 960
TAGGTTTCTC AAGCGACCAA GAAATTGTTG ACCTTTTAGG TGACAATGAA TATTTACGTA 1020
ATACTTTAGA GAAAGACGGC ACTGAAAACA CTGAACAAGC GTTATTAGAA ATCTATGAAC 1080
GTTTACGTCC AGGTGAACCA CCAACTGTTG AAAATGCTAA AAGTCTATTG TATTCACGTT 1140
TCTTTGATCC AAAACGCTAT GACTTAGCAA GCGTGGGTCG TTATAAAACA AACAAAAAAT 1200
TACATTTAAA ACATCGTTTA TTTAATCAAA AATTAGCTGA GCCAATTGTA AATACTGAAA 1260
CTGGTGAAAT TGTAGTTGAA GAAGGTACAG TGCTTGATCG TCGTAAAATC GACGAAATCA 1320
TGGATGTACT TGAATCAAAT GCAAACAGCG AAGTGTTTGA ATTGCATGGT AGCGTTATAG 1380
ACGAGCCAGT AGAAATTCAA TCAATTAAAG TATATGTTCC TAACGATGAT GAAGGTCGTA 1440
CGACAACTGT AATTGGTAAT GCTTTCCCTG ACTCAGAAGT TAAATGCATT ACACCAGCAG 1500
ATATCATTGC TTCAATGAGT TACTTCTTTA ACTTATTAAG CGGTATTGGA TATACAGATG 1560
ATATTGACCA TTTAGGTAAC CGTCGTTTAC GTTCTGTAGG TGAATTACTA CAAAACCAAT 1620
TCCGTATCGG TTTATCAAGA ATGGAAAGAG TTGTACGTGA AAGAATGTCA ATTCAAGATA 1680
CTGAGTCTAT CACACCTCAA CAATTAATTA ATATTCGACC TGTTATTGCA TCTATTAAAG 1740
AATTCTTTGG TAGCTCTCAA TTATCACAAT TCATGGACCA AGCAAACCCA TTAGCTGAGT 1800
TAACGCATAA ACGTCGTCTA TCAGCATTAG GACCTGGTGG TTTAACACGT GAACGTGCTC 1860
AAATGGAAGT ACGTGACGTT CACTACTCTC ACTATGGCCG TATGTGTCCA ATTGAAACAC 1920
CTGAGGGACC AAACATTGGA TTGATTAACT CATTATCAAG TTATGCACGT GTAAATGAAT 1980
TCGGCTTTAT TGAAACACCA TATCGTAAAG TTGATTTAGA TACACATGCT ATCACTGATC 2040
AAATTGACTA TTTAACAGCT GACGAAGAAG ATAGCTATGT TGTAGCACAA GCAAACTCTA 2100
AATTAGATGA AAATGGTCGT TTCATGGATG ATGAAGTTGT ATGTCGTTTC CGTGGTAACA 2160
ATACAGTTAT GGCTAAAGAA AAAATGGATT ATATGGATGT ATCGCCGAAG CAAGTTGTTT 2220
CAGCAGCGAC AgcATGTATT CCATTCTTAG AAAATGATGA CTCAAACCGT GCATTGATGG 2280
CAGGTATGGA ACACGTTGCA GCACGTGATT CTGGTGCGGC TATTACAGCT AAGCACAGAG 2400
GTCGTGTTGA ACATGTTGAA TCTAATGAAA TTCTTGTTCG TCGTCTAGTT GAAGAGAACG 2460
GCGTTGAGCA TGAAGGTGAA TTAGATCGCT ATCCATTAGC TAAATTTAAA CGTTCAAACT 2520
CAGGTACATG TTACAACCAA CGTCCAATCG TTGCAGTTGG AGATGTTGTT GAGTATAACG 2580
AGATTTTAGC AGATGGACCA TCTATGGAAT TAGGAGAAAT GGCATTAGGT AGAAACGTAG 2640
TAGTTGGTTT CATGACTTGG GACGGTTACA ACTATGAGGA TGCCGTTATC ATGAGTGAAA 2700
GACTTGTGAA AGATGACGTG TATACTTCTA TTCATATTGA AGAGTATGAA TCAGAAGCAC 2760
GTGATACTAA GTTAGGACCT GAAGAAATCA CAAGAGATAT TCCTAATGTT TCTGAAAGTG 2820
CACTTAAGAA CTTAGACGAT CGTGGTATCG TTTATATTGG TGCAGAAGTA AAAGATGGAG 2880
ATATTTTAGT TGGTAAAGTA ACGCCTAAAG GTGTAACTGA GTTAACTGCC GAAGAAAGAT 2940
TGTTACATGC AATCTTTGGT GAAAAAGCAC GTGAAGTTAG AGATACTTCA TTACGTGTAC 3000
CTCACGGCGC TGGCGGTATC GTTCTTGATG TAAAAGTATT CAATCGTGAA GAAGGCGACG 3060
ATACATTATC ACCTGGTGTA AACCAATTAG TACGTGTATA TATCGTTCAA AAACGTAAAA 3120
TTCATGTTGG TGATAAGATG TGTGGTCGAC ATGGTAACAA AGGTGTCATT TCTAAGATTG 3180
TTCCTGAAGA AGATATGCCT TACTTACCAG ATGGACGTCC GATCGATATC ATGTTAAATC 3240
CTCTTGGTGT ACCATCTCGT ATGAACATCG GACAAGTATT AGAGCTACAC TTAGGTATGG 3300
CTGCTAAAAA TCTTGGTATT CACGTTGCAT CACCAGTATT TGACGGTGCA AACGATGACG 3360
ATGTATGGTC AACAATTGAA GAAGCTGGTA TGGCTCGTGA TGGTAAAACT GTACTTTATG 3420
ATGGACGTAC AGGTGAACCA TTCGATAACC GTATTTCAGT AGGTGTAATG TACATGTTGA 3480
AACTTGCGCA CATGGTTGAT GATAAATTAC ATGCGCGTTC AACAGGACCA TATTCACTTG 3540
tTACACAACA ACCACTTGGC GGTAAAGCGC AATTGGGTGG AGAAGGTTTT GGIGAGATGG 3600
AGGTATGGGC ACTTGAAGCA TATGGTGCTG CATACACATT ACAAGAAATC TTAACTTACA 3660
AATCCGATGA TACAGTAGGA CGTGTGAAAA CATACGAGGC TATTGTTAAA GGTGAAAACA 3720
TCTCTAGACC AAGTGTTCCA GAATCATTCC GAGTATTGAT GAAAGAATTA CAAAGTTTAG 3780
GTTTAGATGT AAAAGTTATG GATGAGCAAG ATAATGAAAT CGAAATGACA GACGTTGATG 3840
ACGATGATGT TGTAGAACGC AAAGTAGATT TACAACAAAA TGATGCTCCT GAAACACAAA 3900
AAGAAGTTAC TGATTAATAC GCAATTTACA AAACAGGCAA AAAGATACTA AGCTGAATTT 3960
TATTGATGAT TCAGTTTAGT ACTTTAAGCC ATTTTAAATA AATGCAAATC AATCAAATAG 4020
CACAGCTAAT CTAAATTGAA GGAGGTAGGC TCCTTGATTG ATGTAAATAA TTTCCATTAT 4080
AAACCTGAAA CAATCAACTA CCGTACATTA AAACCTGAAA AAGATGGTCT ATTCTGTGAA 4200
AGAATTTTCG GACCTACAAA AGACTGGGAA TGTAGTTGTG GTAAATACAA ACGTGTTCGC 4260
TACAAAGGCA TGGTCTGTGA CAGATGTGGA GTTGAAGTAA CTAAATCTAA AGTACGTCGT 4320
GAAAGAATGG GTCACATTGA ACTTGCTGCT CCAGTTTCTC ACATTTGGTA TTTCAAAGGT 4380
ATACCAAGTC GTATGGGATT ATTACTTGAC ATGTCACCAA GAGCATTAGA AGAAGTTATT 4440
TACTTTGCTT CTTATGTTGT TGTAGATCCA GGTCCAACTG GTTTAGAAAA GAAAACTTTA 4500
TTATCTGAAG CTGAATTCAG AGATTATTAT GATAAATACC CAGGTCAATT CGTTGCAAAA 4560
ATGGGTGCAG AAGGTATTAA AGATTTACTT GAAGAGATTG ATCTTGACGA AGAACTTAAA 4620
TTGTTACGCG ATGAGTTGGA ATCAGCTACT GGTCAAAGAC TTACTCGTGC AATTAAACGT 4680
TTAGAAGTTG TTGAATCATT CCGTAATTCA GGTAACAAAC CTTCATGGAT GATTTTAGAT 4740
GTACTTCCAA TCATCCCACC AGAAATTCGT CCAATGGTTC AATTAGATGG TGGACGATTT 4800
GCAACAAGTG ACTTAAACGA CTTATACCGT CGTGTAATTA ATCGAAATAA TCGTTTGAAA 4860
CGTTTATTAG ATTTAGGTGC ACCTGGTATC ATCGTTCAAA ACGAAAAACG TATGTTACAA 4920
GAAGCCGTTG ACGCTTTAAT TGATAATGGT CGTCGTGGTC GTCCAGTTAC TGGCCCAGGT 4980
AACCGTCCAT TAAAATCTTT ATCTCATATG TTAAAAGGTA AACAAGGTCG TTTCCGTCAA 5040
AACTTACTTG GTAAACGTGT TGACTATTCA GGACGTTCAG TTATTGCAGT AGGTCCAAGC 5100
TTGAAAATGT ACCAATGTGG TTTACCAAAA GAAATGGCAC TTGAACTATT TAAACCATTC 5160
GTAATGAAAG AATTAGTTCA ACGTGAAATT GCAACTAACA TTAAAAATGC GAAGAGTAAA 5220
ATCGAACGTA TGGATCATGA AGTTTGGGAC GTATTGGAAG AAGTAATTAG AGAACATCCT 5280
GTATTACTTA ACCGTGCACC AACACTTCAT AGACTTGGTA TTCAAGCATT TGAACCAACT 5340
TTAGTTGAAG GTCGTGCGAT TCGTCTACAT CCACTTGTAA CAACAGCTTA TAACGCTGAC 5400
TTTdACGGTG ACCAAATGGC GGTTCACGTT CCTTTATCAA AAGAGGCACA AGCTGAAGCA 5460
AGAATGTTGA TGTTAGCAGC ACAAAACATC TTGAACCCTA AAGATGGTAA ACCTGTAGTT 5520
ACACCATCAC AAGATATGGT ACTTGGTAAC TATTACCTTA CTTTAGAAAG AAAAGATGCA 5580
GTAAATACAG GCGCAATCTT TAATAATACA AATGAAGTAT TAAAAGCATA TGCAAATGGC 5640
TTTGTACATT TACACACTAG AATTGGTGTA CATGCAAGTT CGTTCAATAA TCCAACATTT 5700
ACTGAAGAAC AAAACAAAAA GATTCTTGCT ACGTCAGTAG GTAAAATTAT ATTCAATGAA 5760
ATCATTCCAG ATTCATTTGC TTATATTAAT GAACCTACGC AAGAAAACTT AGAAAGAAAG 5820
ACACCAAACA GATATTTCAT CGATCCTACA ACTTTAGGTG AAGGTGGATT AAAAGAATAC 5880
GAAGTATTCA ACAGATTTAG CATCACTGAT ACATCAATGA TGTTAGACCG TATGAAAGAC 6000
TTAGGATTCA AATTCTCATC TAAAGCTGGT ATTACAGTAG GTGTTGCTGA TATCGTAGTA 6060
TTACCTGATA AGCAACAAAT ACTTGATGAG CATGAAAAAT TAGTCGACAG AATTACAAAA 6120
CAATTCAACC GTGGTTTAAT CACTGAAGAA GAAAGATATA ATGCAGTTGT TGAAATTTGG 6180
ACAGATGCAA AAGATCAAAT TCAAGGTGAA TTGATGCAAT CACTTGATAA AACTAACCCA 6240
ATCTTCATGA TGAGTGATTC AGGTGCCCGT GGTAACGCAT CTAACTTTAC ACAGTTAGCA 6300
GGTATGCGTG GATTGATGGC CGCACCATCT GGTAAGATTA TCGAATTACC AATCACATCT 6360
TCATTCCGTG AAGGTTTAAC AGTACTTGAA TACTTCATCT CAACTCACGG TGCACGTAAA 6420
GGTCTTGCCG ATACAGCACT TAAAACAGCT GACTCAGGAT ATCTTACTCG TCGTCTTGTT 6480
GACGTGGCAC AAGATGTTAT TGTTCGTGAA GAAGACTGTG GTACTGATAG AGGTTTATTA 6540
GTTTCTGATA TTAAAGAAGG TACAGAAATG ATTGAACCAT TTATCGAACG TATTGAAGGT 6600
CGTTATTCTA AAGAAACAAT TCGTCATCCT GAAACTGATG AAATAATCAT TCGTCCTGAT 6660
GAATTAATTA CACCTGAAAT TGCTAAGAAA ATTACAGATG CTGGTATTGA ACAAATGTAT 6720
ATTCGCTCAG CATTTACTTG TAACGCACGA CATGGTGTTT GTGAAAAATG TTACGGTAAA 6780
AACCTTGCTA CTGGTGAAAA AGTTGAAGTT GGTGAAGCAG TTGGTACAAT TGCAGCCCAA 6840
TCTATCGGTG AACCAGGTAC ACAGCTTACA ATGCGTACAT TCCATACAGG TGGGGTAGCA 6900
GGTAGCGATA TCACACAAGG TCTTCCTCGT ATTCAAGAGA TTTTCGAAGC ACGTAACCcT 6960
AAAGGTCAAG CGGTAATTAC GGAAATCGAA GGTGTCGTAG AAGATATTAA ATTAGCAAAA 7020
GATAGACAAC AAGAAATTGT TGTTAAAGGT GCTAATGAAA CAAGATCATA CCTTGCTTCA 7080
GGTACTTCAA GAATTATTGT AGAAATCGGT CAACCAGTTC AACGTGGTGA AGTATTAACT 7140
GAAGGTTCTA TTGAACCTAA GAATTAeTTA TCTGTTGCTG GATTAAACGC GACTGAAAGC 7200
TACTTATTAA AAGAAGTACA AAAAGTTTAC CGTATGCAAG GTGTAGAAAT CGACGATAAA 7260
CACGTTGAGG TTATGGTTCG ACAAATGTTA CGTAAAGTTA GAATTATCGA AGCAGGTGAT 7320
ACGAAGTTAT TACCAGGTTC ATTAGTTGAT ATTCATAACT TTACAGATGC AAATAGAGAA 7380
GCATTTAAAC ACCGTAAGCG TCCTGCAACA GCTAAACCAG TATTACTTGG TATTACTAAA 7440
GCATCACTTG AAACAGAAAG TTTCTTATCT GCAGCATCAT TCCAAGAAAC AACAAGAGTT 7500
CTTACAGATG CAGCAATTAA AGGTAAGCGT GATGACTTAT TAGGTCTTAA AGAAAACGTA 7560
ATTATTGGTA AGTTAATTCC AGCTGGTACT GGTATGAGAC GTTATAGCGA CGTAAAATAC 7620
GAAAAAACAG CTAAACCAGT TGCAGAAGTT GAATCTCAAA CTGAAGTAAC GGAATAACAA 7680
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ATGTTGACGA 
GGAGGATAAA 
TGGTCTTAAA 
TGAAGACGTT 
ACCTGTATCT 
TGCGACAATA 
TTTAACCTAA 
ATCACATGCC 
CAGATTCTCC 
ACTCACCACA 
ACTCAGCGTT 
ACATCCCTGG 
GTGTAAAAGA 
GTGTTGACGG 
AATTTAGTTT 
CATTATGCCT 
CTCTAAGTTA 
ACAAAGAATT 
GAAGTATTCG 
GTAGGTGGTT 
GGTTTACGTT 
TTAGCTAACG 
GACACTCACA 
AGCTTTTACC 
CATCGCCATA 
CTAGAGAATT 
GTAAAACGAC 
AAaCACACGA 
CTATCACATC 



ATTCTCTTGT 
TTATTGTCTA 
GAAACGCTTA 
GAAGTATATT 
TTTTTCAAAA 
GTAGCATTGA 
AAATGAACCA 
AACTATTAAC 
AGCTTTAAAT 
AAAACGTGGT 
ACGTAAATAT 
TATCGGACAT 
CTTACCAGGT 
ACGTAGACAA 
TTAATTAAAT 
CGTAAAGGAT 
GTAACTAAAT 
CTTTATTCAG 
AAGAAGCAAT 
CTAACTATCA 
GGTTAGTTAA 
AAATTTTAGA 
AAATGGCTGA 
CTGAGTGTGT 
TCTATCGTAT 
TTCATTAGAA 
TACGACTGAA 
AGGTGCTTCA 
TGCTGCAACA 



TCAATGTTAA 
AGGAAAAAGT 
AAGCGTTAAA 
TAATGACTCG 
GCAAACATGC 
TTAAATGAGA 
CCTGGATGTG 
CAATTAGTAC 
AAAGGTTTCA 
GTATGTACTC 
GCACGTGTGc 
AACTTACAAG 
GTGCGTTACC 
GGTCGTTCAT 
CTTAAACTTA 
CAGTACCTAA 
TAATTAACAA 
CATTCGACCT 
CAACAACATT 
AGTACCAGTA 
CTATGCACGT 
TG CAGCAAAT 
AGCAAACAAA 
TCTATATTAA 
TTATTCAGTA 
AAAACTCGTA 
CGTATTCTTT 
CAAATGGACT 
ACAGCAGCTT 



TATATTAAAG 
tGCACGCTTT 
GAAAGATCAA 
CGTGTTAAGC 
TTTGGGTAAA 
ATTAGTAAGT 
TGGGATTAAA 
GTAAACCAAG 
ACAGTAAAAA 
GTGTAGGTAC 
gTtTATCAAA 
AACACAGTGT 
ATATTGTACG 
TATACGGAAC 
AAATATTTAA 
AAGAGACGTA 
AATTATGTTA 
AGTTGAACAA 
ATGCCAGTAT 
GAAGTTCGTC 
CTTCGTGGTG 
AATACAGGTG 
GCATTTGCTC 
TGAATTTTCA 
ATATAAACTG 
ATATCGGTAT 
ATTACACTGG 
GGATGGAGCA 
GGGAAGGTCA 



GTTGATGCAA 
AACAAACAAC 
GTTACATCTT 
CAAATCAATC 
CATGTAGGTA 
GTTTTACTTA 
AAGTGAAGAG 
ACAAAGCAAA 
GAAAAAATTT 
AATGACACCT 
CAACATCGAA 
TGTACTTGTA 
TGGAGCACTT 
TAAGAAACCT 
TATAAGGAAG 
TTACCAGATC 
GATGGTAAAC 
CGCAGgtTCG 
TAGAAGTTAA 
CAGAGCGTCG 
AAAAAACGAT 
GTGCCGTTAA 
ACTACCGTTG 
TTAAGCGTTC 
GAAGGAGAAA 
CATGGCTCAC 
CCGTATCCAC 
AGAACAAGAC 
CCGTGTAAAC 



GCAGAACTTT 
ATTTTGTAGT 
TGATTATTGC 
AGAAAAATAT 
TTAACGTCAA 
CTAAATTTTA 
AGGAGGACAT 
ATCAAAAAAT 
ACTGACTTAA 
AAAAAACCTA 
ATT AACG CAT 
CGTGGTGGAC 
GATACTTCAG 
AAAAACTAAG 
GGAGGATTTA 
CAATTCATAA 
GTGGAACAGC 
TGATGCATTA 
AGCTCGTCGC 
TACTACTTTA 
GGAAGATCGT 
GAAACGTGAG 
GTAAGATAAA 
ATGCTTAGGG 
AAATACATGG 
ATTGATGCTG 
AArGknGGTG 
CGTGGTATTA 
ATTATCGATA 
CAGTTACAGT ACTTGATGCA CAATCAGGTG TTGAACCTCA AACTGAAACA GTTTGGCGTC 9600
AGGCTACAAC TTATGGTGTT CCACGTATCG TATTTGTAAA CAAAATGGAC AAATTAGGTG 9660
CTAACTTCGA ATACTCTGTA AGTACATTAC ATGATCGTTT ACAAgCTAAC GCTGCTCCAA 9720
TCCAATTACC AATTGGTGCG GAAGACGAAT TCGAAGCAAT CATTGACTTA GTTGAAATGA 9780
AATGTTTCAA ATATACAAAT GATTTAGGTA CTGAAATTGA AGAAATTGAA ATTCCTGAAG 9840
ACCACTTAGA TAGAGCTGAA GAAGCTCGTG CTAGCTTAAT CGAAGCAGTT GCAGAAACTA 9900
GCGACGAATT AATGGAAAAA TATCTTGGTG ACGAAGAAAT TTCAGTTTCT GAATTAAAAG 9960
AAGCTATCCG CCAAGCTaCt AcTAACGTAG AATTCTACCC AGTACTTTGT GGTACAGCTT 10020
TCAAAAACAA AGGTGTTCAA TTAATGCTTG ACGCTGTAAT TGATTACTTA CCTTCACCAC 10080
TAGACGTTAA ACCAATTATT GGTCACCGTG CTAGCAACCC TGAAGAAGAA GTAATCGCGA 10140
AAGCAGACGA TTCAGCTGAA TTCGCTGCAT TAGCGTTCAA AGTTATGACT GACCCTTATG 10200
TTGGTAAATT AACATTCTTC CGTGTGTATT CAGGTACAAT GACATCTGGT TCATACGTTA 10260
AGAACTCTAC TAAAGGTAAA CGTGAACGTG TAGGTCGTTT ATTACAAATG CACGCTAACT 10320
CACGTCAAGA AATCGATACT GTATACTCTG GAGATATCGC TGCTGCGGTA GGTCTTAAAG 10380
ATACAGGTAC TGGTGATACT TTATGTGGTG AGAAAAATGA CATTATCTTG GAATCAATGG 10440
AATTCCCAGA GCCAGTTATT CACTTATCAG TAGAGCCAAA ATCTAAAGCT GACCAAGATA 10500
AAATGACTCA AGCTTTAGTT AAATTACAAG AAGAAGACCC AACATTCCAT GCACACACTG 10560
ACGAAGAAAC TGGACAAGTT ATCATCGGTG GTATGGGTGA GCTTCACTTA GACATCTTAG 10620
TAGACCGTAT GAAGAAAGAA TTCAACGTTG AATGTAACGT AGGTGCTCCA ATGGTTTCAT 10680
ATCGTGAAAC ATTCAAATCA TCTGCACAAG TTCAAGGTAA ATTCTCTCGT CAATCTGGTG 10740
GTCGTGGTCA ATACGGTGAT GTTCACATTG AATTCACACC AAAGGAAAGA GGCGCAGGTT 10800
TCGAATTCGA AAACGCTATC GTTGGTGGTG TAGTTCCTCG TGAATACATT CCATCAGTAG 10860
AAGCTGGTCT TAAAGATGCT ATGGAAAATG GTGTTTTAGC AGGTTATCCT TTAATTGATG 10920
TTAAAGCTAA ATTATATGAT GGTTCATACC ATGATGTCGA TTCATCTGAA ATGGCCTTCA 10980
AAATTGCTGC ATCATTAGCA CTTAAAGAAG CTGCTAAAAA ATGTGATCCT GTAATCTTAG 11040
AACCAATGAT GAAAGTAACT ATTGAAATGC CTGAAGAGTA CATGGGTGAT ATCATGGGTG 11100
ACGTAACATC TCGTCGTGGA CGTGTTGATG GTATGGAACC TCGTGGTAAT GCACAAGTTG 11160
TTAATGCTTA TGTACCACTT TCAGAAATGT TCGGTTATGC AACATCATTA CGTTCAAACA 11220
CTCAAGGTCG CGGTACTTAC ACTATGTACT TCGATCACtA TGCTGAAGTT CCaAAATCaA 11280
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GCCTAGGXTA AAAXACAAGG ^ . 
T-GGXGAA tccatcocta ^ 

TATTATTTTT AAXXAGAAXA „! 
XAXAAXGGCA AAAGAAAAAT ^ .AAACAACA, GCCAAXAXCG 

GXACTAXCGG TC a™ CATGGTAAAA CAAGA™. AGCAGCAAXC GCXACXGXAX ,„ 

xaccaaaaaa xggxgagxca gttgcacaat caxaxgagax gax^acaac gcxccagaag „, 

AAAAAGAACG TGGTATCACA AXCAA.ACXX CIm A GXACCAAAGX GACAAAGGXC 

*cxacgcxca cgtxgacxgg ccaggacacg cxgacxacgx atgaoxggxg lle 

CX.™ ««„ AXCXTAGXAG TATCTGCXGC „CCA AXGCCACAAA m 

ctcgtgaaca catxctxtta xcaccxaacc ™ A c. AGCA„AGXA GXAXXCXXAA 11S . 

ACAAAGTXGA CAXGGTXGAC GAXGAAGAAX TATXAGAAXT AGXAGAAATG GAACXXGGXG „„ 

ACTTAXXAAG CGAAXAXGAC XXCGCACXG ACGAXGTACC XGXAAXCGCX GGXXCAGCAX l20( 

TAAAAGCTTT AGAAGGCGAX GCXCAAXACG AAGAAAAAAX GXTAOAAXXA ATGGAAGCXG J. 

XAGAXACTTA CAXXCCAACX CCAGAAOGTG AXXCXGACAA ACCAXXCAXG A^CCAGXXG J, 

AGGACGXAXX CXCAAXCACX GGTCGXtSGTA ^eXAC AGGCCGTGTT GAACGTGGXC 1221 

AAATCAAAGT XGGTGAAGAA GXXOAAAXCA XCGGX^CA XgACACATCX AAAACAACXG 1230 
XXACAGGXGX 

-OCA^ acgxgcxgxx GCXCGXGAAG ACGXAOAACC XGGTCAAGXA XXAGCXGGXG 1242 

CTGGXXCAAX XACACCACAX ACXGAAXXCA AAGCAGAAGX AXACGXAXXA XCAAAAGACG lMt , 

AAGGXGGAOG XCAGACXCCA TTCXXCXCAA AGXAXCGXCC ACAA1TCTAX XXCCGXACXA 12M( 

CTGASGXAAC XGGXGXXGX. CACXXACCAG AAGGXACXGA AAXGGXAA TC CCXGGXGAXA 1260< 

ACGXTGAAAX GACAGXAGAA XXAAXCGCXC CAATCGCGAX XGAAOACOGX ACXGGX^X 12t6t 

-TCCXCA AGGXGGACGT ACPGXAGGAX CAGGCGXTGX XACT^g A^AAAXAAX 1272C 

xtgxaa^xc XTAGAXXXXA XAXAAAAACA AGAXCGCXCA atcgaggggc cxtxxxxxaa 127eo 

XCXCXAAAX. ^XAAX^C XAXXCGAXTT AGAAGAACAA TAAXTGATGA AAGAGXGAGX laM0 

AATAAAACXX axaacxcaxa axacxg™ aaxaaaaxxg xtgagtcxtg GACAXXGXAA l2900 

AATGCXCCCX XCAAAGXTXT CATTXXXXCa ATGXCXACXX XGAAGGGAGC ATTTCAXTAG 12960 
- -AXGXCTC AGAXXCAXAX CX^AA^A AXTXAAATGC XXAAXT^ XXAAAXA OT 
=CTCX«TXC XAXGAXXTXT AAAAATACAG CXACAGCGXA XXXXAATGAT ^TCA^ 
TCAGAAAGAA TGCACCTGGT CGTACTTTCA AATAATGTGA AAAATCTTCT CCAATCATCA 13200
TTAAATCTGA TTCATTAAAG CGTACATGTA AGTCATTTGT TGCTTCTTTA ATAACTTGAT 13260
ATGCTTTCTC GTTATTATGG ACAGGCAAAT ACCCTTTAAT ATAATTCAAA TCATAGTTAA 13320
TATCATTTGC TATTGCTAAA CCTTGTAGAA GCTTATCCAT TTTGTCCATT ACATGATTCT 13380
GTATATCTGA ATCGAAAGTT CTAACTGTAC CTTTACAAAA TGCTTGATCA GGAATAACGC 13440
TATCTGTGGT GCCTGCTTGA ATCATTCCAA ATGAAAGTAC AGCTTGTTTA ACTGGATCGA 13500
TCGTACGTGA AATTATTTTT TGTGCACTTA AAATGAACTC TGCCATGATT ACTATTGGGT 13560
CAATGGTTTC ATGAGGTTTG GCACCATGAC CACCACGACC TTTAAATGTG ACGCTAAATT 13620
CATCTGGAGA GGCCATGATT GCCCCCGCAC GTGAATGAAT AGTTCCAGTA GGATAACCAC 13680
TCCATAAATG TGTACCGTAA ATTCTATCTA CATTTTCCAG ACATCCAGCA TCTATCATTT 13740
CTTGAGAACC ACCTGGCATG ATTTCTTCAC CGTACTGGAA TATTAATACA ACATTACCTT 13800
CTAATAAATG TTTATGTTCA TCTAAAATCT CTGCTACAGT AAGTAAAATT GCTGTATGAC 13860
CATCATGCCC ACACGCATGC ATACATCCTG GATTTTTAGA CTTATAAGGC ACATCGTTTA 13920
ATTCCTCGAC AGGTAACGCA TCAAAGTCAG CTCTTAATGC AATGGTAGGT CCTGTGCCCA 13980
AGCCTTTAAA TGTGGCTTTG ATACCATTGC GGCCGATAGG AGTTTCAATA TCACAAGATA 14040
ACTGGCTTAA TTGGTTAACA ATATAATCAT GTGTTTGAAA TTCTTCAAAA GATAACTCAG 14100
GATATTGGTG TAAATAACGT CTGAGTTGAA TTGTTTTATT TTCTTTATTA TTTGCTAGTT 14160
GGAACCAATC TAACACCCTT ATCACTACTT TCTAAAATAA TGTTTATAGT ATAACATTTT 14220
ATGAAATTAT CGTACTAAAT GATTGCTTTG AGATATTTTA TCTATGAATG ATAAGGCTTT 14280
CAAGTTATGT AGAATTACTG TATGATAAAG GTATTACCAA ACAATACTTA AGGGGGATTA 14340
TMACTGTGG" TTCAATCATT ACATGAGTTT TTAGAGGAAA ATATAAATTA TCTAAAAGAA 14400
AATGGTTTGT ATAATGAAAT AGATACAATT GAAGGTGCAA ACGGACCAGA AATCAAAATC 14460
AATGGGAAAT CATACATTAA CTTATCTTCA AATAATTATT TAGGACTAGC AACAAATGAA 14520
GATTTGAAAT CaGctGCAAA AGCAGCTATT GATACACATG GTGTAGGTGC AGGCGCTGTT 14580
CGTACAATCA ATGGTACATT AGATTTACAC GAPGAATTAG A. AG ft. A Af*A PT ft ^2.1^ ft ft ft ft '1 *• I *■ t * 14640
AAAGGAACAG AAGCTGCAAT AGCTTATCAA TCAGGATTTA ATTGTAATAT GGCTGCTATT 14700
TCAGCTGTCA TGAATAAAAA TGATGCTATT TTATCAGATG AGCTTAATCA TGCATCAATT 14760
ATTGATGGAT GTCGCTTATC TAAAGCTAAA ATTATTCGAG TTAACCATTC AGACATGGAT 14820
GATTTACGTG CGAAAGCAAA AGAAGCAGTT GAATCAGGTC AATACAATAA AGTGATGTAT 14880
ATTGCAGAAG AATTTGGTTT ATTAACTTAT GTTGACGACG CTCATGGTTC AGGTGTTATG 15000
GGTAAAGGCG CTGGTACGGT TAAACATTTT GGTTTACAAG ATAAAATCGA TTTCCAAATA 15060
GGTACGCTTT CTAAAGCAAT TGGTGTCGTT GGCGGTTATG TAGCAGGTAC AAAAGAGTTA 15120
ATAGATTGGT TAAAAGCACA ATCACGACCA TTCTTATTCT CTACATCATT AGCACCTGGG 15180
GATACCAAAG CAATAACTGA AGCAGTTAAA AAGTTAATGG ATTCAACTGA ATTACATGAT 15240
AAATTATGGA ACAATGCACA ATATTTAAAA AATGGATTGT CAAAATTAGG ATATGATACA 15300
GGTGAGTCAG AAACTCCAAT TACACCAGTA ATTATTGGTG ATGAAAAAAC AACTCAAGAA 15360
TTTAGTAAGC GTTTAAAAGA CGAAGGTGTC TATGTGAAAT CTATCGTTTT CCCAACAGTA 15420
CCAAGAGGTA CAGGACGTGT AAGAAATATG CACATACAAA AGACATGTTA 15480
GATGAAGCAA TTGCGGCTTA TGAAAAAGTA GGAAAAGAAA TGAAGTTGAT TTAATATTTA 15540
TTTATTCCCA CGGCAAATAT TGTCGTGGGC TTTTTTTAAT GTTTAGTTTA TTAACAGT 15598




(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

AAGTAAATCA ACTTACTGGG ATAAGAATAA AGGCGATTAT AGTAACAAGT TGATTTTATT 60
CGAAAAACAT TTTGAACCGG TTCTGGGTAT CAAGATGCAA CATAGTGGAG GTCATAGCTT 120
TGGCCACACG ATTATTACGA TTGAAAGTCA AGGAGATAAA GCAGTTCATA TGGGTGATAT 180
ATTCCCAACT ACTGCACATA AAAATCCTCT ATGGGTAACG GCATATGATG ATTATCCTAT 240
GCAATCGATT CGTGAAAAAG AACGCATGAT ACCATATTTT ATTCAGCAAC AATATTGGTT 300
CTTGTTTTAT CATGATGAAA ACTACTTTGC TGTAAAATAC AGCGATAATG GTGAAAACAT 360
AGATGCATAT ATTTTACGTG AAACATTAGT TGATAATAAC TAAAATAAAG ATGTATTACT 420
AAACAAATTT TCAAAAATAA AAAATTGAGC CACATCCAAT CTTACTAATT AGGGTGTGGC 480
TCATTTTTAA GTTTTACgAT CCAAATCAAA TATGGaTAAA ATTCgTATTA ACGCTCTACa 540
ATGtTAATGA CTTCACCAGT ATATGCATCT GCATAAAAAT CATAATGAAT ATTTTGACCA 600
tttitaatag TTGTAATTCC ACCTTGATAA ACTAAACGGT ATTTATCAGT TTCAGGATGA 660
A 661
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GCAGACGGTA CAGCAGTTAA AGTCGCACCA AaACTGTAGT GAATcTAATC GGTGcATTCT 60
TTTTAGGATT AGTTGTCGCG CTTATATATA TCTTCTTCAA AGTAATTTTC GATAAGCGAA 120
TTAAAGATGA AGAAGATGTA GAGAAAGAAT TAGGATTGCC TGTATTGGGT TCAATTCAAA 180
AATTTAATTA AGGATGGTTG CTACTTATGT CAAAAAAGGA AAATACGACA ACAACACTAT 240
TTGTATATGA AAAACCAAAA TCAACAATTA GTGAAAAGTT TCGAGGTATA CGTTCAAACA 300
TCATGTTTTC AAAAGCAAAT GGTGAAGTAA AGCGCTTATT GGTTACTTCT GAAAAGCCTG 360
GTGCAGGTAA AAGTACAGTT GTATCGAATG TAGCGATTAC TTATGCACAA GCAGGCTATA 420
AGACATTAGT TATTGATGGC GATATGCGTA AgcCAACACA AAACTATATT TTTAATGAGC 480
AAAATAATAA TGGACTATCA AGCTTAATCA TTGGTCGAAC GACTATGTCA GAAGCAATTA 540
CGTCGACAGA AATTGAAAAT TTAGATTTGC TAACAGCTGG CCCTGTACCT CCAAATCCAT 600
CTGAGTTAAT TGGGTCTGAA AGGTTCAAAG AATTAGTTGA TCTGTTTAAT AAACGTTACG 660
ACATTATTAT TGTCGATACA CCGCCAGTTA ATACTGTGAC TGATGCACAA CTATATGCGC 720
GTGCTATTAA AGATAGTCTG TTAGTAATTG ATAGTGAAAA AAATGATAAr AATGAAGTTA 780
AAAAAGCAAA AGCACTTATG GAAAAAGCAG GCAGTAACAT TCTAGGTGTC ATTTTGAACA 840
AGACAAAGGT CGATAAATCT TCTAGTTATT ATCACTATTA TGGAGATGAA TAAGTATGAT 900
TGATATTCAT AACCATATAT TGCCTAATAT CGATGACGGT CCGACAAATG AAACAGAGAT 960
GATGGATCTT TTAAAACAAG CGACAACACA AGGTGTTACA GAAATCATTG TAACATCACA 1020
TCACTTACAT CCTCGATATA CCACACCTAT AGAAAAAGTG AAATCATGTT TAAACCATAT 1080
TGAAAGCTTA GAGGAAGTAC AAGCACTAAA TCTAAAGTTT TATTATGGTC AGGAAATAAG 1140
AATTACCGAT CAAATCCTTA ATGATATTGA TCGAAAAGTT ATTAACGGTA TTAATGATTC 1200
ACGCTATTTA CTAATAGAAT TTCCATCAAA TGAAGTTCCA CACTATACTG ATCAATTATt 1260
TTTCGAATtA CAGAGTAAAG GCTTTGTACC GATTATTGCA CATCCAGAGC GGAATAAAGC 1320
AATAAGTCAA AACCTTGACA TACTATACGA TTTAATTAAC AAAGGTGCTT TAAGTCAAGT 1380
GACAACGGcG TCATTAGCGG GTATTTCCGG TAAAAAAATT AGAAAATTAG CAATTCAAAT 1440
GTTCTTAATG AAAGACTTAT TTAATGATAA GAAATTACGT GATTATTATG AAGATATGAA 1560
CGGATTTATT AGTAATGCGA AGTTAGTTGT TGATGATAAA AAAATTCCTA AACGAATGCC 1620
ACAACAAGAT TATAAACAGA AAAGATGGTT TGGGTTATAA ACAGCAAATG AGGGGTTTTA 1680
TGGCACATTT ATCTGTGAAA TTGCGGCTTT TAATACTAGC ATTAATCGAT TCACTGATAG 1740
TGACATTTTC AGTATTCGTA AGTTATTACA TTTTAGAACC GTATTTCAAA ACATATTCTG 1800
TCAAATTATT AATATTGGCA GCTATATCAC TATTCATATC GCATCATATT TCaGCATTTA 1860
TTTTTAATAT GTATCATCGA GCGTGGGAAT ATGCCAGTGT GAGTGAATTG ATTTTAATTG 1920
TTAAAGCTGT GACGACATCT ATCGTTATTA CGATGGTGGT CGTGACAATT GTTACAGGCA 1980
ATAGACCGTT TTTTAGATTG TATTTAATTA CTTGGATGAT GCACTTGATT TTAATAGGTG 2040
GCTCAAGGTT ATTTTGGCGT ATTTATCGGA AATACCTTGG AGGTAAGTCA TTTAATAAGA 2100
AGCCAACTTT AGTTGTTGGT GCTGGTCAAG CAGGTTCAAT GCTGATTAGA CAAATGTTGA 2160
AAAGTGACGA AATGAAACTT GAACCGGTAT TAGCAGTCGA TGATGACGAA CATAAACGCA 2220
ATATCACAAT TACTGAGGGT GTAAAAGTCC AAGGTAAAAT TGCGGATATT CCAGAACTAG 2280
TGAGGAAATA TAAGATTAAA AAAATCATCA TTGCAATTCC AACTATTGGT CAAGAGCGTT 2340
TGAAAGAAAT TAATAATATT TGCCATATGG ATGGCGTTGA GTTATTGAAA ATGCCAAATA 2400
TAGAAGACGT CATGTCTGGT GAGTTAGAAG TGAACCAACT TAAAAAAGTT GAAGTAGAAG 2460
ATTTACTAGG CAGAGATCCT GTTGAATTAG ATATGGATAT GATATCAAAT GAATTGACGA 2520
ATAAAACTAT TTTAGTTACG GGTGCAGGTG GTTCAATAGG ATCAGAAATT TGTAGACAAG 2580
TTTGTAATTT CTATCCAGAA CGTATTATTC TACTTGGCCA TGGTGAAAAC AGTATTTATT 2640
TAATCAATCG TGAATTGCGA AATCGCTTCG GwAAAAATGT TGATATCGTT CCTATTATAG 2700
CGGAfeTGCA AAATAGAGCG CGTATGTTTG AAATTATGGA AACGTATAAA CCATACGCAG 2760
TTTATCATGC AGCAGCACAC AAGCACGTGC CGTTAATGGA AGACAACCCT GAAGAAGCAG 2820
TACGTAATAA TATTTTAGGT ACGAAAAATA CTGCTGAAGC TGCTAAAAAT GCAGAGGTAA 2880
AGAAATTCGT TATGATTTCT ACGGATAAAG CCGTTAATCC GCCTAATGTC ATGGGAGCTT 2940
CAAAGCGAAT TGCAGAAATG ATTATTCAAA GTTTAAATGA TGAAACGCAT CGAACAAATT 3000
TTGTTGCAGT GAGATTTGGT AATGTACTTG GATCGAGAGG ATCTGTGATT CCACTTTTCA 3060
AAAGTCAAAT TGAAGAAGGT GGGCCAGTTA CTGTGACACA TCCTGAAATG ACACGTTACT 3120
TTATGACAAT TCCTGAAGCT TCTAGACTAG TTTTGCAGGC AGGGGCATTA GCAGAAGGTG 3180
GCGAAGTATT TGTGCTAGAT ATGGGAGAAC CAGTGAAAAT TGTAGATTTG GCACGTAATT 3240
CCGGCGAAAA AATGTTTGAA GAGCTTATGA ATAAAGATGA GGTTCATCCT GAACAAGTAT 3360
TTGAAAAAAT TTATCGTGGC AAAGTACAAC ATATGAAATG TAATGAAGTT GAAGCGATTA 3420
TTCAAGACAT CGTCAATGAC TTTAGTAAAG AAAAAATTAT TAACTATGCC AATGGCAAAA 3480
AGGGAGATAA TTATGTTCGA TGACAAAATT TTATTAATTA CTGGGGGCAC AGGATCATTC 3540
GGTAATGCTG TTATGAAACA GTTTTTAGAT TCTAATATTA AAGAAATTCG TATTTTTTCA 3600
CGCGATGAGA AAAAACAAGA TGACATTCGA AAAAAATATA ATAATTCAAA ATTAAAGTTC 3660
TACATTGGTG ATGTGCGTGA TAGTCAAAGT GTAGAAACAG CAATGCGAGA TGTTGATTAC 3720
GTATTCCATG CAGCAGCTTT AAAACAAGTG CCGTCATGTG AATTCTTTCC AGTTGAGGCA 3780
GTGAAGACAA ATATTATTGG TACAGAAAAT GTCTTACAAA GTGCTATTCA TCAAAATGTT 3840
AAAAAAGTCA TATGTTTATC TACAGATAAG GCAGCGTATC CTATTAATGC TAGGGGTATT 3900
TCAAAAGCAA TGATGGAAAA AGTATTCGTA GCCAAATCAA GAAATATTCG TAGTGAACAA 3960
ACGCTTATTT GTGGTACAAG ATACGGTAAT GTGATGGCTT CAAGAGGATC AGTAATACCT 4020
TTGTTTATCG ACAAAATCAA AGCTGGAGAA CCTTTAACGA TTACAGATCC TGATATGACA 4080
AGATTTTTAA TGAGCTTAGA AGATGCGGTA GAACTAGTTG TTCATGCATT TAAGCATGCA 4140
GAGACAGGAG ATATTATGGT TCAAAAAGCA CCAAGCTCAA CGGTAGGGGA TCTTGCGACC 4200
GCATTATTAG AATTGTTTGA AGCTGATAAT GCAATTGAAA TCATTGGTAC GCGACATGGA 4260
GAGAAAAAAG CAGAAACATT GTTGACGAGA GAAGAATACG CACAATGTGA AGATATGGGT 4320
GATTATTTTA GAGTGCCGGC AGACTCCAGA GATTTAAATT ATAGTAATTA TGTTGAAACC 4380
GGTAACGAAA AGATTACGCA ATCTTATGAA TATAACTCCG ATAATACACA TATTTTAACG 4440
GTGGAAGAGA TAAAAGAAAA ACTTTTAACA CTAGAATATG TTAGAAACGA ATTGAATGAT 4500
TMwiAGCTT CAATGAGATA GGAGAGATTG ACGTTGAATA TTGTAATTAG AGGAGCAAAA 4560
GGTTTTGTAG GAAAAAACTT GAAAGCAGAT TTAACTTCAA CGACAGATCA TCATATTTTC 4620
GAAGTACATC GACAAACTAA AGAGGAAGAA TTAGAGTCAG CATTGTTGAA AGCAGACTTT 4680
GTCGTGCATT TAGCGGGTGT TAATCGACCT GAACATGACA AAGAATTCAG CTTAGGAAAC 4740
GTGAGTTATT TAGATCATGT ACTTGATATA TTAACTAGAA ATACGAAAAA GCCAGCGATA 4800
TTATTATCGT CTTCAATACA AGCAACACAA GATAATCCTT ATGGTGAGAG TAAGTTGCAA 4860
GGGGAACAGC TATTAAGAGA GTATGCCGAA GAGTATGGCA ATACGGTTTA TATTTATCGC 4920
TGGCCAAATT TATTCGGCAA GTGGTGTAAG CCGAATTATA ACTCAGTGAT AGCAACATTT 4980
TGTTACAAAA TTGCACGTAA CGAAGAGATT CAAGTTAATG ATCGGAATGT TGAACTAACG 5040
ATTGAAAATG GTGTACCTAC AGTACCAAAC GTATTTAAAG TGACATTGGG AGAAATTGTA 5160
GATTTATTAT ACAAGTTCAA ACAGTCACGT CTCGATCGAA CATTGCCGAA ATTAGATAAC 5220
TTGTTTGAAA AAGATTTGTA TAGTACGTAT TTAAGCTATC TACCTAGTAC aGACTTTAGT 5280
TAyCCCTTAC TTATGAATGT GGATGATAGG GGTTCTTTTA CAGAATTTAT AAAAACACCG 5340
GATCGTGGTC AAGTTTCTGT AAATATTTCT AAACCAGGTA TTACTAAAGG TAATCACTGG 5400
CATCATACTA AAAACGAAAA ATTTCTAGTC GTATCAGGTA AAGGGGTAAT TCGTTTTAGA 5460
CATGTTAATG ATGATGAAAT CATTGAATAT TATGTTTCTG GCGACAAATT AGAAGTTGTA 5520
GACATACCAG TAGGATACAC ACATAATATT GAAAATTTAG GCGACACAGA TATGGTAACT 5580
ATTATGTGGG TGAATGAAAT GTTTGATCCA AATCAGCCAG ATACGTATTT CTTGGAGGTA 5640
TAGCGCATGG aAAAACTGAA rTTAATGACA ATAGTTGGTA CAAGGCCTGA AATCATTCGT 5700
TTATCATCAA CGATTAAAGC ATGTGATCAA TATtTTAA 5738

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9062 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

ATCATCAACA AGAATGATAT TTTTCCCATC TACTATATCT TTTACCGCAG ATAACTTCAC 60
TCTCACACCT TGCTCACGTA ATTCTTGAGT TGGTTGAATA AATGTTCTTG CAACATATTG 120
ATTTTTAACT AGTCCCATTT CATATGGCAA ACCTATTTCT TCAGCATAAC CACTCGCAGC 180
TGATAGCGAT gAATTGGGTA CACCGATGAC CATATCAGCA TTTACAGGGC TTTCTTGGGC 240
TAATTTTTTA CCAGAAGCTT TACGTACTGC ATGGACATTT TTACCAGCTA TTGTTGAGTC 300
TGGTCTAGCA AAATAAATAT ATTCCATCGC AGAAATTGCA GTTGTCGTAT GATGTGTATA 360
AGATTTAACT GTAATACCTT TATCGTTAAT CACGACATAT TCACCTGCAT GAATATCTTG 420
AACAAATTCT GCACCTAACA CATCTATTGC ACATGTTTCA CTTGCAAGGA TGTATGTCCC 480
ATCTTTCATT TTACCTACAA CAAGTGGTCT GATAGCATTT GGATCTACTG CGCCATATAA 540
CGCATCTTTA GTTAAAATCG CAAATGTAAA ACCGCCTTTA ACTTTTCGCA AACTTTCTTT 600
CAACGCTTCC TCAAAAGTAG GAGCTTTACT TCGACGTATC AAATGCATAA TGACTTCAGT 660
ATCAGAAGAC GAATGGAAGA TAGCACCTTG TTTTTCTAAA TTCTGACGCA ATGATTTAGC 720
CGGTTGAATA TTTTCAATAC CTTTATTACC TGAAGTAGCA TAACGGACGT GACCAATTGC 840
ATGTTGATAT CCTTTTAATC GTTCCATTTG ATCATCTTTA ATCGCTTCAG TTAGTAAGCC 900
TAATCCTCGC TCGCCTTTTA ATTCATTTTG ATCAGAAACA ACTATACCTG CACCTTCTTG 960
ACCACGATGT TGCAAACTAT GAAGTCCCAT ATAtGTTAGT TGCGCTGCtT CaGGATGATT 1020
CCAAATACCA AACACGCCAC ATTCTTCGTT TAATCCTGAG TAGTTAAACA TTGaGCAATT 1080
GCCCCtTCCC ATATTTGTTT AATATCTGAA ACATTTTCAC TAATCTCTGT aTATGGTGTT 1140
GTTACCTTGr aATTATCACT ATCTGTTAAA AGTCCAATTT CTATTGCATT ATCAATATTT 1200
AAAGTTTTAC CTGATTTAAC AGAAACAACA TATCGGCCTT GCGTCTCACT AAACAATTGT 1260
GCATTTGTTA TATCTATTGA AGATTTTAAT CCTAAACCGT AATGCGCACT TAGTTTAGCT 1320
AAGGTAATCA GTAAGCCACC TTTACCAACT GTTTGAACAT GTGATAATAG TCCTTCACGA 1380
ATAGCGGTCT TGATTGATTC ACCTTTTTCA ACTTCTGAAC TCAAATCTAA TGACTCAAAT 1440
TCATGATTAA CTTTGCCATA AATTAACTTT TCAAGTTGAC TACCACCAAA GTCGTCCTTA 1500
GTATCACCGA TTAAATATAA TTTATCTCCA ACTTGAGGTT CAAAATCATT TAAATAATTT 1560
ACATTTTCAA TCAAACCTAC CATTCCAACA ACTGGTGTTG GGAAAATAGA AGTACCTTTC 1620
GTTTCGTTAT ATAAAGATAC ATTACGAGAA ACTACTGGTG TCTTAAGAAT GTCGCATGCT 1680
TCTGCCATAC CTTTCGTTGA ATCTATCAAC TGTTGATAGA TTTCTTTCTT TTCAGGAGAA 1740
CCATAATTTA AACAATCTGT CATTGCTAAT GGTGTTGCAC CCACGGCAAT TAAATTTCGA 1800
TAAGCTTCAG CTACTACCAT CTTTCCACCT TCATATGGAT TGTTATATAC ATAACGCGCT 1860
TCACCATCAA TTGTTGAAGC AATTGCCTTA TTTGTGCCTT CCACACGTAC TACCGATGCT 1920
TGAAGTCCTG GCTTAATTAT CGTATTGGCA CCAACTTGTT GGTCGTATTG ATCATATAAA 1980
TAGTGTTTAG ATGCTATAGT CGGATGCTTA AGTAATTTAA AGAAAGTATC TTTAACATCG 2040
ATGTGTGTAT AATCATTTTT AGAAGTATTA TAATCTTTTT CTTCTCCTTC TAAAATATAT 2100
ACAGGTGCTT CATCAGCTAG TGGTTCAACT GGAATGTCAG CATAAACTTC GTCATCATAT 2160
GTTAAAACAA AACGATTTGT ATCTGTAACT TCACCTATAA CAGCACTATC CAATTCGTGC 2220
TTATCAAATA AATCTAAGAA TTTTTGTTCA GTACCTTTTT CAACAACTAG TAACATACGT 2280
TCTTGAGTTT CTGAAAGCAT CATTTCATAA GGAGAAATAC CTGGCTCACG TGTTGGCACT 2340
TGTTCTAATC TCAAATGTAA CCCACTACCA CCTTTTGCCG CCATTTCAGA CGATGAAGAT 2400
GTTAAACCAG CAGCACCCAT ATCTTGAATA CCAACTAATT CATCAAATGT AATTGCTTCA 2460
AGTGTTGCTT CCATTAATTT TTTACCTACA AATGGATCAC CGATTTGTAC AGAAGGTCGT 2520
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CGACCAGTTT TCAAACCAAC ATAAATGACC GAATTACCTA CACCTTTTGC TGTGCCTTTT 


2640 




TGAATCATGT CGTGATTGaT AACACCAACA CACATTGCAT TAACAAGTCG ATTGCCATCA 


2700 




TAACGTTCAT CAAATTCGAT TTCACCAGCA GTTGTTGGaA TACCAATGCA GTTACCATAA 


2760 




CCTCCGATAC CCTTTACAAC ACCTTTAAGT AATCTTTGGT TTTGTTTATT ATCTAATTCT 


2820 




CCAAATCTAA GACTGTTTAA CAAATTAATA GGTCTAGCCC CAATAGAGAC AATGTCACGA 


2880




ATGATTCCAC CAACGCCTGT AGCAGCCCCT TGATATGGTT CAATTGCTGA TGGATGATTG 


2940 




TGAGACTCTA CTTTAAATAC TACGGCTTGA TTATCACCTA TATCGACTAC CCCTGCACCT 


3000 




TCACCAGGCC CCATAAGCAC ATGGTcACCT GACGTAGGAA ATTGCTTTAA AAACGGTTTA


3060 




GAATGTTTAT AAGAGCAATG TTCACTCCAC ATAACAGAAA AGATACCTGT TTCTGTAAAG 


3120 




TTAGGTTGTC TGCCTAAAAT ATCGCAAACT TTTTCATATT CTTGATCaCT TAATCCCATA 


3180 




TCTTGATATA CTTTTTCAAG TTTAATTTCT TCAACGCTTG GTTCGATAAA TTTAGACATG


3240 




TTGTTCCCTC CAAUi rrnA CCATCGCTTC AAATAATTTC ACACCACTAT CAGTACCTAA 


3300 




CAACGTTTCT AAAGCTCTTT CagGATGtGG CATCATGCCA CATACATTGC CrrrTTCGTT


3360 




AACAATTCCT GCAATATCAT CATATGAACC GTTCGGATTA TTCACATATT TCAGAATAAT 


3420 




TTGATTGTTA GCTTTTAATT GTTGATATAT TTCATCAGTA CAATAATAAT GACCTTCACC 


3480 




GTGAGCTACA GGATATATAA CTTTTTCACC TTGTTCATAA AGATTTGTAA ATGCCGTTTG 


3540 


ATTATTCACT ATTTCTAACT CTTCATTTCT ACTAATAAAT AAATGTGAAT CGTTATGCAA 


3600 




TAATGCACCA GGTAATAAGC CTATTTCAGT TAAAATTTGA AACCCATTAC AAACACCTAA


3660 




TACTGGCTTA CCTTCAGCTG CAAGACGTTT AACTTCCGAA ATAATCGGsG CTACACTAGC 


3720 




CATTGCCCCA GATCTTAAGT AATCCCCGAA TGAAAATCCA CCAGGAATAA GTACGCCATC 


3780 




AAATGCACTT AGTGATGTTT CTCTATAATC TACATATTCC GCTTCAACAC CACTTTTAAT 


3840 




AGCAGCATTA AACATGTCTC TATCACAATT CGAACCTGGA AAAACAAGAA CCGCAAATTT 


3900 




CATTTTATGC ATTCTCCTTT TCATCATCTA ACACTTTATA GCTATATTCT TCAATCACTG 


3960 




TATTTGCAAA CAAIU'TITCA CTTAGAGTTG TAATAATGTT GTGTACCTTT TCATCACTAA 


4020 




CCTCATCCAC TGTCATATAT AATACTTTTC CTACACGAAT ATCATTCACT TGTGCATAAC 


4080 




CTAAGTCATG TACAGCTCGA GTAAGCGTTT GTCCTTGCGT ATCTAATACT TGTGGTTGTA 


4140 




ATGTGATATG TAGTTCAATT GTTTTCATTA TTTTAAATCC TCCAATTTGT TTAAAAATAT 


4200 




TTGATATGTT TCAATCAGTG ATCCAGTGTT ATTTCTATAT ACATCTTTAT CAAAGTTTGC 


4260 




ATTGGTAGCT TTATCCCAAA TTCGACATGT ATCTGGAGAT ATTTCATCCG CTAACAAAAT 


4320 
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ATCCATTAAT TGTTTCAACA CATTATTAAT CTTTAATGCT TTGGATTTTA GTATTTCAAT 4440 

ATCTTCATCT GATGCTATAT TGAGCAATTT AACATGGTCA TCCGTTATCA ACGGATCATT 4500

TAACGCATCA TTTTTATAGA AAAATTCTAC AAGTGGTTCT CTAAAAACTT CACCATTTTC 4560

AAAACCTAAA CGCTTTGTAA TAGATCCACT AGCAATATTA CGAACAACTA CTTCTAATGG 4620 

AATTATTTTC ACAGGCTTAA CTAATTGTTC TGTTTCAGAT AATTGTTTAA TAAAGTGACT 4680 

TTCTATTCCA TTTTCTTGTA AATATTTAAA TATAATAGAA GTAATTTGAT TATTTAATCG 4740

CCCCTTACCT GCCATTGTGT CTTTCTTAGC CCCGTTTCCA GCAGTAACTT CATCTTTATA 4800

TTCAACTCTT AATTCATTTT CTTGATTTGT TGAGAAAATG CGcTTCGCTT TTCCTTCATA 4860
TAATAATGTC ATGCTTTAAT TACTCCCCTC AAATTTAGCG TACATATCTT GTTCAGTTTG 
GTTTACATCA TTCGTTAGTA CAGTCATATG CCCCATTTTT CTGCTATCTT TACGCTCAGA 
CTTACCATAA ATATGTAAGT GCCACTCTGG ATGTTCATTA AATTCATTTT CCAATAAATC 

TAAATCTTTA CCTAGTAAGT TCATCATGAC TGCTGGCTTT AATAATTCAA TTGAATTTGG 5100 

TAATGATTGT CCGGTAACTG CTAAAATATG AGTATCAAAT TGTGAATAAT CACATGCTTC 5160 

AATTGAATAA TGTCCGGAAT TGTGAGGCCT TGGTGCTATC TCGTTCACAT ACAATTGGTT 5220

GTTACTATCT ATAAAAAATT CAACTGTAAA TGTTCCAATG AAATGAATCG ATTGGATAAT 5280

TTTATTAACT TGCTCTTTCG CCTCAGCTGT TTTATCTATT CTCGCTGGAA CAATTGTTTT 5340 

GAAAAGTATT TGATTTCTAT GCTCATTTTC TTGTAATGGG AAAAAAGTGA TTTGATTGTT 5400

GTTTCCTCTT GTAACAGTAA GAGATACTTC TTTCTTGATA TTCAAATATT TTTCAGCTAC 5460 

GCATTCACTA GTTTCAATTA ATTTAAAACC TTCTTGTAAG TCTTTTTCGT TGTTAATTAA 5520

AACTTGACCT TTGCCATCGT AGCCACCAAA TCTAGTTTTT ACAATAAAAG GATATCCTAA 5580

TGTTTCAATT GCTTTGTCAA TATCTGTAGA TTCTTTTACT GAAATGAACG GGACAACTTT 5640

GGTACCAGCA CTTTTTAATG TTTCTTTTTC AGTTAAGCGA TCTTGTAATA ACTGTATAGC 5700 

TTGGTAACCT TGCGGAATAT TGTACTTTTC ACATAATAGT TTTAATTGTT GGGCTGAAAT 5760

GTTTTCAAAT TCATAAGTAA TCACATCACA TTTTTGTCCT AATTGATTGA GTGCCTTTTC 5820 
ATCGTCATAC TTGGCTTGTA TAAATTCGTG TGCAACGTAT CTACATGGAC AATCTTCAGA 
AGGATCCAAT ACAACCACTT TATAACCCAT TTTTTGAGCT GATTGTGCCA TCATCTTTCC 
AAGCTGACCA CCACCAATAA TGCCAATAGT CGCACCAAAC TTTAATTTAT TGAAGTTCAT 
TTTGCATGTC CTCCACTTTT TGAATTAACG AAGATTCATA CTGATTTAGT TTTTCAACTA 
AAGAAGGATT TTGAATACTT AACATTCTTG CTGCAAGTAT ACCTGCGTTT TTAGCACCTG 



5880
AAGAATCTAT ACCCTTTAAA CTTTTTGTTT CAATCGGCAC TCCAATAACT GGTAGCGTCG 6240
TTAATGATGC AACCATACCT GGTAAATGTG CCGCACCGCC AGCGCCTGCA ATGATAATGT 6300
TTATACCTCT TTCTCTCGCT TCAGAAGCAA ATTGAACCAT CATTTTTGGC GTACGATGTG 6360
CGGATACTAC TTGTTTTTCG TACGGAATTT CAAAATAATC CAACATGTTA CAACTCTCTT 6420
GCATAATTTT CCAATCGGAA GAACTGCCCA TAATGACTGC TACTTTCACT TTGTACACCC 6480
TTTCAAAAGT TTGAATTGTG AATTACTTTA GTTGTATATT ATAGATATAG CATAACAAGC 6540
AATTTCTGCT TTTTCAATCA AAAATCGAAC TTTATTTTGA TTTTTTATTT GAATTTACGT 6600
CTTTTGCTAT GTAAATTAGT TTTATAAACT AACAAAGTTA GGATATTGAC AATAGGAGGA 6660
GAAGTTTTTA TGGTTGCTAA AATTTTAGAT GGTAAACAAA TTGCCAAAGA CTACAGACAG 6720
GGGTTACAAG ATCAAGTTGA AGCGCTAAAA GAAAAGGGTT TTACACCTAA ATTATCCGTT 6780
ATATTAGTTG GTAATGATGG CGCTAGTCAA AGTTATGTTA GATCAAAAAA GAAAGCAGCT 6840
GAAAAAATTG GTATGATTTc AGAAATCGTA CATTTGGAAG AAACAGCTAC TGAAGAAGAA 6900
GTATTAAACG AACTAAATAG ACTAAATAAT GATGATTCTG TAAGTGGTAT TTTGGTACAA 6960
GTACCATTAC CAAAACAAGT TAGCGAACAG AAAATATTAG AAGCAATCAA TCCTGAAAAA 7020
GATGTGGACG GTTTTCATCC AATAAATATA GGGAAATTAT ATATCGATGA ACAAACTTTT 7080
GTACCTTGCA CACCGCTCGG CATCATGGAA ATATTAAAAC ATGCTGATAT TGATTTAGAA 7140
GGTAAAAATG CAGTTGTAAT TGGACGAAGT CATATTGTCG GACAACCAGT TTCTAAGTTA 7200
CTACTTCAAA AAAATGCATC AGTAACAATC TTACATTCTC GTTCAAAAGA TATGGCATCA 7260
TATTTAAAAG ATGCTGATGT CATTGTCAGT GCAGTTGGTA AGCCTGGTTT AGTAACAAAA 7320
GATGTGGTCA AAGAAGGAGC AGTAATTATC GATGTTGGCA ATACGCCAGA TGAAAATGGC 7380
AAATTAAAAG GTGACGTTGA TTATGATGCG GTTAAAGAAA TTGCTGGAGC TATTACACCA 7440
GTTCCTGGTG GCGTTGGTCC ATTAACAATT ACTATGGTAT TAAATAATAC TTTGCTTGCA 7500
GAAAAAATGC GTCGAGGTAT TGATTCGTAA AGAGCCTGAG ACATAAATCA ATGTTCTATG 7560
CTCTACAAAG TTATAATGGC AGTAGTTGAC TGAACGAAAA TTCGCTTGTA ACAAGCTTTT 7620
TTCAATTCTA GTCAACCTTG CCGGGGTGGG ACGACGAAAT AAATTTTACG AAAATATCAT 7680
TTCTGTCCCA CTCCCTAATA ACTGAGTTTT AATGAAGTCT TTTAACCCAC ATTAAATATT 7740
ATTTTGCAAT TGCAATGAAT AACAAGAAAA ATCTGGGACA TTAATCGATC AAATGCTCCC 7800
TTCAAAGTAG ACATTGAATA AATGAAGGCT TTGAAGGGAG CATTTCACTT TGTACTTGGC 7860
TCAACAATTT TATATAGACA GTAGTTAATT GAATGAAAAT AAGCTTGTAA CAAGTTTTCA 7920
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GTTGGGGATG GGCCCCAACA CAGAAGCTGT GACTATGATA AAGTACTACT ACATAGTTAA 804 0 

TCATTAGTGG TTCTTTATCA TTTTCGCCTC CCTTTTCTTA TTGTTTTGAT ACACAAAAAT 8100 

TTAAGTTCAA ACTGTCGAAT AAAGTTATAT TTGATTTCAA ATTATCCCTA AATTATTAAT 8160 

TkTACAATTG TGGCAGATTT TCAAAATAAT AATTATTTCC TCATTATTTA TAAATTTATA 8220 

TTTAAATTTC ATTCTTTATA GGGTAAGATT AGGACTATAG TATGATGTGT ArATAATATA 8280 

AATTAAGGTA TAGTAAAGCT AACTCAGAAA TGACTTATCA TTCGGAGGTT ACATTATGAA 8340

TAAACTATTA CAGTCATTAT CAGCCCTCGG TGTTTCTGCT ACACTAGTAA CACCAAATTT 8400

AAATGCAGAT GCAACGACGA ATACTACACC ACAAATTAAA GGCGCTAATG ATATCGTTAT 8460

TAAGAAAGGT CAAGATTATA ACCTTCTAAA CGGCATAAGT GCATTTGATA AAGAAGATGG 8520 

AGATTTAACC GATAAAATTA AAGTCGATGG CCAAATTGAT ACATCTAAAT CTGGTAAATA 8580 

TCAAATTAAA TATCATGTCA CTGATTCAGA TGGTGCAATT AAAATTTCCA CTAGGTATAT 8640 

TGAGGTTAAA TAGCCCTCAT CACTATACTG CAAATAAAAT GGTAGCAAAC GAACATGTTT 8700 

TGCTACCATT TTATTTGTTA TTCTAACTTC ATCTGCAACT TTAACCCAAA TATTGTATTT 8760

TTTCTGTATA CCAAAGGACT ACCTATCAAA TTATTAAAAC TTAACTGCTC TTTTTAAAAA 8820

AATGTTTTGA TTTTGAACAA ACAAATTTCC ACTTTTCATT GTTTAACGAT AAATTACTTT 8880 

TGGCAAATTC CTTATTAAAA TGTTTGCGCT TCCTTTCAAT CAACTAGCCA TCATTTTCAA 8940

TTTATTAGAC AATTTCAAAC TTTTTTTATT TTCATTCAAT TAACCTTTAA TTGAAAGCTA 9000

TTCTCAACTT TCCTTTTAAA TATGAAGCAA TTTTTTCAAA AACGCTATTA GTCACAAAAT 9060 









GT 

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2738 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

AAATATTTTT TCAAAACTAT GTGAAAATGG aCCATGTCtA aATCATGTAA TAATGCAGyA 
CATAATGCCA ACGGTCTmTC TTTATTGTCC CATGCATCAT GACCAATAAA TGACTCATCA
ATTAATCGTC TAACTATTTC ATACACACCT AAAGAATGTC CAAAGCGACT ATGTTCTGCT
GTGTGAAAAG ATAGGTACAG TGTTCCTAGT TGTCTAATTC GACGTAACCT TTGGAATTCC
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TCTTTAAAAA CTTTTTCTTC TACTAATTTT AAATCTACAT ATGCGTTAGT CATTATTCCC 
CTCCTTTTCG TTTAATATAA TATTTAATTT ACTTAAAATG CTTTGTACAT AAGTGCTAAG
TCTAACTTTT CGCCATACAT TTCTGGCTCA TAAGAGCGTA AGATTGTAAA ACCTTGCTCT 
TTATAGTAAG CTACTGCTTC TTCATTTTTA TTATCTACTT CTAAGTAAAC ACCTTCAAAT 
TTATCTTCAA AACGTGATAA TCCTTCATTT AACAATGCTG TACCATAACC TGTATGTTGC 
GATTCTGGTT TAACATAATG AGCTGATAAA TATAATTCTT CACCGTAAAT AAAGTTAGCA 
AAGCCAACGA TGTCATTACC TTCTTCAACG ACTAAGAATA ATTGTTCTTG AAGTCTTTTC
TTTAAATGAT GTTCATTATA TGAAGCTtCT AACAAGTGAT TAACTGTTGT CGCAGCGTAT

ATATTTAAGT ATGTATTAAA CCAAGCTTTA GTTGCGACAT CTCTAATTTG AACAACATCT 
TTTTCAGTTG CTTGTCTTAC CTTGAACATG ACTTTCTCCC CTTATTAACA AGTTTTAATA 
ACGGCATTAT ACCACAACTT GCTCAATACT TAATAAACAA TGATTGTCTA TTCAATTTAT
ATATtTATAT TTTCCGTTAA AATTAAAAAT AAAAAATAAC GAAGCAAAAA AtCACTTCGT
TTAGTATGAG GTATGTCTTA TTGCAATATA CTATTCCACT CAGTTGCACG TGCTAAGGCA
TAGTTGTCTT TCATGATGTC ACCAGGCTTT TCAGCAGTTC CAATAATATA ACCATTTAAA
GTGGCACCTA rAAAGTCTAA ACTATATTTC ATTTGCGTAA TTGCTGGTTC GCTTTTATTT
TTGGACAATC TCCACCAACT AAAATAACTC TAAAATCCTT TTCGGCCATT TGTGCCTTAA
AATTAGGATA TCGTTTATCT TGTAATGTTT CTGACCAATG TTCGATAAAT GCTTTCAATG
GTGCTGAAAT GCTATACCAA TACACTGGTG ATGCAAAAAT AATTGTATCA CTAGCCAATA 
TTTTATCTAG AATCGGCAAA TAGTCATCGT CATATGAAGT AATAGTCTCT GCTGTATGTC 
TCACGTCACG TATCGGTTTA AACTGATGTT GTGTCACGTC AATCCATTGA TACTCTAAAT 
CTTGCAAAGC GAATTTTGTT AATTGTGCAG TATTACCGTT TGGTCTACTC CCACCAAACA
AAACAGTAAT CATTTTAGCC TAACCTCACT TTTGATTAAT AAATATCTGT GTTTTTCGTT 
ACCTAATTAT ACTATCATAA GCTTTGCCTA CCGAATAGTA AAACGCTTAC AACTTTTATA 
TAAATTTGAC GAAATTTCGT CATGCCTTAT ATAACGTCGT TTGTGATACG GGGCTAATTC 
ATGATGAAAT TAGATACATA TATCACCATT AAATACAATT CATTTAGTCT TCAATCGGAA

ACAGTTCATC GATATATTGA ATCTCATCAT CTGATAAAAC GATATCTGCA GCTTTAATAT 
TTTCAACGAC TTGTTCTGCA CGTTTTGCAC CAGGAATAAT CACATCGATA GCTGGTCTCG 
TTAAATAAAA TGCTAATACA ATGTTCGCAA TTGAAGTTTG ATGTGCTGCA GCTATGCTTT

CCAAAGCTTT TACGCGACGC ACATTTTCTT CAAATACACC TGGTTTAAAA TCACGACGTG 
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GCTAATGGGA AATATGGAAT AAATGTGATT TGGTGATCAA CACAATATTG TAATACTGCC 2160 

TCATTTTCGC GATGCAATAA ATTATATTCT AACTGTACAA CATCAACGTA ACCATCTTTA 2220

TTTGCTTCTT TAAGTTGATC TAATGTGAAA TTTGATACAC CAATTGCTTT AATCTTCCCT 2280

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 2340

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 2400

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 2460

TGAGACCCTT TCGTAGCAAT TTTAATTTGC TCTCGCGGAT ATTCTTTAAC AACTTCTCCA 2520

ACCAATTCTT CTGATCGTTC TGGCCCATAA ATATATGCCG TATCTAATAA ATTAATACCA 2580

TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 2640

TTATGCCCAa CCTAtGCGTT CGTCCCAAGT GCGATTGGAA ACACTTCAAC ATCAGATTTA 2700 

CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 2738 
(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9425 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

GATTAGATGA TATTTAACGA AAATTAaGrT GmAATACTtG AATGTArGAa GTCTGATGTC 
GAAAATAGCT ATTAAAATAG AGTAGACGTA ATGtAAATGA AAGCACCTAA AATAGAAAAA
TTTCAAAAAT AGCGTAATTA TTATAATAAA TAGACTGCCA ATAAAATGCA ATTTTTCACT 180
TATAACATTC TTCAAAAAAT AATAGCAAAA TTATGTAAAA AATATCTTGT CATGGCAAGA

TTGGCTGTGC TATAATCTAT CTTGTGCTTA AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 
ACCGCCCTTT CACGGCGGTA ACACGGGTTC GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 
CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT GTCACGCAGG AGATCGCGGG TTCGATTCCC 
GTCGAGACCG TACAAATGCC TATCCAAGAG GATAGGCATT TTTTTGCGTT TAATATTATA 480 
TTAATAAAAG ATATATGGAC GAATGATAAT CATATTGATT TATCTGTTCG TCCATTTTCT 540

TTAAAATGTA TGAACCTCAA GTAACTTAGT GGTTGGATAT GAAAGATAAA CGTAGACAAT 600 
AAAATCTTTA TTAGACGTAC AAACATATGC TACTGTCAAC ATATTTCTTC GTTGTGATAT 660 
GCCACCAGTC CTCCATAACA TCAATTGTTA AAGTAACGAA TAACGAATAA TGATATTTAT 720 
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GACCTCATCA TTGTGTTAAA TATCATTGTC ACAATCCGCC GTGAGAAACT AATAAAAAAT 
AGTAATATAT AAGTTTATAT TGGAAAATAG AATTAATAGC TTATAAATGG TAAATTATAT 
AATAGGTTAC TATACGTTAT AAGACGGAAA ATGCGCACAA TAACAAAAAT AGTAAGCGAC
ATCCTGTGAT TTTTTACACA AACATAAACG ATAAAGAACA AAAAATGATA AAATAATATT 
AATGATTTAA GAAAAGAGGT TTATGCAAAT GGCTAGAAAA GTTGTTGTAG TTGATGATGA 
AAAACCGATT GCTGATATTT TAGAATTTAA CTTAAAAAAA GAAGGATACG ATGTGTACTG 
TGCATACGAT GGTAATGATG CAGTCGACTT AATTTATGAA GAAGAACCAG ACATCGTATT 
ACTAGATATC ATGTTACCTG GTCGTGATGG TATGGAAGTA TGTCGTGAAG TGCGCAAAAA

ATACGAAATG CCAATAATAA TGCTTACTGC TAAAGATTCA GAAATTGATA AAGTGCTTGG 
TTTAGAACTA GGTGCAGATG ACTATGTAAC GAAACCGTTT AGTACGCGTG AATTAATCGC 
ACGTGTGAAA GCGAACTTAC GTCGTCATTA CTCACAACCA GCACAAGACA CTGGAAATGT

AACGAATGAA ATCACAATTA AAGATATTGT GATTTATCCA GACGCATATT CTATTAAAAA 
ACGTGGCGAA GATATTGAAT TAACACATCG TGAATTTGAA TTGTTCCATT ATTTATCAAA 
ACATATGGGA CAAGTAATGA CACGTGAACA TTTATTACAA ACAGTATGGG GCTATGATTA

CTTTGGCGAT GTACGTACGG TCGATGTAAC GATTCGTCGT TTACGTGAAA AGATTGAAGA 
TGATCCGTCA CATCCTGAAT ATATTGTGAC GCGTAGAGGC GTTGGATATT TCCTCCAACA 
ACATGAGTAG AGGTCGAAAC GAATGAAGTG GCTAAAACAA CTACAATCCC TTCATACTAA

ATTTGTAATT GTTTATGTAT TACTGATTAT CATTGGTATG CAAATTATCG GGTTATATTT
TACAAATAAC CTTGAAAAAG AGCTGCTTGA TAATTTTAAG AAGAATATTA CGCAGTACGC 
GAAACAATTA GAAATTAGTA TTGAAAAAGT ATATGACGAA AAGGGCTCCG TAAATGCACA
AAAAGATATT CAAAATTTAT TAAGTGAGTA TGCCAACCGT CAAGAAATTG GAGAAATTCG 
TTTTATAGAT AAAGACCAAA TTATTATTGC GACGACGAAG CAGTCTAACC GTAGTCTAAT 2100
CAATCAAAAA GCGAATGATA GTTCTGTCCA AAAAGCACTA TCACTAGGAC AATCAAACGA 2160
TCATTTAATT TTAAAAGATT ATGGCGGTGG TAAGGACCGT GTCTGGGTAT ATAATATCCC 2220

AGTTAAAGTC GATAAAAAGG TAATTGGTAA TATTTATATC GAATCAAAAA TTAATGACGT 2280



TTATAACCAA TTAAATAATA TAAATCAAAT ATTCATTGTT GGTACAGCTA TTTCATTATT 
AATgCACAGT CATCCTAGGA TTCTTTATAG CGCGAACGAT TACCAAACCA ATCACCGATA 
TGCGTAACCA GACGGTCGAA ATGTCCaGAG GTAACTATAC GCAACGTGTG AAGATTTATG

GTAATGATGA AATTGGCGAA TTAGCTTTAG CATTTAATAA CTTGTCTAAA CGTGTACAAG 
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GTGATGGTAT TATTGCAACA GACCGCCGTG GACGTATTCG TATCGTCAAT GATATGGCAC 2640 

TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700 

GTCTTGAAGA TGAATTTAAA CTGGAAGAAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2760
ATTTAAATGA AGAAGAAGGT CTAATCGCAC GTGTTAACTT TAGTACGATT GTGCAGGAAA 2820
CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2880
AACGTGAGCG TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTTAACTT 2940
CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3000
CACAATTTTT ATCTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 3060
TGCTACAGTT ATCTAAAATG GATAATGAGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120
TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180
TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAATGACGC 3240
AAGTATTTGA TAATGTCATT ACAAATGCGA TGAAATATTC TAGAGGCGAT AAACGTGTCG 3300
AGTTCCACGT GAAACAAAAT CCACTTTATA ATCGAATGAC GATTCGTATT AAAGATAATG 3360
GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420
AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 3480
TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 3540
TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 3600
ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 3660
TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3720
rAACCTTTAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATTACGCC ATTTCAGATT 3780
ATTCATTCGA AAAATGATCA TCTcAGAAGGA ACGATTGCGA CGGTATCTAA TCTGAATAAA 3840
CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3900
AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT TCGATTTTAC GTATGATTTA 3960
CCGTTATCAA CATATCTTGG TCAAGTACTG AACATGAATG CGAAAGTACC AAATCATTTC 4020
AATTTCAATC GTTTGGTCAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4080
AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 4140
GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4200
GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4260

TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4320
GCAAACTATA ACGATAAAAA TGAAAAATAT CATTATAAAA ACCTGTCCGA AGATGAAGCG 4440
AGTTCCAGCA AAATGGAAGA AACGATTCCA GGAACCTTTG ATTTTATTAA TGGTCATGGT 4500
GGTTTCTTAA ACGAAGACTT TAGATTGTTT AGTACGAATA ATCAGTCAGG CGAGTTAACA 4560
TATCaACGTT TCCtTAATGG TTATCCAACG TTTAATAAAG AAGGTTCTAA TCAAATTCAA 4620
GTCACTTGGG GTGAAAAAGG CGTCTTTGAC TATCGTCGTT CGTTATTACG CACCGACGTT 4680
GTTTTAAATA GTGAGGATAA TAAATCGTTG CCGAAATTAG AGTCTGTACG TTCAAGCTTA 4740
GCGAACAATA GTGATATTAA TTTTGAAAAA GTAACAAACA TCGCTATCGG TTACGAAATG 4800
CAGGATAATT CAGATCATAA TCACATTGAA GTGCAGATTA ACAGTGAACT CGTACCGCGT 4860
TGGTATGTAG AATATGATGG CGAATGGTAT GTTTATAACG ATGGGaGGCT TGaATAAATG 4920
AACTGGaAAC TGACAAAGAC ACTTTTCATT TTCGTGTTTA TTCTTGTCAA CATCGTGTTA 4980
GTATCGATTT ATGTTAATAA AGTCAATCGC TCACACATTA ATGAAGTCGA GAGTAACAAT 5040
GAAGTTAATT TTCAGCAAGA AGAAATTAAA GTACCGACTA GTATATTGAA TAAATCAGTT 5100
AAAGGTATAA AATTAGAGPA AATTACAGGG CGATCAAAAG ACTTTAGTTC TAAAGCTAAA 5160
GGCGATTCGG ATTTGACCAC ATCAGATGGT GGAAAATTAT TGAATGCGAA CATTAGTCAA 5220
TCGGTAAAGG TCAGTGAPAA TAACTTAAAA GATTTGAAAG ATTATGTTAA CAAGCGCGTA 5280
TTTAAAGGTG CTGAATATCA ATTAAGCGAG ATTAGTTCAG ATTCTGTAAA ATATGAACAA 5340
ACGTATGATG ATTTTCCGAT TTTAAATAAC AGTAAAGCGA TGTTAAACTT TAATATAGAA 5400
GATAACAAAG CGACTAGTTA TAAACAATCA ATGATGGATG ACATTAAGCC CACAGATGGT 5460
GCAGATAAGA AGCATCAAGT GATTGGTGTG AGAAAAGCAA TCGAGGCATT ATATTATAAT 5520
CGTTACTTGA AAAAAGGTGA TGAAGTCATT AATGCTAGAC TCGGTTACTA CTCAGTCGTG 5580
AATGAAACGA ATGTTCAATT GTTACAACCA AACTGGGAAA TTAAAGTGAA GCATGACGGT 5640
AAGGATAAAA CGAATACTTA CTATGTCGAA GCGACAAATA ATAACCCTAA AATTATTAAT 5700
CATTAATATG AATCGTAATA AGCTAGCATT GCAAGCTCAT CATATGTGAG AAGCGGTGCT 5760
AGCTTTTTTG CTGGTACGGT TTATTATGGC TGATGTTTTT GCGTCTCCAA CGTGCGCATT 5820
TATTCATATT TTAAGTAGAA CCGCATTGTA AAATTAGTGT AACTGTTATT TTAAAAACTT 5880
TAGTATTTGT CTAATCATTG TTATAATAAT TAAGAAATTC ATTGCACGTG ATTATCAAAA 5940
TTTAAATATA AGAAACCGGT CGATGAACTA AAGTTACATA ATAGGAAAGG TATACAAAAC 6000
AGCTAATATA CTGATAGTTT CTGTAGGGAA AATCGTATAT TTGCACTGAT GTATATTGCA 6060
GTCATATAGA GAGATTGACT GTTTAAAGAG AAAGGATGAG CCGCTTGATA OGCATGAGTG 6120
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TAGTTGATGT TGGTTTGACT GGAAAGAAAA TGGAAGAATT GTTTAGTCAA ATTGACCGTA 6240 

ATATTCAAGA TTTAAATGGT ATTTTAGTAA CCCATGAACA TATTGATCAT ATTAAAGGAT 6300
TAGGTGTTTT GGCGCGTAAA TATCAATTGC CAATTTATGC GAATGAAAAA ACTTGGCAGG 6360
CAATTGAAAA GAAAGATAGT CGCATCCCTA TGGATCAGAA ATTCATTTTT AATCCTTATG 6420
AAACAAAATC TATTGCAGGT TTCGATGTTG AATCGTTTAA CGTGTCACAT GATGCAATAG 6480
ATCCGCAATT TTATATTTTC CATAATAACT ATAAGAAGTT TACGATTTTA ACGGATACGG 6540
GTTACGTGTC TGATCGTATG AAAGGTATGA TACGTGGCAG CGATGCGTTT ATTTTTGAGA 6600
GTAATCATGA CGTCGATATG TTGAGAATGT GTCGTTATCC ATGGAAGACG AAACAACGTA 6660
TTTTAGGCGA TATGGGTCAT GTATCTAATG AGGATGCGGC TCATGCAATG ACAGACGTGA 6720
TTACAGGTAA CACGAAACGT ATTTACCTAT CGCATTTATC ACAAGACAAT AACATGAAAG 6780
ATTTGGCGCG TATGAGTGTT GGCCAAGTAT TGAACGAACA CGATATTGAT ACGGAAAAAG 6840
AAGTATTGCT ATGTGATACG GATAAAGCTA TTCCAACGCC AATATATACA ATATAAATGA 6900
GAGTCATCCG ATAAAGTTCC GCATTGCTGT GAGACGACTT TATCGGGTGC TTTTTTATGT 6960
TGTTGGTGGG AAATGGCTGT TGTTGAGTTG AATCGGCTTG ATTGAAATGT GTAAAATAAT 7020
TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA ATATAAGGAT 7080
TATGATAATA TATTGGTGTA TGACAGTTAA TGGAGGGAAC GAAATGAAAG CTTTATTACT 7140
TAAAACAAGT GTATGGCTCG TTTTGCTTTT TAGTGTAATG GGATTATGGC AAGTCTCGAA 7200
CGCGGCTGAG CAGCATACAC CAATGAAAGC ACATGCAGTA ACAACGATAG ACAAAGCAAC 7260
AACAGATAAG CAACAAGTAC CGCCAACAAA GGAAGCGGCT CATCATTCTG GCAAAGAAGC 7320
GGCAACCAAC GTATCAGCAT CAGCGCAGGG AACAGCTGAT GATACAAACA GCAAAGTAAC 7380
ATCC&ACGCA CCATCTAACA AACCATCTAC AGTAGTTTCA ACAAAAGTAA ACGAAACACG 7440
CGACGTAGAT ACACAACAAG CCTCAACACA AAAACCAACT CACACAGCAA CGTTCAAATT 7500
ATCAAATGCT AAAACAGCAT CACTTTCACC ACGAATGTTT GCTGCTAATG CACCACAAAC 7560
AACAACACAT AAAATATTAC ATACAAATGA TATCCATGGC CGACTAGCCG AAGAAAAAGG 7620
GCGTGTCATC GGTATGGCTA AATTAAAAAC AGTAAAAGAA CAAGAAAAGC CTGATTTAAT 7680
GTTAGACGCA GGAGACGCCT TCCAAGGTTT ACCACTTTCA AACCAGTCTA AAGGTGAAGA 7740
AATGGCTAAA GCAATGAATG CAGTAGGTTA TGATGCTATG GCAGTCGGTA ACCATGAATT 7800
TGACTTTGGA TACGATCAGT TGAAAAAGTT AGAGGGTATG TTAGACTTCC CGATGCTAAG 7860
TAcTAACGTT TATAAAGATG GAAAACGCGC GTTTAAGCCT TCAACGATTG TAACAAAAAA 7920
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TGAAGGCATT AAAGGCGTTG AATTTAGAGA TCCATTACAA AGTGTGACAG CGGAAATGAT 
GCGTATTTAT AAAGACGTAG ATACATTTGT TGTTATATCA CATTTAGGAA TTGATCCTTC 

AACACAAGAA ACATGGCGTG GTGATTACTT AGTGAAACAA TTAAGTCAAA ATCCACAATT 8160
GAAGAAACGT ATTACAGTTA TTGATGGTCA TTCACATACA GTACTTCAAA ATGGTCAAAT 8220
TTATAACAAT GATGCATTGG CACAAACAGG TACAGCACTT GCGAATATCG GTAAGATTAC 8280
ATTTAATTAT CGCAATGGAG AGGTATCGAA TATTAAACCG TCATTGATTA ATGTTAAAGA 8340
CGTTGAAAAT GTAACACCGA ACAAAGCATT AGCTGAACAA ATTAATCAAG CTGATCAAAC 8400
ATTTAGAGCA CAAACTGCAG AGGTAATTAT TCCAAACAAT ACCATTGATT TCAAAGGAGA 8460
AAGAGATGAC GTTAGAACGC GTGAAACAAA TTTAGGAAAC GCGATTGCAG ATGCTATGGA 8520
AGCGTATGGC GTTAAGAATT TCTCTAAAAA GACTGACTTT GCCGTGACAA ATGGTGGAGG 8580
TATTCGTGCC TCTATCGCAA AAGGTAAGGT GACACGCTAT GATTTAATCT CAGTATTACC 8640
ATTTGGAAAT ACGATTGCGC AAATTGATGT AAAAGGTTCA GACGTCTGGA CGGCTTTCGA 8700
ACATAGTTTA GGCGCACCAA CAACACAAAA GGACGGTAAG ACAGTGTTAA CAGCGAATGG 8760
CGGTTTACTA CATATCTCTG ATTCAATCCG TGTTTACTAT GATATAAATA AACCGTCTGG 8820
CAAACGAATT AATGCTATTC AAATTTTAAA TAAAGAGACA GGTAAGTTTG AAAATATTGA 8880
TTTAAAACGT GTATATCACG TAACGATGAA TGACTTCACA GCATCAGGTG GCGACGGATA 8940
TAGTATGTTC GGTGGTCCTA GAGAAGAAGG TATTTCATTA GATCAAGTAC TAGCAAGTTA 9000
TTTAAAAACA GCTAACTTAG CTAAGTATGA TACGACAGAA CCACAACGTA TGTTATTAGG 9060
TAAACCAGCA GTAAGTGAAC AACCAGCTAA AGGACAACAA GGTAGCAAAG GTAGTAAGTC 9120
TGGTAAAGAT ACACAACCAA TTGGTGACGA CAAAGTGATG GATCCAGCGA AAAAACCAGC 9180
TCCAGGTAAA GTTGTATTGT TgtAGCGCAT AGAGGAACTG TTAGTAGCGG TACAGAAGGT 9240
TCTGGTCGCA CAATAGAAGG AGCTACTGTA TCAAGCAAGA GTGGGAAACA ATTGGCTAGA 9300
ATGTCAGTGC CTAAAGGTAG CGCGCATGAG AAACAGTTAT TTCATAATCA ACAGTCATTG 9360
ACGTAGCTAA GTAATGATAA ATAATCATAA ATAAAATTAC AGATATTGAC AAAAAATAGT 9420
AAATA 9425
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3886 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:
AGTTGTAATG TCACATTTCC AGAGTCTGAA ATTATCTTTA TCACGTTACA TTTACTAGGC 60
TCTAAAATGA CTGAACATAC AGCATCTTCA ATTACCTTTG AATACCATGA TTTATCGCAA 120
AATATACATG AATTGATCAC TTGTGTTAGC CAAGAATTAG GCATTGATAT GTCAAAAGAC 180
AACAAGTTAC ATACCAGTCT GATCACACAT ATCAAACCAG CTATACATCG TATTAAATAC 240
GATATGCTAC AACCTAATCC TTTGAGGCAA GAAGTTATGC GTCGCTATCC TCAAATCATT 300
GAAGCCGTTA GCAAGCATAT TAGTCCAATT GAACAAGATG CTGCTATTCG CTTCAACGAA 360
GATGAATTAA CATACATTAC AATTCACTTC GCATCAAGTA TAGAGCGTGT TGCAACACAT 420
AAACAATCAA TGATTAAGGT TGTCTTACTA TGTGGTTCTG GTATAGGCAC GTCACAACTT 480
TTAAAATCAA AACTAAATCA CCTGTATCCT GaGTTnCACA TTTGGGAtGc CTATTcCATT 540
TaTcAATTGG aAGaAAGTCG ATTATTGCAA GATAACATTG ATTATGTCAT TTCAACAGTA 600
CCTTGTGAAA TATCAGCTGT ACCAGTTATT CATGTCGATC CATTTATCAA TCAACAATCT 660
CGTCAAAAAT TGAATCAAAT TATCAATGAC TCAAGAGAAC AACGAGTCAT GAAAATGGCA 720
ACTGATGGCA AGTCACTCGC AGATTTATTG CCTGAACATC GCATCATTAT AAATAAACAA 780
CCATTATCAA TTGAATCCGC AATTGCAGTG GCTGTGCAAC CTTTAATCAA TGATGGCATT 840
GTCTATTCAA ATTATACAGC TGCAATTTTA AAACAATTTG AACAATTCGG GTCATATATG 900
GTCATTAGTC CACATATTGC ACTTATTCAC GCTGGTACTG ATTATGTACA GAATGGTGTA 960
GGTTTCGCAC TAACATATTT CACTGAAGGG ATTATCTTTG GTAGTAAAGC TAACGATCCC 1020
GTTCACCTTG TAATTACATT AGCAACGGAC CACCCCAATG CACATTTAAA GGCATTGGGA 1080
CAGTTAAGCG AATGCTTAAG CAACGACTTA TATCGACAAG ATTTCTTAGA TGGGAATATT 1140
TTT^AATTA AACAACACAT TGCTTTAACT ATGACAAAGG AGGCTTAATA ACGTGTCATT 1200
AGACATTTTG TCAACAACAC GCATCATTGT AAAAGAACAA GTAAATGATT GGACTGAAGC 1260
TATAACTATA GCTTCTCAGC CATTACTACA AGAACAAATT ATTGAACAAG GCTATGTTCA 1320
AGCAATGATT GATAGCGTTA ATGAACTTGG ACCTTATATC GTTATCGCAC CTGAAATTGC 1380
AATTGCACAT GCAAGACCGA ACAATGACGT ACATCAAGTT GGTTTAAGTC TATTAAAGTT 1440
GAATCAACAT GTGGCATTTT GTGATGAAGA TCACTACGCA TCTCTCATTT TTGTATTGAG 1500
TGCCATCGAC AATCATTCAC ACTTATCTGT ATTACAAAAT TTAGCAACCG TACTGGGCGA 1560
TAACCAAACA GTCCAGCAAC TATTAACTGC AACAAATGCA CAAGACATTA AAAACATTTT 1620
AAAGGAGCAT GATTAATATG AAAATTTTAG TAGTATGTGG CCACGGTTTA GGAAGTAGTT 1680
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AAGTTGAACA TAGTGACATT ATGACAGCAA GTCCAGAGAT GGCTGACTTG TTTATTTGTG 
GTAGAGATTT AGCTGAAAAT GCCGAACGTC TAGGGGATGT CTTAGTTCTT GATAATATTT 
TAGATAAAGC TGAATTACAA CAAAAGCTCT CAGAAAAATT ACAACAACTT AACATGATTT 
AAAGGAGGTA CGACCTATGC AAGCAATCCT TAATTTTATA GTCGATATTT TAAGTCAACC 
AGCCATTCTT GTTGCACTGA TTGCCTTTAT AGGTTTAATC GTTCAGAAAA AACCTGCCGC
AACGATCACT TCAGGAACCA TTAAAACGAT ATTAGGCTTC TTAATTTTAA GTGCAGGTGC
TGATGTCGTC GTTCGATCTC TTGAACCATT CGGCAAAATA TTCCAACACG CATTTGGTGT
GCAAGGTATC GTACCTAACA ACGAAGCTAT CGTCTCACTA GCCTTAAAAG ATTTTGGAAC
AACAGCTGCA CTCATCATGG TCTGTGGCAT GATTGTTAAT ATTTTAATTG CCCGCTTCAC
TAATTTAAAA TATATCTTTT TAACAGGTCA TCATACATTT TACATGGCTG CGTTTTTAGC
AATCATTTTA ACAGTCAGTC ATATTAAAGG CTGGCTAACG ATTGTTATCG GCGCACTCGT
ATTAGGATTA ATCATGGCAG TATTACCTGC ATTACTCCAA CCTACGATGC GAAAAATTAC
AGGGAATGAC CAAGTAGCTT TAGGTCATTT TGGCTCAATC AGTTACTTTG CCGCAGTGCT
GTAGGTCAAT TATTCAAAGG TAAGTCTAAA TCAACGGAAG AGATTAAATT TCCAAAAGGC
TTAAGTTTCT TACGAGAAAG TACAATTAGT ATCTCGATTA CGATGGCATT ACTTTACTTC
ATCGCATGCT TATTTGCGGG CGTTAGTTAT GTACACGAAT CTATTAGTGA TGGTCAAAAC
TTTATTGTCT TTTCATTAAT TCAAGGTGTG ACATTTGCTG CTGGTGTATT TATTATTTTA
ACGGGCGTTC GTTTAATCTT AGCTGAAATC GTCCCAGCAT TTAAAGGAAT TTCTGAAAAG
CTTGTACCAA ATTCTAAACC TGCATTAGAC TGCCCTATTG TGTTCCCTTA TGCACAAAAT

GCAGTATTAA TTGGATTCTT TGTCAGCTTT ATTACAGGTG TCATCGGTAT GTTTATCTTA 
TTCTTATTTG GTGGCGTCGT CATTTTACCT GGCGTAGTTG CACACTTCTT CTTAGGTGCA 









CTAAATGGTA TCCTAATCAC GTTTTTACCA TTATTATTCT TGCCATTTTT AGGCGAATTA 
GGTGGTGCTG CAACAACATT CTCAGATACA GACTTTTTAG CTGTCGGTAT CGTGTTCGGT 
AACGCAGTAA AATATATGGG ATTATTTGGT GCGATTCTAT TTATTATTAT CGTAGGTGCG 
ACAACAATTT TATTAAAAGG CCGTCAAAAA GAACAGCAAT AGTGTTAACG TAGAAATATA 
AAACACCGTC ACATATTGAG TGAATGCCCC TTTtATCAAG AGGAAAGCCA CTTACTTATG
GACGGTGTTT TGTATTATAT TAAATGATAC TTAGCCATAC TATCGACAGC TGCTAAAATT 
GCTTCTTCTT GTGTCGCAAT CGGTTCCCAA CCAAGTAATG TTTTTgCACG TTCGTTACTT 
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CTTTTTGAGC TATTTCAGGT 
TTTGTCCATC AGCCGTCGCA 
AAATGTGCAG TTCAGCTACG 
GTTTCATTGA ACCATTCAAT 
CTAATGATGG CCCAAAAATT 
CATTCT 






CCTAGACTCA AAATAAAGTC TGGTAATTTT TTAGTAGAAA 
CTCTTTTCTT TAATTAATTT TGCAATTTCC AACAAATTAA 
ATAAATCGCT TGCCATTAGC TTGTTCATTT GTCATTGCCA 
TCTCTCACAT CAACAACATT TAACGGAATT TGCGGTACAC 
AAATTTTCTA ATAAATGAAA GCTTCCTGAA ACGTGTGCAT 
GCAACTGGAT TGATTGTGGC AAATTCTACT GTTGTATTTT 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4879 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:
GTCATCTATC AAAAATTTGG TATACAGACC GACAATTATT AATTAATAAT TTAATTTCCC 
AGGCAATACC AGTGATTAAA TATCCACAAA TACAACATAA AGAACAACCA TTAGAATCTA
TTTCACAACT TATATTGTCT AAGATGACAT CTAATCAATA GTGTTTAAAT TTCTCAGTGG 
CTGTGAATGA GGTTTAAAAG TACTATAAAA CGTAAACTTT GATACTTTAA AATACGCAAA 
AAACGGTAAA CCCTAATTCA TATTATAGAG TTTACCGTTT TATTTTTTAA CTTGCATCAT
AGTTATATTA ACATTATTGT TGGTAGTTTG GATCAGTAAC CATTGCTTGT CCAGTATAAT 
CAACCGTTAC AATTGAATAT TTTCCaTTTG CATTTGGGTC TTTAAAACTA AACACATACT 
TATAGTTGCC ATTATGTTCT TCAATAGAAT AATCATTATA CACTTTATTA TTACTACCAA 
ATTTATTTGC TTCATTATTA GCCGCATTTA AAGCTGTTTG GA^TTTGGC J^TTGCTGTA
AAGCTTGATT TTTATTTCCA TTAAACGGAT AAATTTGACG TGCAACCGGC GCGGCATTTT
GnCCATAATA TGGTGCAACG TAACTTGATT TTTGATTATT ATTCGCTTGG TTATTACTTG 
ATTGGTTATT ATTTGTTTGG TTTTGGTCAT TGTTTGTTGC ATTTGAATTA GATTGTTGCT 
GGTTATCGTT TGCACTATTA TCTTTATTAT CTTTGTTTAC GTCTTTACTA TCATCTTTAT 
TATCTTTCTT ATCTTTAGAT GAATCATTTG TTTTTTTATC TTGTTGTTCA GTTTTCGCTT 
TATCATCTTT TTCTTTATTA CCGTCTTTTT GTTGGTCACT ATCTTGACCA CATGCAGCTA 
AAAATAATGA TAATGCTAGT AACCCTGTAA CTAATCTTTT CATACATATC TCCTCCTATA 
ATTCGATATT CATTGAATAA TCTTGAAATA CATATCTACC ATGTGTATCT TTTCATGGCT 
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3660 
3720 
3780 
3840 
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1020 
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TAAGGTTfTTT 


1A1AL 


CCTAAi'ri'lT 


GTTCATTATT 


ATTTAATTTT 


TGTGAATTTT 


1140 




ATGtTTkCTA 


TAAATTTA it 
x nnn XXX i\n 1 


TATTTTACTT 


TAACAATTCA 


TTACGCATTT 


AGCATTTCAA 


1200 


5 


GGTATACACA 


ATA TTT 2i TT a 


CTATGATTTC 


ATTTTATCTG 


CTGCAAAAAC 


AATCATTATA 


1260 




ACTCTTTTTC 


CATAATTMia 


*rv *'p^m»a T*/** r^s^ 
1 v- 1G1ATCCG 


^^^^ % n ^n^^ m ^^^^ 

TTACATCACC 


TGTTTGAAAA 


TGATGTTCAC 


1320 


10 


r&Arr&rTTT 




PPM"!"!'?! T*K A A 


ATGCTTGAGC 


ACGAGGATTA 


TGCTCCCAAA 


1380 


CTCCTAGCCA 


A ATTT^rRTPT 
nt\X X X inlul 


TTATTATGTT 


CTTGAGCAAT 


TTTTTCGGCC 


AATTCTATCA 


1440 




ATTG TGAACP 


X V_ X X CCLtLCA 


PrTTP AAA /"•'■n 

LUTTGAAAGT 


CTTTCaAAAA 


ATATATGCGC 


TGCACTTCTA 


1500 


15 


AATAGGTCTC 




1 CAtj X I lXiAG 


CACTATTAAT 


ATTCATCTTT 


ATATAACCAA 


1560 




CATTCGCACC 




rp A A A A A rn« m m 

1 AAAAATAAT 


GAAATGAATC 


TACATGGTTA 


ATCTCTTGTG 


1620 




xrvu-ix X x \_ X \— 




TTGTCTTTAA 


AAAATTGATC 


AAAATCTTTG 


TCATCATAGT 


1680 


20 




I L» X L-A 1 AA 


aatgttctag 


TTGCTAATTC 


AACTAATTCA 


CT AG CATTTT 


1740 




GTTCTGAAAT 


TTrTTTPRTT 


A I CCCAGCCA 


TATAAATCCT 


CCAATAAACA 


GTG AT CGAAT 


1800 




V^r\AU"WX /-v X X r\ 


L. I lAiOl I A 1 


TTTTCAGCCA 


AAACTATTTA 


AAAATACATT 


AACACAAATC 


1860 


25 




1 G 1 ATTGATT 


GTGTGTAACA 


TCAATAAATG 


ATACATTTAT 


TCCAGTAAAA 


1920 




x vjvjUwVj X AX X 


x iCAAAAGAG 


AAAAAGAGAG 


GATGTATCGT 


TGTGATAGAA 


ACATTTAAAG 


1980 




U«i X XljXAAX 


I GATAAAGAT 


GAGAGTGGTA 


AAGTGACACC 


AACTTTCAAA 


CAATTAT CGC 


2040 


30 




AC C 1 AAAGG A 


GATGTGCTGA 


TTAAAGTACA 


TTACTCTGGT 


ATAAATTATA 


2100 




fxrYvxn X 13^, XXX 


AVjL-VjAC l LAA 


GATCATAATG 


CAGTCGTAAA 


ATCGTATCCT 


ATGATTCCAG 


2160 


35 


GAATAGATTT 


AIjV- X\jtxAAV—A 


All GTTG AAT 


CCGAAGCACC 


AGGCTTTGAa 


AAAGGAGAAC 


2220 


AAGTAATTfiT 


AnUjAu X 1 A X 


G A C CTAGGTG 


TCAGCCATTA 


TGGCGGTTTT 


AGTGAATATG 


2280 




CGGGHYST A A A 


A X CAv?AA 1\jG 


ATTATCAAG C 


TTCCTGATAC 


TTTAACATTA 


GAAGAATCAA 


2340 


40 


TGATATATWS 


CACAGC 1\jG 1 


TATACTG CCG 


GTTTAG CAAT 


TGAAAGACTT 


GAAAAAGTTG 


2400 




G AATG A AT A T 


X vjAAGA IVjOT 


CCTGTACTCG 


TTCGCGGTGC 


TTCAGGTGGT 


GTCGGTACTT 


2460 




tag rjvffr a rr 


V^AXvjC X XAA1 


GAACTTGGTT 


ATAAAGTTAT 


CGCAAGTACA 


GGTAAACAAG 


2520 


45 




X LAA X X AL X 1 


GAACTTGGTG 


CCAAAGAAGT 


TATCGATCGA 


CTTCCTGTTG 


2580 




AAGATGATCA TAAAAAGCCA CTCGCATCAT CAACTTGGCA AGCTTGTGTA GACCCTGTTG 2640

GTGGCGAAGG TATTAATTAT GTTACAAAGC GTTTAAATCA TAGTGGGTCA ATTACAGTTA 2700

TTGGTATGAC TGCCGGTAAT ACTTATACTA ATTCTGTATT CCCTCACATT TTAAGAGGTG 2760

TAAACATTTT AGGAATTGAC TCGGTATTTA CTGCTATGAA ATTAAGACAG CGCGTTTGGC 2820
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TTGATGAACT TCCAGAACAA CTTAACAAAG TAATTAAACA TGAAAATAAA GGGCGCATTG 294 0 

TTATCGATTT CGGTGTAGAT AAATAGTATT CATGAAAAAG ACATCCCGTT ATGCGAGATG 3000

TCTTTTTTAA TTTAGTATTT GATATACATA CCGCCTGAAT CTGGTTCGGT AGGTATAAAT 3060

CCAAATTTTG TATATAATTT ATCCGCTGGG TAGTCTGCAA TCAGAcTAAC GTATGTACTC 3120

TCAACAGCCA CACCTTTAAT ATATTGCATA ATATGCTCCA TAATTAGACT GCCGTAACCT 3180

TGACCTTGGT AACTTTTCAA AACTGCAATA TCAACAATTT GAAAAACAGT TCCGCCATCG 3240

CCAATCACTC TACCCATACC AATTAACCGA TCTTTATCAT ACAAGGTTAC TGTAAATAAG 3300

GCATTAGGTA ATCCTTTTTC aGCTGTTCGC GCGTCTTTGG ACTCATACCT GCGTTAATCC 3360

TTAATGCGCA ATAATCCTCG CAAGTCGGAA TATCATATGT CACTTTAACC ATTATTTACC 3420

CCACTTTTCA TCACACAATA TATCAACCTA GTATAAATGT TTATTTACAA TAGTCTTATT 3480

CGCTTCTTTA AACACTTCAT GATGACTTGA AACATAACCC TCTGCATTCG CATCTGGTTG 3540

GATATATGTT TTAGCAAGGT TCGCTGCATT TGCACCATCA CTAAATGCAC TTGCAATTAG 3600

ATGTGATTTT GCATCATGAT AAACAATATC TCCACACGCA TAGATACCAG GTATACTAGT 3660

TGTCGTATTA CCAAATCCTT TAACACGACA ATCATCATGC ATATCTAGCT TTGAAGATGT 3720

TtCACTCAAT AATGTATTAC AACGATCAAA CCCATGACTA ATAATGACAT CGTCAAATTT 3780

AACTGTATGC CTATCGCCAC TTTCAACATG TTCCAAAACA ACTTCACTTA TATGCGTTTC 3840

ATCATCATTG CCGACCAAGT ATTTAATACG TGTTTTTGGG CATAGTTTCA CATTTAAATC 3900

TGTCACCAAC GTTTTCATCG CTTCATGACC ACTTACATCT TCTTTTCGAT AAACAACTGT 3960

CACGCTTTTA GCAATCTTGG CAATATCATG CGCCCAATCT AATGCTGTAT TTCCTCCACC 4020

TGATATTAAT ACATCTTTAT CTTTGAAACG TCTGTAACTT TGTACAACAT AATGTAAATT 4080

AGTT5ATTGA -TATCTCTCTA- CACCTTTAAC- ATCTAATTGT TTTGGATTAA TAATACCCGC 4140 

ACCAATTGCA ATGATAACTG CTTTCGATGT ATATATTTCT CCCGCTTCTG TTTCAACTTC 4200 
GAAATGACGT TCTGCCTTTT TCCTAATATC TACCACACGT TCATTCAAAT GAACTTCCGG 4260
TTTAAAATAT AATCCTTGCT TAATTGTATC TTTTAAAATT TCATGACAAG GTTTTGGCGC 4320
AATGCCGCCA ATATCCCAAA TAATTTTTTC AGGGTAAATT CTCATCTTAC CCCCTAATTC 4380
AGATTGAACA TCTATCAATC TTACAGACAT ATCTCGCAAT CCAGCATAAA AGCTTGCATA 4440

CAAACCAGAC GGACCGCCAC CAATGATTGT AACATCTTTC ATTATGTGCC TCCTATGACT 4500
CTCTATATTC ATTTCTTTCA TTAACGTGCT CAAATTGATA ATTATTATCA TTTAAAGCCA 4560
TTATACTATT AATATTTATA TTGTTAAAAT AAATCGCATA GTTAGCCATG AATTATCAAT 4620
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GAAAGATGTG 


TATATTTTTT 


AGTTCTAGTT 


ATATTATTTT 


TTAAAAGACT 


CATCACGTGG 


4740 




TTCTTTAAGA ATTGCTTGTC TTAAAAGGAA AAATAGCAAC AATAAACCTG CAAGCATACC 4800


TGTGTGCCCA ATACCTGCAA AGCCTGCnAA TGCTTCTGGA GAGTATGATT TACCAGTGAC 4860

TTGGAAGAAT CCTTTTGTC 4879


(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1560 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:


ATAATGTCTT 


AGaTTGATTG 


111111 


TAAlTrnTl 


GAAATTAAAT 


TAATCTGTAS 


60 




yTAATAAAAA ATTTGAATAA 






TAG CT Ay AT A 


CTTTGTGAAT 


120 




innl 1 V — HA — Hi. 1 




/""TV* 7\ R f+ TV T* A TV 

(j 1\jAA(jA 1 AA 


CjAGTATTATA 


AATnATCTTT 


AAATAAATAT 


180 


25 


ATGTGAAGTA 


AAAATTACAC 


Vj I 1 AOvJA 1A1 


CGATTATGgT 


CATTTCkTTT 


AACATATTAA 


240 




CTgGGGaACG 


TTAAAAGTTA 


TV /^/^/*^1^<T*/^ |\H"»7V 


T Cy AAC t AAA 


AACAAGGTCA 


CAGTAGTATG 


300 




TTTTAATCTG GCGTCTATTA CAAATAAAAA TTACATCTAT AATTATTCGT TTTCTTTTTT 360

GAAAGTAATA GCCAATTAAT ATCATACATA CTGGAGTGAC TATAAGGAGG ACATTATTAT 420

GAGAGCAGCA GTTGTAACGA AAGATCACAA AGTAAGTATT GAGGACAAAA AGTTAAGAGC 480

TTTAAAACCT GGTGAAGCGT TGGTACAAAC GGAATATTGT GGCGTTTGTC ATACCGATTT 540

ACATGTTAAG AATGCTGATT TTGGTGATGT TACAGGCGTT ACTTTAGGTC ATGAAGGTAT 600

TGGTAAAGTC ATCGAAGTTG CGGAAGATGT AGAATCATTA AAAATTGGAG ACCGTGTGTC 660

TATCGCTTGG ATGTTCGAAA GCTGTGGAAG ATGTGAATAT TGTACAACAG GTCGTGAAAC 720

ACTTTGCCGT AGTGTGAAAA ATGCTGGTTA TACAGTAGAT GGTGCAATGG CTGAACAAGT 780

TATTGTTACT GCAGACTATG CTGTGAAAGT ACCTGAAAAA TTAGATCCAG CAGCAGCGTC 840

TTCTATTACA TGCGCAGGTG TGACAACTTA TAAAGCTGTA AAAGTAAGTA ATGTAAAACC 900

TGGACAATGG TTAGGTGTTT TTGGTATAGG TGGTTTAGGT AACCTAGCTT TACAATATGC 960

TAAAAACGTT ATGGGGGCTA AAATTGTTGC CTTCGACATC AATGATGATA AATTAGCATT 1020

CGCGAAAGAA TTAGGTGCTG ATGCTATTAT TAATTCTAAA GATGTTGATC CAGTTGCAGA 1080

AGTTATGAAA TTAACTGATA ACAAAGGATT AGATGCAACA GTGGTAACTT CAGTTGCTAA 1140



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30 






TTTACCTGTT GATAAAATGA ACTTAGATAT CCCAAGATtA GTGCTTGATG GTATTGAAGT 1260

AGTAGGTTCA CTTGTTGGTA CAAGACAAGA CTTACGTGAA GCGTTTGAAT TTGCTGCTGA 1320

AAATAAAGTA ACACCTAAAG TTCAATTAAG AAAATTAGAA GAAATCAATG ATATTTTTGA 1380

AGAAATGGAA AATGGTACTA TAACTGGTAG AATGGTTATT AAATTTTAAA AATATCAACT 1440

GACTATATAG ATAAAGAAGG TAGTGCTCTG AACACTATCA TTATTAATCA AACCCCGAGG 1500

TTTTCCTGAA AAGATAGTGG nAAATCCCCG TGTTTTTTGG GTTTGAGGnG GTTGTnTGTA 1560
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
GTCCTGTnGC TGCAATGAAT ACGCCTAAAA ATCCAGGGAT GTAATGGATA CTTTGTGGTA 60

GTACTAATGA TAGAAATGAT AAAAATGAAA TCACAAAGGC TACGCTCGCA AAAGCTTGAC 120

ATGTACGCTT ATCGCCATAA TCTAACCCTG TACGTATATG TAATAAATAC TGTAATCCGA 180

TACTTAAATA CATAATTGCC ACGCATAAGA AGAATGGGAA GAATGTCTTT TCAAAGTCCG 240

GATATAGGCT GTTAGATAGG AAGACCATGA TGAACATATT AAACATCATA AACGAGACGT 300

CTTTGAATGT AACTTGACCA AATCGATTTG TAAAAAATGT TTGATGAGAC CACATTAACC 360

ATAAGAACAA ACTCATGACG ATGTATTTGA AAAATAAATC AGCTGAAATG GAACCGTTTT 420 

GTGTTGTTAA AATCACATGT GCAATTTTTT GAATGGCATA GACGAAAATT AAATCAAAGA 480 

ACW^T»fG~GAATCCTG'<A~CG " " 54 °" 

TAAAATTTTA ACTCCTTTAA GATGTGTAAT TAATTTACTA AGTATACTAT TTATTTTTTC 600 

TAGTGAATAG GGGCAGATTT GGCGATGAAG TGGAAGGAGA GGTGACTGCA AGGTAATTGC 660
GGAATTAACA ATCATCAGCG ATTTAATATT TGACTGGAGA CGTCATGGTA ATAAAAAATT 720

GATGAGAAAT TGATGGTGAA ACCAGCTGTG AATAsCGaTG cAATGATrsA TAGaATTTAA 780
TTAGAGTCAT TACGCGaAAT GATTAATGAT AATTTGTGGT AAATCAAAGC aTAATTTTGT 840
ACTATAGATG AGGATGATAG AGCATATTTA AGAGGGTGAA ATGTTAAAGT GAAACCGTTT 900

ACGTTTCCGA TTGCCCAAAC AAATTACATC ATTGTATAAT ATGATTTGTT AAATGCATAA 960

CAAGAATGAA AATGTAACAT ACGTAGCAAT TGGTTTCATA AATTGGATGT TAGTGGCGTA 1020 



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TGACGAGAGT CGTATTAGCA GCAGCATACA GGACACCTAT TGGCGTTTTT GGAGGTGCGT 1140 

TTAAAGACGT GCCAGCCTAT GATTTAGGTG CGACTTTAAT AGAACATATT ATTAAAGAGA 1200 

CGGGTTTGAA TCCAAGTGAG ATTGATGAAG TTATCATCGG TAACGTACTA CAAGCAGGAC 1260 

AAGGACAAAA TCCAGCACGA ATTGCTGCTA TGAAAGGTGG CTTGCCAGAm ACAGTACCTG 1320 

CATTTACGGT GaATAAAGTA TGTGGTTCTG GGTTAAAGTC GATTCAATTA GCATATCAAT 1380

CTATTGTGAC TGGTGAAAAT GACATCGTGC TAGCTGGCGG TATGGAGAAT ATGTCTCAAT 1440 

CACCAATGCT TGTCAACAAC AGTCGCTTTG GTTTTAAAAT GGGACATCAA TCAATGGTTG 1500 

ATAGCATGGT ATATGATGGT TTAACAGATG TATTTAATCA ATATCATATG GGTATTACTG 1560

CTGAAAATTT AGTAGAGCAA TATGGTATTT CAAGAGAAGA ACAAGATACA TTTGCTGTAA 1620

ACTCACAACA AAAAGCAGTA CGTGCACAGC AAAATGGTGA ATTTGATAGT GAAATAGTTC 1680

CAGTATCGAT TCCTCAACGT AAAGGTGAAC CAATCGTAGT CACTAAGGAT GAAGGTGTAC 1740

GTGAAAATGT ATCAGTCGAA AAATTAAGTC GATTAAGACC AGCTTTCAAA AAAGACGGTA 1800 

CAGTTACAGC AGGTAATGCA TCAGGAATCA ATGATGGTGC TGCGATGATG TTAGTCATGT 1860 

CAGAAGACAA AGCTAAAGAA TTAAATATCG AACCATTGGC AGTGCTTGAT GGCTTTGGAA 1920

GTCATGGTGT AGATCCTTCT ATTATGGGTA TTGCACCAGT TGGCGCTGTA GAAAAGGCTT 1980

TGAAACGTAG TAAAAAAGAA TTAAGCGATA TTGATGTATT TGAATTAAAT GAAGCATTTG 2040

CAGCACAATC ATTAGCTGTT GATCgTGAAT TAAAATTACC TCCTGAAAAG GTGAATGTTA 2100 

AAGGTGGCGC TATTGCATTA GGACATCCTA TTGGTGCATC TGGTGCTAGA GTATTAGTGA 2160 

CATTATTGCA TCAACTGAAT GATGAAGTTG AAACTGGTTT AACATCATTG TGTATTGGTG 2220 

GCGGTCnAAC TATCGCTGCA GTTGTATCAA AGTATAAATA ATAAGAAAAC AGGTTATCAC 2280 

AACA$TATTA ATtACATGTT GGCATAACCT GTTTTTATTT GTTTATGGAT TTATTGGGTA 234 0 

ATATTAGTCA TTTGATGGTT TAATTGCAAA TGCTCTAACA GGGAACCCAG GTGCATCTTT 2400

TGGTTTAGGG CTGATAGCGT AAATGATGGC GCCACGAGTT GGTAATTGAT CTAAATTAGT 2460 

TAATAACTCG ACTTGGTATT TATCCTGACC AAGAATATAA CGTTCGCCAA CTAAATCACC 2520 

ATTTTTTACA ACGTCCACAG ATGCATCGGT ATCGAATGTT TCATGACCAA CAGCTTCAAC 2580

ACGACGTTCT TCAATTAAGT ACTTCAAAGC ATCTAATCCC CAACCCGGTG CATGTTGTTG 2640 

TCCGTTCGCA TCTTTGTTTT CAAACTTTTC AATATTAGGC CAACGTTTTG ACCAATCGGT 2700 

ACGAAGTGCA ACAAAAGTGC CAGGTTCAAT AGTACCATGC TCTTTTTCCC ATGCTTCTAT 2760

ATGCGCACGT GTTACGATGA AATCATTGTT GTTCGCTACT TCTGTTGAAA AGTCTAATAC 2820 
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AAAGTGAATT GGTGCATCAA TGTGAGTACC ATATTGCGTT ACAATATTCC AACGTTGCAC 
ATAGAAACCA TGATCTTTAA CCGTGAATAA AGTTGAAACT TCGCCTTTTT CAAACTCACT 
AAAACGTGGT ATTTCCGGAT CAAATGTATG CGTTAAATCA ACCCAAGTTG CTTGTTTTAA 
AGTATTTAAT TGTTGCCATA AAGGATATTG TGTCATAAAA TCACCCGTTT TTAGTTTATT 
ATATGATAAA TGCTGCGATT ATTCTTGGCG TTTAGCTTTA ACAGCATTCA CAAGCACAGT 
CAATGCATCT TTAACTTCTT CTTCTTTTCG CGTTTTTAAA CCACAGTCAG GGTTTACCCA 
GAATAATGAG CGGTCGATTT GTTGTAGTGA ACGATTGATT GCTGTAGTAA TTTCTTCTTT 
TGTTGGAATA CGTGGACTAT GAATATCATA TACACCTAGA CCAATACCTA AATCATAATT
AATATCTTCA AAGTCTTTAA TTAAATCACC ATGGCTACGA GATGTTTCAA TTGAAATAAC 
ATCAGCATCT AAGTCATGAA TAGCATGAAT GATTTGACCG AATTGAGAAT AACACATATG 
TGTATGGATT TGAGTTTCAT CACGAACTGA AGACGTTGCA AGTTTAAATG ATAAAACAGC 
ATCTTTAAGA TATTGTTCGT GATATTCAGA GCGTAATGGT AAGCCTTCAC GTAATGCAGG
TTCGTCAACT TGGATAACTT TGATTCCTGC AGCTTCAAGT GCTAATACTT CTTCGTTGAT
TGCTAAAGCA ATTTGATCTT GAACGACTTT ACGTGGTAAA TCAACACGTT CAAATGACCA 
GTTTAGAATT GTTACAGGTC CAGTTAACAT ACCTTTAACT GGTTTATCTG TTAAGCTTTG 
TGCATAAACT GTTTCATCAA CAGTTAAAGG CGCTGTCCAT TTTACATCAC CATAAATGAT 
TGGTGGTTTT ACGGCACGTG AACCATATGA TTGCACCCAA CCGAATTTAG TTACTAAGAA
ACCTTGTAAT TTTTCTCCGA AGAATTCAAC CATGTCATTA CGTTCAAATT CACCGTGAAC 
TAATACATCT AAGCCAATGT CTTCTTGAAT TTTAATCCAT CGAGCAATTT CATTTTTTAA 
GAATGTTTCA TATGCTTCGT CTGTAATGCG TTTGTTCTTC CAATCTGCAC GGTATTTTCG 
^CTTCTCGG CTtTCTGGGA ATGATCCAAT AGTTGTTGTT "GGTAAATCCG^GTAAGTTCAA ~ 
ACGTTTTTGT TGTTGTTCAA TACGTTGCGC GAATGGTGAT TGTCTTGAAG TACGCACGCT 
TTCGAAATCA TAATCTAAGT TTTTGAATGA TTGATTTTGG AAACGCTCAT AACGTGCTTT 
TAATTTATCA TATTTAACAC TATCGTTTTG ATTAAATAGG CGACGCAATG CATCTAATTC 
GTCTAATTTT TCAGTTGCAA AGCTTAAGCC TTCGCCAACA CTTGTATCTA ATGTTTCATC 
ATCTAAAGAT ACTGGAACAT GTAATAATGA AGATGATGGT TGAATGACAA GTTCATTAGT
GTGTGCTAAC AATTTATCGA TTAAGACTTT TTTAGCTTCA ATGTCACTTG CCCATACATT 
ACGACCATCA ATAATTCCAG CGTATAATGT TTTTGATTTA TCAAAATCTC CAGCTTCAAT 
TTGTTTAAGG TTATAGCCAT TATCATGGAC AAAGTCTAAA CCTATACCAC CAACAGGTAA 
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AACACCAGCT 


TTTTCGAAAT 


AGTCATAAGC 


TTCACGTGTA 


AT ATTTT CAT 


AG CTTTCG CT 


4740 




GTCGTCTGTA ACTAAGATTG GCTCATCAAC TTGAATGTAC TCAGCACCTG CATCAATTAA 4800

TGATTCAAAC ACTTCTTTAT AAAGTGGTAA TAACGTTTTA ACTTTTTCTT CAAAAGTTTG 4860

GTGACCGCCT TTTGATAATT TAACAAAAGT AATCGGACCA ACAATGACAG GGTGAGCGTT 4920

AACGTTTAAA GATTGGGCAT ATTTAAAGCG ATCTAATAAT ACATTGCGAC TCACTTTAGG 4980

CTCAACATTG TCCCATTCAG GTACGATGTA ATGATAGTTA GTGTTAAACC ATTTTATAAG 5040

TGCACTTGCA ACATGGTCTT TATTACCGCG AGCAATATCA AATAATAAAT CATCATCAAT 5100

AGTTCTTCCT TGGAAACGTT CAGGGATGAT GTTGAATAAT AATGACGTAT CTAATATATG 5160

GTCATATAAA GAGAAATCAC CAACTGGGAT GCTATCTAAG TGATAGTACT TTTGtAATAA 5220

TAAATTTyCT TTATGTAGAT CAGTTAATGT TTGATCTAAT TCTTCTTTAG AAATCTTCTT 5280


20 


IbLLLAA 1 AA 


CTTT CG ATGG 


CTTTTTTCCA 


TTCTCTTTTT 


CTAC CTAATC 


TTGGGAATCC 


5340 




J-AAtjl 1 IIjAI 


GTTTTAATTG 


TTG TCATAAT 


ATTG CCTCCT 


TGTGAGCAGT 


AATAGATTTT 


5400 






C-AAG 1 1 CT AA 


TGAATCTTCG 


ACATTTTGAA 


ACGGTGTGAT 


AATGTATAAA 


5460 


25 




ATTCATGAAC 


AGTATCGATT 


AAATCCTTTG 


AAAGCTTAAG 


ACTTAGTTCT 


5520 




CGTGTTTTGG CTTTATCATC TTTAACTGCT TCAAATTGTT GTAAAATTTC ATCTGACATC 5580

TTGATTCCTG GCACTTCATT ATGCAAAAAG AGTGCGTTTT TGTAACTTGC GATAGGCATA 5640

ATGCCTATGA AAAATGGTTT GTTCAAGTGC TTAGTGGCAT GGTAAATTTC AATGATTTTC 5700

TCTTTGCTGT ACACGGGTTG TGTTATAAAA TAAGACATTC CGCTTTCTAT CTTTTTCTCT 5760

AATCTTTTGA CGGCACCATA TAATTTACGA ACATTAGGGT TAAAGGCGCC AgcGATGTTG 5820

AAGTGTGTAC GTTTCTTCAG CGCATCACCG TCAGTGTTAA TACCTTGATT AAATCTTAGA 5880

GCGSGTTCAG TTAATCCTTT AGAATTAACA TCATAGACAT TGGTTGCACC TGGTAAGTGA 5940

CCAACTtTTG AAGGATCACC AGTTATGGCT AATATTTCGT TAACGCCAAT GAGCGATAAT 6000

CCAAGTAAAT GGGACTGCAA GCCGATTAAG TTTCGGTCTC GACATGTAAT ATGTACGAGT 6060

GGTTCAATAT TGTAATATTG CTTAATTAAG CTAGCAGCAG CAATATTGCT AATTCTGACA 6120

GTTGCCAATG AATTATCTGC GAGTGTTACC GCATCTACAT TAGCTTTATC AAGTTTAGCG 6180

ATATTTTCAA AAAATCTATC CGTGTCTAAA TGTTTCGGTG TATCCAATTC GATAATAACG 6240

GTTGGACGTT CTTGAACCTT AGATGTTAAT GATTGTCTAA CTTTATTTTG AGATGGATTG 6300

AAAAGTGCTT TCGTTGGTAT CGGAATCACT TTTTTGTCAT TAACAGGTTT AAGTGTCTGA 6360

ATAGATTCTT TAATAAATTT GATGTGCTCT GGCGTTGTAC CACAGCAACC ACCAATTAAA 6420
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TACTTAAATT CACTATTTTC AATATCTAAT AAGCTGGCAT TTGGATAACA AGATAAGAAT 654 0 

GCGTGCTCTG GTAATTCAAT ATGTGTGAAA GACTCTTGCA TATGGTGCGG GCCATGATGA 6600 

CAATTGAGTC CCACGATGTT TGCACCACAT TGAACGAGTT GTTTTAATCC TTCATTGATT 6660

GCCTGACCAT TAACTAAGTA ATTTGTGTTT GAAGCGGTTA ATTGAGCAAT GATTGGAATG 6720

TCGTATTTCT TTCTCGTTCG TGAAATGACA TTTGTTAACT CTTCTAGGTC GTAATACGTT 6780

TCGAAAAGTA GCGCGTCAAC GCCTTCTTCA ATTAAGGTGT CTATTTGAAT TTCAGTATGA 6840

TAAAGAATAG TTTGTAAGCT GATATCCTCT TGTTTGATAC CTCTAAACCC ACCAACTGTG 6900

CCTAATATAT ACGTATCTTT ATTTGCTGCT TTTTTTGCGA TGCGAACGGC GGCTTGATGT 6960

ATTGCTTTAA CTTTATCTTC AAGACCGAAT CGTTTTAACT TTTCAAAATT TGCACCATAA 7020

GTATTGGTTT GAATGACATC AGCACCGGCT TCAATATATG AACGATGGAT GCGTTCAACT 7080

TTATCTGGAT GGCTAAGATT ATATGCTTCT GGACAGGTGT CTAATCCTTC AGAGTATAAA 7140

ATGGTTCCTA TAGCGCCATC AGCTACTAAA ACATTATCTT TCAATTGTGT GAGGAATTGA 7200

CTCATTGAAT GCCTCCTTTA ATGCGTATTT GATGTCTGCA ATGAGTTCAT CAGGATCTTC 7260

GAGACCAACA CTTAATCGGA ATAGACCGAA AGTGATACCA CGTTCTTGTC TCACTTCTTC 7320

AGGTAGTGCA GCGTGAGACA TTGTTGCTGG ATGTGAAAGG ATCGTTTCAA CACCGCCCAG 7380

ACTCACTGAA ACGAGTGGTA ATGTCAGTGC ATCGACAAAT TGTTGTGCTT TAGACTCATC 7440

AGCTAAACGA AAGCCAATAA CGGCACCGCC ATTTTTAGCT TGTTCTAAAT GAGCAGTAGT 7500

GAGTCCCGGA TAATAAACTT CTGAAATTTC ATCTTGCTTT ATTAAAAATG ACACGATTTT 7560

TTGAGCGTTT TCGACAGATT GTTTAAATCT GATTGGAAAA GTTTTTAAAT GTTTAGCAAG 7620

TGTCCAGCTA TCCTGAGCAG ATAACATATT GCCTGTACCA TTTTGTATTA AATAAAGAGC 7680

GtC^TAATT GCCTCATTAT TAGTTATGAC AGCACCAGCA ATTAAATCGC TATGTCCACT 7740 

TAAAAATTTT GTAGCACTAT GAATGACAAT ATCAGCGCCA AGTAATAAAG GTGATTGACc 7800 

TAACGGTGTC ATAAATGTAT TGTCCACAGC TACCAGTAGT TCATGCTTTT CGGCTATTTT 7860 

AGAAACAGCT TTGATATCAG TAATTTTAAA ACAGGGATTC GATGGTGTTT CGATATAAAT 7920 

TAATTTTGTG TTTGATTGAA TGGCACCCTC GATTTGTTCG AGCTTTGTAG TATCTACGGT 7980

TGTAAATTCA ATATTAAATC GATTCAAAAT TTGCTCAGTG AGGCGAAAAG TACCGCCATA 8040

TACATCATCG GGTAAGATGA CATGATCACC AGATTTGAAA GTCAAAAGTA CTGCTGAAAT 8100 

AGCAGCAATA CCTGATGCAA AAGCAAAAGC GAATTTTCCC TGTTCTAATC GTGCTAACTT 8160 

CTCTTCTAAA AGTTCACGGT TAGGGTTGCC CTTCGTGCAT AATCATATTT AACATCGCCA 8220 
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TCCACACCTC 


' TACGPPAATP 


UAA x ATCALT 


TCTGTCTCTT 


TTGAAAGTGT 


CAT A CAATCT 


8340 


5 


CTCCAATCTG 


AGCTTTATPT 


& a T/ r "' PTTr" ^ TV 


TGATATCGCG 


TTCGATGTCT 


TCATAATTTT 


8400 




CAACACCTAG 


TGAT AAG CGG 


x\X inAnlAti 


CATCAATGC C 


ACGTTTATCT 


TTTTCAGCAT 


8460 




CTGOCATATC 


AACATGTGTT 


TPTSPTY^TR 

X LKju XTj 1 AAaj 


GGAAGGTCAC 


TAATGTTTCA 


GTACCTCCTA 


8520 


10 


AACTTTCTGC 


AAAAATGPAA 


ATY^TPTS R RT 


TTTCTAATAA 


TTTAGCGACG 


CTATAGGCCT 


8580 




TGTTAAGTCT 


TAAACTAAG P 


« 1 V-7V_V — X X 1 


GCCCGCTATA 


TAGTACTTCG 


TCAATTGCTT 


8640 




GAAGTGACTG 


APATTTM"l M rZi 


<j(_AALjX 1 1 TC 


TAGCGTTTGA 


TTGCGCACGC 


TCAATGCGTA 


8700 


15 


AATGCAAAGT 


TTTAAGTPPA 


wv» XAALAACA 


AATAACTATC 


TATTGGTGAA 


AGTGTTGCGC 


8760 




CAGTCATGTT 


GTGAAAATPA 


AALAALTG XT 


GCGCGAGTGA 


TTCATCTTTG 


ACGGTTACGA 


8820 




CACCTGCTAG 


TAPATrHTT a 


*TV"*T , ^* , /*V"' tv iv 

I Vj I v_CvjCCAA 


TATATTTCGT 


GGCTGAATGT 


AAGACTATAT 


8880 


20 


CAGCACCTTC 


r mc~vh.nTf*T'"T' 


GTTGAAAGAT 


AAGGTGTTAA 


AAAAGTATTG 


TCGATAATTG 


8940 




ACAATAAG CC 


X * r\\j\^ X 1 XM 


LAAAG1TGAT 


AGTATGGCTT 


TACATCAATA 


GCAATCATTT 


9000 


25 


GTGGGTTAGA 


T A TTPPTT/"* R. 


A IvjAATAATG 


CAACTGTTTT 


ATCAGTGATT 


TCTTTTTCAA 


9060 


CTTGTTPATA 


nlv- luinAAA 


TCAACGTACT 


TAAATTTGAT 


ATCGTATTGT 


TGCTCGTAAA 


9120 




ATTCAAATAA 


1 v_ l nnn 1 J. Vlj 


t- C-AL CAT ATA 


AATCGAATGA 


AACTAAAATT 


TCATCATGAG 


9180 


30 


GTTTAAATAG 


ATTAPHTfiTT 

A J. AUA Irtl 1 


AA 1 I Cj AATGG 


CTG ACATTC C 


ACTTGATGTA 


GCGAATGATG 


9240 


CAATACCATG 


V— A V—rVrvvT 1 X i. \J 


oLAAAACAGG 


TTTCAAATGT 


TGAGCGTGTA 


GGATTTTTAG 


9300 




TACGTGTATA 


ATP A AAA PPT 


/"""TV*^ TV • 1 1* 1 1 i " I » 1 1 /"i 

1 C(jA\L™GTC 


CTAGTTTTGG 


ATGCTTGTAG 


GCAGTAGATA 


9360 


35 


AATGGATTGG 


ATTCG PTATA 


vjLrtLLiibl 1 (j 


AATCATCGGT 


TAATGTGATT 


TGGGCTAACT 


9420 




GTGTATCCTT 


PAT A TTA APA 


rTTT/T'TTi'PTi 

<-L-U I LCTATA 


AGAJU^aAATA 


AAAAAAGCTT 


CCGTCCTTCG 


9480 




TACCCGAATG 


AATTRTJATA A 


& R R /~» TV TV TV 


AGCTTATGTT 


TCGCGGTACC 


ACCTTTATTT 


9540 


40 


GTTATTC CAT 


CG PTPAA ATA 


TV. ^ ^^^^^^ 

r\l-V_ 1 1A1 1 t_A 


GTACGCATTA 


AAAGTAAATA 


TGCTTACTGA 


9600 




ACAATTATCA 


CAATTAAAP T 


^7A/^TR R^"T*R TV 


PGATATAGTA 


ATGTGCTATC 


CCATACTTAT 


9660 




TAACAAAAAA 


TCGTG CGTAA 


AGAATCCAGT 


ACGC CATTTA 


ACAT CAATGT 


TAATACTGTA 


9720 


45 


TCGCTATAAC 


GGGCGAACCC 


GTAGACACCT 


CATATTGGCA 


TCAACACTCC 


AAGGCCATTT 


9780 




TCAAACACGC 


TTTCAAAATC 


TTCTCTCAGC 


TACTAAAGAC 


TCTCTGTATA 


AGCAGGGTGT 


9840 


SO 


GTTTTACTTy CCTCTTTATT GTGTTTACGT 


TTCATTAAAC 


TGTTATAAGA 


TATTAATTAG 


9900 


CTTACAGAGT 


AAAAAAAGAT 


TTGTCAACAA 


TTATTCAGAA 


AATTTTGATT 


TAAAAGTTAA 


9960 




TTTGTTTGTG 


AAATTGTAAT 


TGGTATCTTG 


AAGTTGAAAA 


ATGAATTATT 


TTTTAAATAA 


10020 
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TCAAATAAAA AGTGATGTGA GTGAATTGTC AAAAAGTGAA GATCAACGTA TTACTAAAAC 1014 0 

AAAAGATGAA CAAATTAAGC AAATAGATAT ATCGGATATC AAACCGAATC CGTATCAGCC 10200 

CCGAAAAACT TTCGATGAAA ATCATTTAAA TGATTTGGCA GATTCAATTA AGCAATATGG 10260 

AATTTTGCAA CCAATTGTGC TTAGAAAAAC AGTTCAAGGT TATTACATTG TAGTTGGTGA 10320 

AAGAAGGTTT AGAGCTTCGA AAATTGCTGG TCTAAAATAC GTATCAGCGA TTATCAAAGA 10380 

TTTAACAGAT GAAGATATGA TGGAACTGGC GGTCATCGAA AATTTACAAC GAGAAGACTT 10440 

AAATGCGATT GAAGAAGCTG AAAGTTATCA ACGTTTGATG ACAGATTTGA AAATTACACA 10500

ACAAGAAGTA GCGAAACGAT TGAGTAAGTC GCGCCCGTAT ATAGCGAATA TGTTGAGGTT 10560

ATTACATTTG CCGAAAAAGA TTGCTGACAT GGTAAAAGAT GGGCGACTGA CAAGTGCACA 10620

TGGACGAACG TTATTGGCAA TTAAAGATGA ACAACAAATG CTTAGGTTAG CGAAACGGGT 10680

TGTTAAAGAA AAGTGGAGTG TCAGATATTT AGAAAACCAT GTTAATGAAT TAAAAAATGT 10740

TTCGTCAAAG TCGGAAACAG ACAAAGTAGA TATAACTAAG CCTAAATTTA TAAAGCAGCA 10800

AGAACGACAG TTGCGAGAAC AGTATGGTAC CAAAGTAGAT ATATCAATAA AAAAATCGGT 10860

TGGTAAAATC TCATTTGAGT TTGATTCACA AGAAGATTTT GTGAGAATAA TTGAACAATT 10920

AAATCGTAGG TATGGTAAAT AGTTACACAA TTTTATATAA TAACTCTTTG TGCAAGTGTA 10980

AATAAATTGT AATCAGTGAC ATTTGATTCT AGAT 11014
(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6022 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

TCCCCTTATG GAATTTCACA TTCTAGTTTA CATAATATAT ATTATAGGAA GTTATATGTG 60 

TGTAACGCAA AAgGTACCCT ACATCATAAT CATTATCTAA TATCGTCACA TAACTTACTT 120 

ATGCTATAAT CATGGTATTA TATTGTTTGG AGTGATTTGA TGAGATTTGT CTTTGATATT 180 

GATGGTACGC TTTGTTTTGA CGGCCGATTA ATTGACCAGA CTATTATTGA TACATTGTTA 240

CAATTACAAC ATGATGGTCA TGAACTTATA TTTGCATCAG CACGTCCGAT TCGTGATTTG 300

TTGCCAGTTT TACCATCAGT ATTTCATCAG CACACATTAA TTGGCGCAAA TGGTGCTATG 360

ATTTCACAGC AATCAAAGAT TTCTGTTATC AAACCAATTC ATACTGATAC ATATCATCAT 420
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GCTGCACAAP 


' 1 "IV" ft. fY* PTV« 

i. x v^i/\v-Ajl_ x\3n 


AGAACGCGAT 


TTTTGAGCGT 


TTAGATCCAC 


ATAAGCTGGC 


540 




wt\J 1 iVJi n X X 


CI B. r VC % *T m T s f^' f~* A a 

vjA 1 Vj 1 i bCAA 


ATATCGACAC 


GCCAATCAAG 


AkTATTTTAT 


TAAATATAGA 


600 


5 


p rrr^j p a p a a 




TATTAGACGA 


G CT AGAT AAA 


TACCATCAAG 


AATTGGAAAT 


660 






1 LAAA 1 viAGT 


ATAACATTGA 


TATAACAGCG 


CAAAATATTA 


ACAAATATAC 


720 


70 


X U\^\ X X /i\«nA 




ATG CAGATGT 


TAAATATATA 


GCATTTGGTA 


ATGAC CACAA 


780 


TfJATATTnTP 




ATG CT AGT AG 


TGGCTATATT 


ATAGGACCAT 


CAGAAGCATA 


840 








TTGATAAAAT 


CAAACACATC 


AATAATAATG 


CACAAGCTAT 


900 


75 


1 X OV_>l/\i-lVj 1 V- 


TTR A A A A T» 

I 1AAAATCAT 


ATAAATAAAA 


ACACCCCTAT 


CAAATGATAA 


TCATTATCAA 


960 




CTATTTTAAT 


AAAATTCGTC 


CTCGAACATT 


TCTTCCTCTT 


CATCTAATCC 


1020 






GCCATTTCTC 


CATGTTCAAT 


TAACATGTTT 


AAAT ATG CAT 


CGCGGAGTTC 


1080 


20 




A I ATCATTAA 


TCATTTCTTT 


AAGACTATCA 


ATCCACATAT 


TTCTGCGTAA 


1140 






n^/"^*l u I*/** A A / it i it i * 

i LTTCAAL X L 


CGTTTAACAT 


CATTATATGT 


TTATTTGCTG 


CTTCTGGACT 


1200 




AGCTGTAAAG 


AGTAATGCAA 


TCATATGTTT 


ACATATCACT 


CGTCTTCCAT 


CAGCATGAGG 


1260 


25 


ALAA1 TACAT 


ATGGATTTTC 


TAGGATGTTC 


CATATCAATA 


TAACAACGAT 


ATACTTTGTT 


1320 




L»t-L-At_ 1\jH.L 


TTTACTTCAG 


CCTCATGCTG 


CGTTTCTGAA 


AATGATTTTA 


AGTTAATGAC 


1380 


30 


GCATTCACTT 

*Pfi R 'T*/™ , f , /" , ft. TT 1 

CL-tJAx\I 


TGATAATAAT 
AATG a TACT C 


TAAAGCCTCT 


TTCTATAGAA 


CGAATACTTG 


CAATATCAAG 


1440 


C ±~I~i~i~x ATTA 


TTATTTTTAA 


ATAAAGAaAA 


TAAAATAGAT 


1500 




AAGTGTCTAG ATTAAAATAC TTGATTTATC TATATTTTAT AACAAGTCTA GAATTATCGC 1560

ATTCTTAAAT AACTAATATG AAAATGcTTG CACTAATTCt TTTGTATAAG GGTGTCTATC 1620

AACATTAAAT AATTCCtCTA TTGCAAAATC ATCGACTATC ATGCCATCCT TAAGAACGAT 1680

aattCtatta actaagcgtt GTAACACGGA TAAATCATGA GAAATAACGA TAAAATGATT 1740




TAAGTTCGTA 


ATCGTTTGCG 


CI 1 1 1AATAT 


ATTGATTACA 


TTTTGTTCAG 


CTATAACATC 


1800 


TAAATTTGAA GTTATCTCAT CACATATTAA AACGCGAGGC TGTGCTAATA ACGAACGCAT 1860

GACATTAAAT CTTTGTAATT GTCCGCCACT CACTTCGCTT GGTAATTTAG TCAATAATTG 1920

CGCGTTTAAC TCAAAAGTAG ATAAATGTTG TAATAATAAT TGATCCTGAG CAGTATTATC 1980

AGTTAGACCT CTGTAATAAT ATAACGCTTC TTTTAATGAG GTCTCAATCG TCCAATCAGG 2040

GTTAAAGCTA GTTAAAGGGT GTTGGAAAAT CGGTAACACA GCATTGTCAC TTAAGTAAAT 2100

CTCTCCTTTA ACAGGTTTAA ACAAGCCAAG AACCAATGAA GCGAGCGTAC TTTTACCACA 2160

GCCACTTTCG CCTAAAATAC CAACATTTTC TCCATCAGGT ATAGTAATAT TGATATCTTG 2220






EP0 786 519 A2 

CCCTCTTTAA TTGTGTTCTA TATTTAATTA GACGTTCAGT ATACGGATGC AAATGCTCAT 2340

ACTTGAAATG ATTAATATTA CCTCGTTCAA TGATTTGACC TTCTTTTAAA ACATAAATGT 2400

ACTGACAATA TTTCAATACA TGACTTAAGT TATGTGTGAT AATAAATAAT GTTTGACCAT 2460

GTTCTAATAC AATATGCTGT AATAAATCCA TCACTTGATT ACCGTTCAAA GCATCCAATG 2520

ATGCAACTGG TTCGTCTGCA ATGATTAATT TAGGCTCCAA CATGAGAACG CTTGCTATGT 2580

ATACGCGTTC AAGTTGGCCC CCAGAAAGTT GGAAACTATA TTTATTTAAT ATATCTTTGC 2640

TTTGTAAATT AACCCACGAC AAAGCCTTAT CAACTTTGGA CAAAGCCTCT TCTTTACTAC 2700

CTTTATAATG CTTACGATAA ATCGCAGTTA ACTGTTTACC TAATTTAGTA TGGTCGTTAA 2760

AACTTTCTGC ATAATTTTGA GAAATATAGC CAATTGTATG ACCATAATAT TGACTCAATC 2820

TACTAACATT TTCCCCATCA AATTGGTACG AATCATACGT GCAGCTTAAA TCAAATGGTA 2880

AATATTCAAG TAAAGCTTTA GCAATCAAAC TTTTTCCAGC GCCGCTCTCT CCAATCAAGG 2940

CATTAATCTG TTGACTAAAA ATTTTCAAAT CAATCCCTTT AATAAGAGAT TTCTCACTAG 3000

TATTCTTTAT TGTTAAATTT TGTATATCAA TGAGACTCAT CATATTCACC CCGTTGTTTC 3060

AGCAATCTAT CTCTTAGTGC ATCACCGGTT AAATTAAAAA TTAAAATAGT TATAGCAATG 3120

ACTGAAGCAG GTGCAATCAA CATAATTGGA TGAGACGAAA TAAAATCACG ACCTTGTTGC 3180

AACATAGCGC CCCaCTCTGG TGTTGGCGGT TGTGCACCTA ACCCAATAAA TGATAGTGAA 3240

CTTATATATA GAATGATTTT ACCGAAATCA ACGACCATCA AAACGATAAT AGCCGGTATA 3300

ATTTTAGGTG TTAAATGACG TATTAATATT GTTCTTGTTG GTACATGAAA TAATTGTGCC 3360

ATTTTTATAT AAGGCTTATT CATTTCGCTA TTAACTATAC TTCTAGTCAA CCTTGTGTAA 3420

TTCATCCATT TTATTAATGT AATTGAGATA ACTAAATTCC ATAAAGATGG TTGAAAAAAA 3480

CTTGcTAAAG CAATCATGAT GATAAATTCT GGAATACTTA GACCAACATC AATAAACCTT 3540
AACACTAATC GTTCAATCCA CCCTTTTTTG TATCCGGCAA ATAGACCTAG TGTAACACCT 3600

ATGACAACGA TAGCTATTAA TGTTAAAACA GTAACAAACA ATGTTGAACG TGCACCGATA 3660

ATAATTCGGG TAAATAAATC TCTCCCATAA TCATCAGTTC CTAATAAATG CAACCAACTA 3720
ATAGGTTCAA AAGTTTGTGA TAAATTGACT TTGGTTGCAT TTTCACTACT GACAAAGAAT 3780
TCCAGTACAA TTACCACAAA AATAAATGCA ACGAATACAA AAAATATCAG GTTATTCTTT 3840

GAAAATATTT TATGCATGAC GGTCACTACT TTCTGATATC AATGGTGTAT TGGTTTTGAT 3900 
TTTTGGATTT CCTAATTGTA AACGCTGCTT CGGATCAAGT AATAACGTTA ATAAATCAGC 3960 
AATCGTATTG ATAATAACAA CGAAGAAGCC AATAAATAAC ACGCATCCTT GAATAACAGG 4020 
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ATTTTCAATC ACTACAGTAC CACCTATTAG ACTGCCAAGT GAAATCCCTA GTAATGGGAT 4140 

AATCGGCAAA ATTGTTGGTT TTAGTAAATC ATGAATTAAA ATATAACGTT CATTCATACC 4200

GCGTAATCTT GATGCTTGTA CGATATTACT TTGCAATAAC ATCAATAAAT TAGAACGCAC 4260

TAAACGAATG ATGTATGCAC ACATACCTAA AGATAGCGTG ATTACAGGTA ATATAAACTG 4320

ACTTAGTATA ACGCTATCTA TATTCATTAA ATTTGTGACA ATAAATAATA AAATAATACC 4380

GATAAAGAAC GCTGGTAAAC TAATCGATAG TGTTGAGATC ACTCTAATCA CTTTATCCGT 4440

CCACTTATGA AATCGTTTGG CTGCTATAAT GCCGAGCGGT ATAGATATGC ATAACGACAC 4500

TACTAATGTT GAAAATGATA TGAGTAATGT TATGGGTGCA TAGTTGAATA ATATCTGTGT 4560

TACCGGTTCT TTTGATTCAA AACTTTTTCC TAAATTAAAA TGTAATAAAT GATTCATCCA 4620

ATGCCACCAC TGTACCAATA AAGAATCATT TAATCCCAAT TTATCTTTGG TTGCATTTAT 4680

TTGTTCCGTC GACACTTGTG CTACATCAAG ATGTAATATT TTATCAACAG GATTGCCTGG 4740

TGATAATTTC ATTAAAATGA ATGTAAGTGT AGAAATAACA AATAAAACAA CTATCATTTG 4800

CATCAGTCTA TACAACATAG ACTTTATTAT GAACATAATA GTCCCCCTCC TTGTGTAAGT 4860

TACTAACACT TTCTTTTTAC ATGAGAATGG CGCATGTATA TGCAACTTAC ATATTAAGAA 4920

CTAACGTTCA TTATAGTATT ATCCATAAAG AAATTGAAGT ATATTTAATT TTTTAACAAA 4980

ATCATTATAA AATATAATAT TTTGAATCAA GTCAACCATG TAAAATATAA AAAAGTCAAA 5040

ACAAAAACAA CTATAGCACT GTATTCCATC TCTTTCGAAA TAATTGTTAC TGCAGTGTAA 5100

CTTAAAAGTC GATGATTTTG TGCATATAGT TGTCGAATAT TATTTTTTAT CTTTACGGCG 5160

AAGTTCAGCG CCCTCATAGC CGTATTTTTC AATTTGCTTT TCTAATTTAC GCGCTTTTCT 5220

TTCTTTACGC CAATTTCTAG TAAAATACCA TAATAGAAAA CTAATTAATA AACTCATAAT 5280

CGCTAAAAAT GCAGCGTATC CTAATAATGG TTGATATTTT ATATCTTGAA AATTTGGAAT 5340

AAAAAATGCA AGCACACCTA ATATAACAAA TGTAATTACT GCAGATACAA ACCATTTATT 5400

TAAAACTAAG CAACAGAATA TTGTTAATAA AATCATTATT AATGTTGTGA TCCATAAATA 5460

ATTAGGCATA TCGAATAATG TCATATTCAT TCTCCTTTTA TTTCATTACT TTCCTTGTAT 5520

ACATTTTATT ATAAATTTTT AAAAACTTAA ACAATAGCAG TCAGTTTCAA GCAATATTCT 5580

ATCTACTAAT AGAAAAATCA TTGTTCCTTG CGACATGGAA ATCGTAACAT TATCGTTTAG 5640 

GAGACAAAAT TATGTATAAT GAATGTATTA TACCAAAGGA GTGATTATAT GTCTCAAGGT 5700 

TTACCTTTAA GAGAAGATGT TCCTGTTTCA GAAACATGGG ATTTAGTAGA CTTATTTAAA 5760 

GATGATCAAC AATATTATGA AAGTATTGAC GCTCTAGTAC AnCAAGCAAA TCAATTTCAT 5820 
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GAAAATATTT TAATTGCCTT AGATCGCTTA AGTAATTATG CAGAACTACG TTTAAGTGTA 5940 

GATACTAGTA ATATCGAGGC ACAAGTATTG AGCGCTAAAT TATCTACTAC ATACGGTAAA 6000 

ATTGTTAAGC CAATTATCCT TT 6022 
(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 476 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

CCATCAATAA TGTATACATG ATTGGCATCA TATTCCCCTT TAATTAGAGA GCTACGTACA 60

GTTTGTyTTA TTAAAGTAGA ACTAATAAAT AACCATCTCT TATGTGCACA AACACTTCCC 120

GCAACAATTG ATTCAGTTTT ACCAACCCGT GGCATACCTC TAATGCCAAT CAACTTATGA 180

CCTTCTTCTT TGAACAATTC AGCTAAAAAG TCTACTAACA AGCCTAAATC TTCACGCTCA 240

AATCGAAAGG TTTTCTTATC TTTTGCATCT TGCTCAATAT ATCTTCCATG TCTTACTGCA 300

AGACGGTCTC TTAATTCTGG TTTTTTAAGC TTTGTTATTT CAATTTCATT TATACCACGA 360

GCTATTTGCT CAAAACGTTC AACTTTTTCA AGATTGTCTG TTTTAATTAA AAGGCCTCGT 420

TTACCTTGAT CAACACCATT AATTGTAACA ATACTTATAC CTAACATACC TAATAA 476

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3633 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

AGAAATACAA CGAAGCATAT AAATATAACC GATCTTTTTT CTAATTGAAT ATTAAGTAAG 60

TGTATGTACT TTCTGGAAGT AGCACCTAGT rGGATTGTCC CTCCTACAAC AGGCCAAAAA 120

TTTTTATTTT TAACTGGCTT AACAGTGTTC AGTTTTTCAT ACTCTTCTCT ACTAATTTTG 180

GCGCACCTTT TTGGAATGAA CCAATTAATA AATGGAAAAA AGTATACAAG CCAAGTTCTT 240

ATTACATCGA CCATTAAATA CTCATCATCA TACTTAATAA CTCTGTATTT CGGATTTTTA 300

TTGATAATTT CGGTTTCACA AAGCAATAAT TATCACTTCC TATTAATAAC AAATTCACAC 360
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TTATATGACC TTAAATATAT AACATGAATC TTTTTGTCTA TTATTGAAGA CATATTTATA 4 80 

AAGAAAAATA GCATTGTCAT AATAACCCAA GCAATAAATA CTATAATATT TTGGATAGAT 540 

AAACTAATCA rrACATCTAA GAACATGATT gATAATCCAC CACAGAAAAA ATAAGAAAAT 600 

AGTACAAAGC AAAGATTCTT GAATGATGGA AAAATCATAA TTTTTCCATT GCTACTCCGA 660 

TCATTATAGA TAGATAACTT TACTTTCTGA TTTAAATATA TATAAAACAC TAGAATACTT 720

AATAATAAAA CCGAACAAAT GATAATAACG CAATTTTTTT CTAAATGAGA ATCAGGTATA 780

TATATTTTAT CTCTAAACAT AGTGCCAAAT AAAAGTATGC TACCTATAGC TGGCCATAAA 840

GCTTTaTTTT TAACTGGTTT GACAATATTT AAATTATCAA AATCTTCTCT GCTGATTTGG 900

ACATATTTTT TTGGTATTAA CCAATTAATA AACGGAAAGA ACAAAACTAA CCAGGTGCTT 960

ACTAAATCAA TCATCAGATA GTCGTTTTTA TATTTAATAA TTCTATATCT GGGATTTTTG 1020

TTTACAACTC TAACCTCGCA AAGCAATATC TCCACTTCCG TCTCGTTGGT TTTATATCTA 1080

ATACACTTTC AGATACTTTA TAAGTGTTTT GTATTTTAGT AACATACTAT TTTCCTGTTT 1140

ATTACTTAAC TTACGAACTA CAATCTAAGT TTAGTAATTT CTATTGCTTT TTAAGTTTGG 1200

CATAAACCTT TTTATTACTA ATTGAGCCCA TGCTTATTAG AAAGAAAAAA ATTGTAATAA 1260

TAATCCACAT AATAAATACC AGTAGATTTT GAGGTTTTAT AGTCATTAGC CATATTAAAA 1320

ATAATATAGA ACAACCTCCT AATAATAGAT ATGTGAAAAC TATAAAACTT CCATCTTTAA 1380

AAGTAGGCAC TAATATAACC CTATTTTCAT TATCTAGATT ATCATCATAT ATCTTTAGTT 1440

TAAGCTTTTT ATTTAAGTAA ATGTAAAATG CTGCAATACC TATAAATCCT ATAAAACATA 1500

AAGATATTAA AATCTTATTA TCTAATTGAA CTTCAAACGT ATGTACATAT TTCCGTAAAA 1560

TAACTACAAA TAAAAACGAA CTACCAGTAA CTGGCCAGAA AATATTATTT TTATTTTGTT 1620

TATCAACATT TAAATTTTCA AGTTCCTTCT CACTAAGTTT TGCATACCTT TTGGGAATGA 1680

ACCAATTAAT AAAAGGAAAA AAGTATACAA GCCAAGTGCT TACTAAATCA ATTAACAAAT 1740

ACTCATCATT ATATTGAACG ACTTTATATC TCGGATTTTT ATTAATAACC TTAATATTAA 1800

AAAGCAAAAC TCACCACGCC CATTTCATTG GATTTATATG ATTGCTAATA ATATTTTTAG 1860

CTTCACTAAC AGCATTCCCA ACACTATCCA TGGATTTTTC TGTAGTTTTT TTAACAACAT 1920

CTATACTATT ATCGATTTTA TGCCCTACCC AGTCTACTTT ATCTTTTAAT CCAAAAATAT 1980

TATTTTGATA AATTAAATCT GTTCCTAATG CAAATACTGT ACTCATAGCC AAACCTGCTA 2040

AAATCACCCA TCCTACTGGA TTACTTCCTA AAACAAAAGT CGCTAATCCA GCTCCAACTG 2100 

CTGTCCCTGC AGATCCAGCT GCAAGCGTgC ATACCATTAT GCGACAACGC CTCTCCAAAT 2160 
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CCTTTACCTA 


GGTATTTTCC GCCTTTTGCA AATTTACTAC CATTTTCTAT AAACACATTA 2280 

CCTGATGTAC GTTTGACTTC CACAAATGAA TTTGGACCTG CTGGGCCTTT CACTCCACCT 2340 

GCTGTATTGa TAAATACACC GAATTTACTT GcATTTATAC CGTCTTGCTC TAAAAGTGTT 2400 

GACGTAATAT CTAATCCTAT ATCTCTTTTA ATACTGTCTT TATTGTCATT TATATATTTC 2460 

AATATACTTT TCGGGATATC GTCTTCTGGA TGTTCTTTGG CATATGCCTT TATAACAGCA 2520 

AAGTCTGCTT TATTTAAAGT TTCTTTCTCT GCTTTATGTT CAATTTTCCC CATAGCAACT 2580 

TTCAAATATT TTTCATGACT TGCTTTGGCC CAATCAAGTT CTTTACCTGA AGGAATATTA 2640 

AATTGATTTG TTGAAAAGTT CCAAAAATTC TGCGCTTGGG TAAGTCCTTG TTGGACAATT 2700 

TTTTGAAATT CTTCAACTTC TTTAAATATT TCTGGTGATT TTTGATTAAA CTCACGCAAT 2760 

TTGCGTAGCT TCTCTTCTAA TTCATGTTTT TGTTGACCTA ATGTTCGTAT TATTTGTTGG 2820 

TTCGATGAAA TGGCTTGCTG ATTATCGGAA GCATGCTTTT TCAAATTGTT ATTCAAATTT 2880 

TCATATCGCG TAATTTGTTG ACTTAATGAT CTGATATCTT CTTCAAGCTC TGATTCTTTT 2940 

AAAGATATGC TATCAACCTC ACTCGTATAA CGTGACACAA AATTaTCGCA AGCTTGCTTC 3000 

GTTAAATCAC TCAATGTTTT CATACTTGTT GATAATGGAA TTAACACCGT ACTAAAAAAT 3060 

TGCTTAGCTG ACGTATACGC TTTCCCTTTA AGCGCATCAT CATTAATAAA TTGAGTAATT 3120 

GCTTTTTCCA ACGCATCATA ATTTGAATTC ATTGTTTGAC TCAAATTCCC CACACTTGAA 3180 

GCTTGGTTTC GAGATCTGTC TAAATACATG TCAATACTCA TCGGCATGCT ccrrri'TCAA 3240 

AAATATATGA TTTTCAAACT ATTTAAAATC AAATGCTTTT TACATCTACA AAGTTGTAAA 3300 

ATTTTAAAAC TCGGCGATGA TTATTTCTTA TGTAAAGGAG TCTAGATGCA GGTAAATTGA 3360 

GATAACATGT CGCCTTTTTT CTTATTTTAG CATATGGATA TAATGGTGTC i AT ATT 3420 

CGCAATTAAT CAATAAAAAT TATCTTTCAA TATTTTAATT TTATTGCGAC AACATCCTTA 3480 

ACATTAAATA TATTAATATC TCAAAATATA TTCACTATTA AAATATGTCA TCAGTTGTTA 3540 

AAAGTATTTC CTCATCATGC GAAATATCAA AACGTATCTA AAATACGAAT AAGTTTATAC 3600 

AATCACACAA CATCATCATT CAAAATTTTA TTG 3633 

(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TGATACGAAt GCATTACAAT TCATATGCAA CAT ACAATT C CTTCTACAGC AAATGAAGTG 60 

AAACAAATAG TTGATGTGAC ATCTGTAGCA GAAAATGATA CGCATTAGTC ATAAAATTAA 120 

ATGGAAATGT CGATGAAGTG TATCAGCAAT TACAGCGATT AATTAAGAAT GCTAATGTCG 180 

AAGAGAGTGA GAATACTGAC AATATTAATA GTCAAGATAC AAGTTATACA CCTCAAGTAA 240 

AAGTAACAAC ACCAATTTTA GTGAAAGCAC CAATCGCTGG TCGTCGTATT TTACTTAAAG 300 

AAGTAAGAGA TTCAATTTTT AGAGAGAAAA TGGTAGGTGA AGGCTTAGCA ATCAAAGCTC 360 

ATGAAGAATC CAAAGTAATC GCACCGTTCA ATGGTTTAAT ATCTATGATT GTACCAACTA 420 

AGCATGCAGT TGGTATTCAA TCAGAAGACG GTGTGGACAT AGTCATTCAT ATTGGCGTGA 480 

ATACAGTTGA CTTGGAAGGT AAAGGGTTCA AGTGCTTTGT AAAGCAAAAT GATCATGTTG 540 

AAGCAGGGCA AACGTTGTTG CAATTCGACC AGCAATATAT ACAACAACAA GGCTACAATG 600 

CTGACGTTAT TGTCGTTATT AGCAACTCTG CCGATTTAGG AAAAGTAGAA CTGACAATGA 660 

ATGAAATCAT TACGACTGAA GATGTTATTT TTAAAATATT TAAAAACTAG GAGTGTGTTG 720 

TAATAATGAC AAAATTACCG CAAAATTTCA TGTGGGGTGG CGCTCTTGCC GCAAATCAAT 780 

TTGAAGGTGG ATATGATAAA GGTGGTAAAG GGTTAAGTGT AATTGATGTT ATGACGAGTG 840 

GTGCACATGG CAAAGCACGT CAGATTACAG AATCTATAGA TCCCAATCAC TATTATCCAA 900 

ATCATGAAGG TATTGATTTT TATCATCGTT ATAAGGAAGA TATTGCCTTG TTTAAAGAAA 960 

TGGGATTGAA ATGTTTACGT ACGTCGATTG CGTGGACACG TATCTTTCCG AATGGGGATG 1020 

AAGATGTGCC AAATGAAGAA GGACTCGCCT TTTATGATCG TATCTTTGAT GAATTAATTG 1080 

CACAAGGTAT TGAACCTGTT GTGACGTTAT CACATTTTGA GATGCCACTT CATTTAGCGA 1140 

AACATTATGG TGGATTTAGA AATAGAGAAG TTGTCGATTA TTTTGTGCAT TTTGCGCGTG 1200 

TTGTATTTGA AAGATATAAA GATAAAGTTA CATATTGGAT GACGTTTAAT GAAATTAATA 1260 

ATCAGATGGA CACATCAAAT CCTATCTTTT TATGGACGAA TTCTGGGGTA GCATTGACAG 1320 

AAAATGATAA TCCTGAAGAA GTCyTGTATC AAGTAGCACA TCATGAACTT TTAGCCAGTG 1380 

CyTTAGCAGT TCGTCTTGGT AAAGaGATtA ATCCgAaGTT TAAGATTGGr ACmATGATTt 1440 

CAmaTGTACC CmTTTATCCa TAwTCGTGTC ATCCGAAAGA TATGATGGAA GCACAAATTG 1500 

CGAATCGCTT ACGTTTCTTT TTCCCGGATG TCCAAGTGAG AGGTTATTAT CCAAGCTATG 1560 

CTAAAAAAAT GTTGGCACGA AAAGGATATG ATGTTGGATG GCAAGAAGGG GACGACAGTA 1620 

TTTTACAGCA GGGCACGGTT GATTATATTG GCTTTAGTTA TTACATGTCT ACGGCTGTAA 1680 

AACATGATGT TGATACTACA GTTGAAAACA ACATCGTCAA CGGTGGTTTG AATCATTCTG 1740 
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GATATACATT GAATGTGTTA TATGATCGTT ATCAGTTACC ACTTTTTATT GTGGAAAATG 1860 

GTTTTGGTGC AGTTGATGAA GTGGTAGATG GACATATTCa TGATGATTAT CGCATTGAAT 1920 

ATTTAAAAGC ACATATTACA GCAGCGATAG AAGCAGTTGA TCAAGATGGT GTAGATTTAA 1980 

TCGGTTATAC ACCGTGGGGA ATCATTGATA TTGTTTCATT TACAACCGGT GAAATGAAGA 2040 

AACGCTATGG TTTAATATAT GTTGATCGAG ATAATGATGG TCATGGCACG ATGGAACGCT 2100 

TGAAAAAAGA TTCGTTCTAT TGGTATCAAC AAGTGATAGC ATCAAATGGA GATAAATTAT 2160 

AAAGGTATAT TATAAGTATT TTAGGGTTAG AGCCCGAGAC ATAAATTAAT ATAGTAGGAC 2220 

CTACAGTGTT ATAATGGCGG gCCCCCAACA CAAAGAATTT CGAAAAGAAA TTCtAcAGGT 2280 

aATGCaAGtT GGCGGGGcCC AACACAGAGA AATTCGAAAA GAAATTCTAc AGGTAATGCA 2340 

AGTTGGGGAA GGACAGAAAT AAATT 2365 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CTGCGATACG 


ATTTGTTGAA AGTGGGGAAA ACAAAAAAGT TATCATTACC AATTTAGAGC 60 

AGGCATACGA AGCTTTGATT GGTAATAAAG GTACACACAT TCACATGTAG CACTTTATCA 120 

CGCGACAAAA CATTAAATAT GTTTCTCCGT TGATTCAAAT GAAAAAGTTG TCTGCTGACA 180 

CTTTGCAAGG TTTGAAGGAG TTTAACTTAT GACAGAAAAC TTTATTTTGG GTAGAAATAA 240 

TAAATTAGAA CATGAACTAA AGGCATTAGC AGATTACATT AATATACCAT ATAGTATATT 300 

ACAACCATAT CAAAGTGAAT GTTTTGTCAG ACATTATACG AAAGGCCAAG TTATTTATTT 360 

TTCGCCACAA GAAAGTAGCA ATATTTACTT TTTAATTGAA GGTAACATTA TTAGAGAACA 420 

TTACAATCAA AATGGAGATG TATATCGTTA TTTTAATAAA GAGCAAGTAT TATTTCCAAT 480 

CAGTAACTTA TTTCATCCGA AAGAGGTTAA CGAATTGTGT ACAGCATTAA CCGATTGTAC 540 

AGTTCTTGGA TTGCCTAGAG AATTGATGGC CTTTTTGTGC AAAGCTAATG ATGATATATT 600 

TTTGACACTT TTTGCATTAA TAAATGATAA TGAGCAGCAA CACATGAACT ATAACATGGC 660 

ATTAACAAGT AAATTTGCTA AAGATCGAAT TATCAAATTG ATATGCCATC TATGTCAGAC 720 

AGTAGGATAC GATCAAGATG AATTTTATGA AATCAAACAG TTTTTAACTA TTCAACtCAT 780 
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TGAAAAACTT GTTGTTAAAG 


ATCATAAAAA TTGGTTAGTA AGCAAACATT TATTCAATGA 900 

TGTATGTGTT TAATATACAA TGTAAAATGA ATAAGTTGAA CATGAGGTCT AACGTACATT 960 

TATACGTTAG GCCTTTTTTG CTAGCATGAT GAATAATTTA AAATGTTAGT TAAATTTGAT 1020 

TGTTGAAATT ACAGTAAAAT TTAAGGTGAT GAAAAATTTA GAACTTCTAA GTTTTTGAAA 1080 

AGTAAAAAAT TTGTAATAGT GTAAAAATAG TATATTGATT TTTGCTAGTT AACAGAaAAT 1140 

TTTAAGTTAT ATAAATAGGA AGAAAACAAA TTTTACGTAA TTTTTTTCGA AAAGCAATTG 1200 

ATATAATTCT TATTTCATTA TACAATTTAG ACTAATCTAG AAATTGAAAT GGAGTAATAT 1260 

TTTTGAAAAA AAGAATTGAT TATTTGTCGA ATAAGCAGAA TAAGTATTCG ATTAGACGTT 1320 

TTACAGTAGG TACCACATCA GTAATAGTAG GGGCAACTAT ACTATTTGGG ATAGGCAATC 1380 

ATCAAGCACA AGCTTCAGAA CAATCGAACG ATACAACGCA ATCTTCGAAA AATAATGCAA 1440 

GTGCAGATTC CGAAAAAAAC AATATGATAG AAACACCTCA ATTAAATACA ACGGCTAATG 1500 

ATACATCTGA TATTAGTGCA AACACAAACA GTGCGAATGT AGATAGCACA ACAAAACCAA 1560 

TGTCTACACA AACGAGCAAT ACCACTACAA CAGAGCCAGC TTCAACAAAT GAAACACCTC 1620 

AACCGACGGC AATTAAAAAT CAAGCAACTG CTGCAAAAAT GCAAGATCAA ACTGTTCCTC 1680 

AAGAAGCAAA TTCTCAAGTA GATAATAAAA CAACGAATGA TGCTAATAGC ATAGCAACAA 1740 

ACAGTGAGCT TAAAAATTCT CAAACATTAG ATTTACCACA ATCATCACCA CAAACGATTT 1800 

CCAATGCGCA AGGAACTAGT AAACCAAGTG TTAGAACGAG AGCTGTACGT AGTTTAGCTG 1860 

TTGCTGAACC GGTAGTAAAT GCTGCTGATG CTAAAGGTAC AAATGTAAAT GATAAAGTTA 1920 

CGGCAAGTAA TTTCAAGTTA GAAAAGACTA CATTTGACCC TAATCAAAGT GGTAACACAT 1980 

TTATGGCGGC AAATTTTACA GTGACAGATA AAGTGAAATC AGGGGATTAT TTTACAGCGA 2040 

aGTTACCAGA TAGTTTAACT GGTAATGGAG ACGTGGATTA TTCTAATTCA AATAATACGA 2100 

TGCCAATTGC AGACATTAAA AGTACGAATG GCGATGTTGT AGCTAAAGCA ACATATGATA 2160 

TCTTGACTAA GACGTATACA TTTGTCTTTA CAGATTATGT AAATAATAAA GAAAATATTA 2220 

ACGGACAATT TTCATTACCT TTATTTACAG ACCGAGCAAA GGCACCTAAA TCAGGAACAT 2280 

ATGATGCGAA TATTAATATT GCGGATGAAA TGTTTAATAA TAAAATTACT TATAACTATA 2340 

GTTCGCCAAT TGCAGGAATT GATAAACCAA ATGGCGCGAA CATTTCTTCT CAAATTATTG 2400 

GTGTAGATAC AGCTTCAGGT CAAAACACAT ACAAGCAAAC AGTATTTGTT AACCCTAAGC 2460 

AACGAGTTTT AGGTAATACG TGGGTGTATA TTAAAGGCTA CCAAGATAAA ATCGAAGAAA 2520 

GTAGCGGTAA AGTAAGTGCT ACAGATACAA AACTGAGAAT TTTTGAAGTG AATGATACAT 2580 
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ACCAATTTAA AAATAGAATC TATTATGAGC ATCCAAATGT AGCTAGTATT AAATTTGGTG 2700 

ATATTACTAA AACATATGTA GTATTAGTAG AAGGGCATTA CGACAATACA GGTAAGAACT 2760 

TAAAAACTCA GGTTATTCAA GAAAATGTTG ATCCTGTAAC AAATAGAGAC TACAGTATTT 2820 

TCGGTTGGAA TAATGAGAAT GTTGTACGTT ATGGTGGTGG AAGTGCTGAT GGTGATTCAG 2880 

CAGTAAATCC GAAAGACCCA ACTCCAGGGC CGCCGGTTGA CCCAGAACCA AGTCCAGACC 2940 

CAGAACCAGA ACCAACGCCA GATCCAGAAC CAAGTCCAGA CCCAGAACCG GAACCAAGCC 3000 

CAGACCCGGA TCCGGATTCG GATTCAGACA GTGACTCAGG CTCAGACAGC GACTCAGGTT 3060 

CAGATAGCGA CTCAGAATCA GATAGCGATT CGGATTCAGA CAGTGATTCA GATTCAGACA 3120 

GCGACTCAGA ATCAGATAGC GACTCAGAAT CAGATAGTGA GTCAGATTCA GACAGTGACT 3180 

CGGACTCAGA CAGTGATTCA GACTCAGATA GCGATTCAGA CTCAGATAGC GATTCAGACT 3240 

CAGACAGCGA TTCAGATTCA GACAGCGACT CAGATTCAGA CAGCGACTCA GACTCAGATA 3300 

GCGACTCAGA CTCAGACAGC GACTCAGATT CAGATAGCGA TTCAGACTCA GACAGCGACT 3360 

CAGACTCAGA CAGCGACTCA GACTCAGATA GCGACTCAGA TTCAGATAGC GATTCAGACT 3420 

CAGACAGCGA CTCAGATTCA GATAGCGATT CGGACTCAGA CAGCGATTCA GATTCAGACA 3480 

GCGACTCAGA CTCGGATAGC GATTCAGATT CAGATAGCGA TTCGGATTCA GACAGTGATT 3540 

CAGATTCAGA CAGCGACTCA GACTCGGATA GCGACTCAGA CTCAGACAGC GATTCAGACT 3600 

CAGATAGCGA CTCAGACTCG GATAGCGACT CGGATTCAGA TAGCGACTCA GACTCAGATA 3660 

GTGACTCCGA TTCAAGAGTT ACACCACCAA ATAATGAACA GAAAGCACCA TCAAATCCTA 3720 

AAGGTGAAGT AAACCATTCT AATAAGGTAT CAAAACAACA CAAAACTGAT GCTTTACCAG 3780 

AAACAGGAGA TAAGAGCGAA AACACAAATG CAACTTTATT TGGTGCAATG ATGGCATTAT 3840 

TAGGATCATT ACTATTGTTT AGAAAACGCA AGCAAGATCA TAAAGAAAAA GCGTAAATAC 3900 

TTTTTTAGGC CGAATACATT TGTATTCGGT TTTTTTGTTG AAAATGATTT TAAAGTGAAT 3960 

TGATTAAGCG TAAAATGTTG ATAAAGTAGA ATTAGAAAGG GGTCATGACG TATGGCTTAT 4020 

ATTTCATTAA ACTATCATTC ACCAACAATT GGTATGCATC AAAATTTGAC AGTCATTTTA 4080 

CCGGAAGATC AAAGCTTCTT TAATAGCGAT ACAACTGTTA AACCATTAAA AACTTTAATG 4140 

TTGTTACATG GATTATCAAG TGATGAAACG ACATATATGA GATATACAAG CATAGAAAGG 4200 

TATGCGAATG AACACAAATT AGCTGTGATT ATGCCCAATG TGGATCATAG CGCATATGCT 4260 

AACATGGCAT ATGGTCATAG CTATTATGAT TATATTTTGG AAGTGTATGA TTATGTTCAT 4320 

CAAATATTTC CACTTTCCAA AAAGCGTGAT GACAATTTTA TAGCAGGTCA CTCTATGGGA 4380 
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TTATCTGCTG 


TGTTTGAAGC GCAAAATTTA ATGGATCTAG AGTGGAATGA TTTTTCAAAA 4500 

GAGGCCATAA TTGGCAATCT TTCAAGTGTT AAAGGAACTG AACATGATCC GTATTACTTG 4560 

CTAGACAAAG CTGTAGCTGA AGATAAACAA ATTCCAAAAT TGCTCATTAT GTGTGGTAAA 4620 

CAAGACTTTT TATATCAAGA CAACTTAGAT TTTATCGATT ATTTATCACG CATAAATGTT 4680 

CCTTATCAAT TTGAAGATGG ACCAGGAGAT CATGATTATG CATATTGGGA TCAAGCGATT 4740 

AAGCGTGCTA TAACATGGAT GGTGAATGAT TAATTATTTC TTGGAAAATA TGTGGCTGCA 4800 

TTAAATACAC AGAGTGAGAG ATACAAACTA TTTACGCACG ACTAACATTT CTAAGTGTTT 4860 

AAATTATTTT TGTATTAATA TGATTGGCGC AATTTGCTGA TACACAAAAA TGTTTCTCGT 4920 

GAAACTTAGA TTTAGCTTAT AGTTTTATCA TCATTTGTAT GACTTACATT ATAAATTTTA 4980 

TTATAATGAG GTTAACGCTT TGAAAGGAGT CATCATCATG TCGACCAATA AAAACGATTA 5040 

TGAGCATATG TTGTTTTATT TTGCATATAA AACCTTTATT ACTACCGCTG ATGAAATTAT 5100 

AGAGAAGTAT GGTATGAGTC GTCAGCATCA TCGTTTTTTG mrraTCA ATAAATTACC 5160 

TGGTATTACT ATTAAATCAT TACTAGAAAT ATTAGAAATT TCTAAmCAAG GATCACATGC 5220 

AACACTTCAA AAATTAAAAG AGCAAGGTCT CATTATTGAA AAAGTTTTAG AGACTGATCG 5280 

ACGTGTCAAA AAATTATATT CGACGGATAA AGGCGATCAA CTCATTGCTG AATTGAACAA 5340 

GGCGCAAGAT GAATTATTGC AAAATATATA TCAACAAGTC GGTTCGGATT GGTATGATGT 5400 

GATGGAAGCA TTGGCTAAAG GgCGACCTGG CTTTGATTTT ATTAAGCATT TGAAAGATGA 5460 

AAAAGAAAGC TAGCATCAGA AATGTTAAAA ATCTTCGCAT TCTTAAATTT AAAAAATATG 5520 

TCAAAAAGTG TATAATAAAA ACATATAATT TAATTGAACT CAGTTTCAAC ACATCTTAGA 5580 

AAGGAGTTTG AATGATGAAA AAATTAGCAG TTATTTTAAC ATTAGTTGGC GGTTTATACT 5640 

TCGCATTTAA AAAATACCAA GAACGTGTTA ACCAAGCACC TAACATTGAG TACTAAATTA 5700 

AACCATAAAA AATTCCCGAA CACCTTGTTA TAGTGCTCGG GAATTTTTTT ATGCTTTACT 5760 

TGAATATATC AAATATTATT TTTGCGCTTT CTGTATTTTC GATATTACCA CTAAATGATT 5820 

CTGATCTAGG TCCGTAAGCG TAgGTATTAA CATCCTCGCC TGTATGTCCA TCGGAAGTCC 5880 

ACCCTGTATA AGATTTATCA TTTACTGGCT TCTGAATAGC GTGTTGTAGG GCTTTTGTTT 5940 

GCGTTTCTAC TTCTGCGGAT TTTTCGTCTT TTTCTTTTTT 111 /\VjV_ 1111 6000 

TATTCTCTTT TTTAACCTTT TTCATATCAT CTTGTGAAAA TTCAAATCCA TAACCTTCAT 6060 

TAATAACTTT TTCAGGGTCT TCACCTTTAG CCATTTTTTC TGTCATATAT GATCCAGAGT 6120 

GTTTCATAGA TTTAATCGGT TGAGGATTCC ATTCGTATCC TTTATCTTTA CCAATTGTTA 6180 
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ATTGAATGGC GTCATCGAAT GCTTTTTCAA 
AACCTTCCAT TTCAGACATA ACGCCTGTAA 6300 

TATCGTTGGA ATGCGCTGAT TTATCTATAG AAGCACCTTC GACCATTAAA AAGAATCCTT 6360 

TTTTATTGCG CTCAAGCTTA CTAAGTGCAC TTTGTTGCAT ATCAGCTAAT GATGGTTCGT 6420 

CTTTAGAAGC ATCTATTGCA AGTGGCATAT TTTTATCTGC AAACAAACCA AGAACTTTAT 6480 

CTTTATCAGA TTTTGATAAC TCCTTACTGT TCGTGGCAAG GTCGTAACCA TCTTTTTTGA 6540 

ATTTTTTATC TAAATTGCCA TTACTTTTAC CGAAATATTT AGCGCCGCCG CCTAATAAAA 6600 

CATCAACTTT ATGCTTTCCG TTGATTTTAT CTTTATAAAA TTGTTTAGCG ATTTCGTTTT 6660 

TATCATCTCT AGAAGTCACG TGTGCAGCAT ATGCTGCTGG TGTTGCATCT GTTAATTCAG 6720 

CTGTTGAAAC AAGACCAGTC GACTTACCTT TTTCTTTTGC ACGTTCAAGC ACCGTCTTTA 6780 

CTTTCTGCTT GTTACTGTCA ACACCGATGG CACCATTATA TGTCTTATGA CCAGAACTAA 6840 

AGGCTGTTCC GCCAGCTGCA GAATCAGTAA TATTCTGTTT TGGGTCATTT GAATATGTAC 6900 

GATTTGTGCC TTTTAAATAT GAATCAAAAG CAGTAGGGGT CATTTCTTTA GCATGCGGAT 6960 

CATTTTTATA ATAACGATAA GCTGTGTTAA ATGATGGACC CATGCCATCG CCAACTAAAA 7020 

AGATAACATT TTTTGGATTT TTAGTATTAC CAACCGCGAA ACTTTCATCT TTAGAACTTT 7080 

TATCGGATTG CGCAATTGCA GGTGTGACAG AACTAAAAAC CGTTGACACG ATAATAAGGT 7140 

TAGCAACTGC AAATTTTGTG GCTTTTTTAA CTGATAACAT AAGACATCCT CCTGAGTATA 7200 

TGACTATGTC TTCAGTGTAA AAGAGGAATT TtGAGCAATT ATGTAGTTTT AGTTAnAAAT 7260 

ATGTAAACAG AGTGATTTAG AATAACAAAA aATGAATATA TATGACAATT TGTTATAGAA 7320 

AGCGTTAGAA TAGAAGCGTG TGAAAATATA GAATTAAATA TAATTTGAGG TGGAAAAATG 7380 

ATACTAGTAA TGTTATCTCC ATTATTAATC ATATTCTTTA TAGTGTTGTC TATTTTAGAA 7440 

GAGCGTAAAC ^I^CGAAGi^ AAA^CAACTC GAGAAAGAAA AAGCAAATAC ACTAAATCAA 7500 

AATACAAATG ACACGGAAAG TTCAAATCAA GAGCCGTCAT TGCAGCAGGA TAAAGAACAA 7560 

AAAGATAACA AAGGATAATT CAATTGAAGG AAGAAGATTA TAGATGAAAA TATTAATTGT 7620 

TGAAGATGAT TTTGTTATAG CAGAGAGTTT AGCATCTGAA CTTAAAAAAT GGAATTACGG 7680 

TGTTATTGTC GTTGAACAAT TTGATGATAT ACTGTCTATC TTTAACCAAA ATCAACCTCA 7740 

GCTTGTATTG CTAGATATTA ATTTGCCAAC GTTAAATGGT TTTCATTGGT GTCAAGAAAT 7800 

CCGAAAAACA TCTAATGTGC CAATTATATT TATTAGTTCC CGTATTGATA ATATGGACCA 7860 

AATTATGGCA ATACAAATGG GGGGAGATGA TTTTATCGAA AAGCCATTTA ACTTGTCATT 7920 

AACGATTGCC AAAATTCAAG CATTATTGAG ACGAACTTAT GACTTGTCAG TAGCTAATGA 7980 
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ACAAAACATA 


CAGCTATCTT TGACTGAATT ACAAATATTA AAGTTATTAT TTCAAAATGA 8100 

AGaTAAATAT GTAAGTAGrA CTGCTTTAAT TGaAAAATGT TGGGaATCAG AAAACtTCAT 8160 

AGATGATAAC ACATTAGCTG TTAACATGAC GCGCCTGCTG AAAAAATTAA ATACTATTGG 8220 

CGTTAATGAT TTTATCATTA CAAAGAAAAA TGTCGGATAT AAAGTATAGG GTGAATGCAA 8280 

TGACCTTTCT TAAAAGTATT ACTCAGGAAA TAGCAATAGT CATAGTTATT TTTGCTTTGT 8340 

TTGGCTTAAT GTTTTACCTG TATCATTTGC CATTAGAAGC ATATTTACTA GCACTTGGCG 8400 

TTATTTTATT ATTATTACTC ATATTCATAG GTATTAAATA TTTAAGTTTT GTAAAAACTA 8460 

TAAGCCAACA ACAACAAATT GAAAACTTAG AAAATGCGTT GTATCAGCTT AAAAATGAAC 8520 

AAATTGAATA TAAAAATGAT GTAGAGAGCT ACTTTTTAAC ATGGGTACAT CAAATGAAAA 8580 

CACCCATTAC TGCAGCACAA CTGTTACTTG AAAGAGATGA GCCTAATGTT GTTAATCGTG 8640 

TTCGTCAAGA GGTTATTCAA ATTGaTAACT ATACAAGTTT AGCACTTAGT TATTTAAAGT 8700 

TATTAAATGA AACTTCTGaT ATTTCTGTCA CTAAAATTTC GATTAATAAT ATCATTCGCC 8760 

CAATTATTAT GAAATATTCA ATACAGTTTA TTGATCAAAA AACAAAAATC CATTATGAAC 8820 

CTTGTCATCA CGAAGTATTA ACTGACGTTA GATGGACCTC TTTAATGATA GAACAATTAA 8880 

TAAATAATGC ACTTAAGTAT GCGAGAGGTA AAGATATATG GATTGAATTT GATGAGCAAT 8940 

CCAATCAATT ACACGTAAAA GATAATGGTA TCGGTATTAG TGAAGCGrAC TTGCCTAAAA 9000 

TATTTGATAA GGGCTATTCA GGTTATAATG GCCAGCGCCA AAGTAACTCA AGTGGGaTTG 9060 

GTTTATTTAT CGTAAAACAA ATTTCAACAC ACACAAACCA TCCTGTTTCA GTCGTATCTA 9120 

AACAAAATGA GGGTACAACA TTTACGATTC AATTTCCAGA TGAATAAAAA CTTTCAATAT 9180 

TGTAAGTATA CTAGTAACAT ATTTAAATGT TATTAGTATT TTTTTTACTA TTT1TGTTTT 9240 

AATATAGAAC TAACAAAGAA ATGAGGTGCA TGCCATGTTG CTAGAAGTGn AACATGTAAA 9300 

AAAGGTTTAT GGTAAAGGTT TGAATGCTAC GACAGCACTT AATCAAATGA ATTTATCAGT 9360 

TGGAGCTGGT GaATTTGTTG CaATTATGGG TGAGTCTGGG tCAGGGAAGT CTACACTACT 9420 

AAATTTAATT GCtTCTTTTG ATGGACTAAC TGAAGGTGAC ATTATTGTGG ATGGCGCACA 9480 

TTTAAATAAT ATGAAAAATA AAAGTAAAGC ATTGTATCGT CaACAAATGG TAGGTTTTGT 9540 

TTTTcAAGAT TTTAATCTTT TACCAACAAT GACGAATAAA OAAAATATAA IX*J\ I G CCA IT 9600 

AATTTTAGCT GGTGCTAAAC GAAAAGATAT AGAACAAAGG GTACATCAGT TGGCAGTACA 9660 

ATTACATTTA GAGGGATTCT TAAACAAGTA TCCTTCTGAA ATCTCTGGGG GTCAGAAGCA 9720 

ACGCATTGCC ATTGCACGTG CATTAGTTAC TAAGCCGACG ATTTTACTAG CCGATGAACC 9780 
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TCAATTGGAA 


CAGACAATTT TAATGGTAAC TCATTCAAAT ATCGATGCGT CTTATGCAGA 9900 

GCGAGTCATT TTTATTAAAG ATGGGCGTCT ATATCATGAA ATATATCGTG GTGAAGAAAG 9960 

TCAATTAGCT TTTCAACAAC GAATAACAGA TAGCTTAGCA CTTGTGAATG GAGGAAGTGT 10020 

CAATATATGA AGTTAAGATT GTTATGnACA TAGTGCGACG TCAATTTATT ACGCAGCGAC 10080 

TTGTAATCAT TCCATTCATT TTAGCGGTAA GTGTACTATT CATGATTGAA TATACGCTTG 10140 

TGTCAATTGG GTTAAATAGC TACATAAAAC AGAAGAATGA CTTCCTAGTA CCATTTATTA 10200 

TCATAGCTAA TTrrrrrATG GCGCTTTTAA CTTTTATTTT TATTTTCTAT GCAAATCACT 10260 

TTATGATGTC ACAAAGACGA AAAGAGTTTA GCATTTTTAT GACATTGGGC ATGACCAAGA 10320 

AAAGTATGCG TTTAATTGTA GTGATGGAAA CTATCTTACA ATTTGTGATA ATTTCAGTCG 10380 

TTAGTATTGC CGGCGGATAC TTACTTGGTG CGATATTTTT CTTGTTTATA CAGAAAATAA 10440 

TGGGCAGTGA AGTTGCGACG TTAAGGTATT ATCCATTTGA CTCTGTAGCG ATGTTTATTA 10500 

CTTTGATTAT CATTGCTGTA TTAATGGGCA TGCTACTTAT ATTCAACTTG TTTAGTATTA 10560 

ATTTTCAACG GCCGATAACT TATCAACATC GTTCCGATTC TAGTGTCATA TCACGATGGT 10620 

TGCGTTACGT TTTAATTGTT ATAGGAAGCG CAnACTATAT TTAGGTTACT TTATTGCATT 10680 

ACAACAAGAT ACGACGTTTG GTGCCTTTTT TAAAATATGG ATTGTCATAG GATTAGTTAT 10740 

TATCGGTACT TATGCATTTT TTGTAGGTAT AAGTGAAATA ATTATTAGTA TATTGCAGCA 10800 

GGTATCAAAA GTTTACTATC ATCCACGGTA TTTTTTTGTG GTAGTTGGGA TGCGTGTACG 10860 


TCTTAAAATG 


AATG CAGTCA 


V3 X X X u\«nnL 


X \— «V\— XXX \J 


CTGTGTACAT 


TTTTGATTGT 


10920 


AACVj\-T1 CAt-A 


AJAjACAI iaa 


CAACCTATCG 


TGATATGAAT 


1 /\V_ <w_r\. X ih 


LVj/vin 1 X b/\ X 




TACGAATGAT TakGATTTGT CATTTAGCGA CAATTCTAAG TCACAAaTAG AACGTCAACA 11040 

AACAATTGAG 11050 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CGACATAACG AGGCAAGGGT ACATGATACT TTAGCCTCGT TTTTGATATG TATTTTTCTG 60 

AATATAAGGG CAATAGATGG TATTTTATAw TTTTTTTAAG GTAGTGATTA ACATAGATAT 120 
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TCAAGCGGAA CAGCATTATG CACCAGTATT AACGCATTTT TTAGATCCAA GAGGGCAATA 
TATATTGGAA GTGATTTGTG GCAGTTATGA AGATTTAAAC GTATCTTTTT ATGGTGGACC 300 
TAATGCTGAA AGAAAAAGAG CAATCATTTC GCCGAACTAT TATGAACCTA AAGAAAGCGA 360 
CTTTGAATTA ACTTTAATGG AAATAGATTA TCCTGAAAAA TTCGTCACTT TAAAACATCA 420 
ACATATTTTA GGGACATTAA TGTCTTTAGG TATCGAACGC GAACAAGTTG GAGATATAAT 480 
TGTGaATGAA CGAATTCAAT TTGTTTTGAC AAGTAGATTG GAATCATTTA TTATGTTAGA 540 
ATTACAACGT ATTAAAGGCG CATCAGTTAA ACTTTATACT ATTCCAGTAA CAGATATGAT 600 
ACAATCTAAT GAGAATTGGA AAAATGAAAG TGCaCAGTTA GTTCTTTAAG GTTAGATGTT 660 

GTTATTAAAG AAATGATACG TAAATCACGT ACGATTGCGA AACAACTAAT CGAAAAAAAA 720 
CGTGTTAAAG TGAATCACAC TATTGTTGAT TCAGCAGATT TTCAATTACA AGCAAATGAT 780 
TTAATATCCA TCCAAGGTTT TGGTAGAGCA CACATTACTG ACTTAGGTGG TAAAACTAAA 840 

AAAGATAAAA CGCACATTAC CTATAGAACA TTATTCAAAT AGTAATGATT TAAGGAGGAT 900 
AACAAATGCC TTTTACACCA AATGAaATTA AGAATAAAGA GTTTTCACGT GTaAAGAATG 960 
GTTTTAGAAC CTACTGnAGT TGG 983 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
TTTTGCAAAG CTTATTTTAT GTCAAACAGA TAGTCAATGT GAAACAAAGG TTAGTACATA 60 
TAATCATCCA GACTTTATGT ATATATCAAC AACTGAGAAT GCAATTAAGA AAGAACAAGT 120 
TGAACAACTT GTGCGTCATA TGAATCAACT TCCTATAGAA AGCACAAATA AAGTGTACAT 180 
CATTGAAGAC TTTGAAAAGT TAACTGTTCA AGGGGAAAAC AGTATCTTGA AATTTCTTGA 240 
AGAACCACCG GACAATACGA TTGCTATTTT ATTGTCTACA AAACCTGAGC AAATTTTAGA 300 
CACAATCCAT TCAAGGTGTC AGCATGTATA TTTCAAGCCT ATTGATAAAG AAAAGTTTAT 360 
AAATAGATTA GTTGAACAAA ACATGTCTAA GCCAGTAGCT GAAATGATTA GTACTTATAC 420 
TACGCAAATA GATAATGCAA TGGCTTTAAA TGAAGAATTT GATTTATTAG CATTAAGGAA 480 

ATCAGTTATA CGTTGGTGTG AATTGTTGCT TACTAATAAG CCAATGGCAC TTATAGGTAT 540 
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GAATGGTTTC TTCGAAGATA TCATACATAC AAAGGTAAAT GTAGAGGATA AACAAATATA 660 

TAGTGATTTA AAAAATGATA TTGATCAATA TGCGCAAAAG TTGTCGTTTA ATCAATTAAT 720 

TTTGATGTTT GATCAACTGA CGGAAGCACA TAAGAAATTG AmTCAAAATG TAAATCCAAC 780 

GCTTGTATTT GAACAAATCG TAATTAAGGG TGTGAGTTAG ATGCCAAATG TAATAGGTGT 840 

TCAGTTTCAA AAAGCGGGAA AATTAGAATA TTATACACCT AATGATATAC AAGTAGATAT 900 

AGAAGACTGG GTAGTTGTCG AATCTAAAAG AGGCATAGAG ATAGGTATTG TTAAAAATCC 960 

ATTAATGGAT ATTGCTGAAG AGGATGTTGT GTTACCTCTT AAAAATATTA TTCGCATTGC 1020 

TGATGACAAA GATATTGATA AATTTAATTG TAATGAACGA GATGCTGAAA ATGCATTAAT 1080 

ACTATGTAAA GACATTGTAA GAGAACAAGG TTTGGACATG CGTTTAGTCA ATTGCGAATA 1140 

TACATTAGAT AAATCGAAAG TTATTTTTAA TTTTACGGCG GATGATCGTA TTGATTTTAG 1200 

AAAATTAGTA AAAATATTAG CGCAACATTT AAAAACACGT ATCGAGTTGA GACAAATTGG 1260 

TGTAAGGGAT GAAGCCAAAT TGCTTGGCGG TATCGGACCT TGTGGTAGGT CGTTATGTTG 1320 

TTCTACATTT TTAGGGGATT TTGAACCAGT ATCGATTAAG ATGGCTAAGG ATCAAAATTT 1380 

ATCATTAAAT CCAACTAAAA TTTCTGGTGC ATGTGGTCGT TTGATGTGTT GTTTAAAATA 1440 

TGAAAATGAC TATTATGAGG AAGTACGTGC ACAATTACCT GATATTGGTG AAGCAATTGA 1500 

AACGCCTGAT GGTAACGGGA AAGTAGTTGC TTTAAATATA TTAGACATTT CTATGCAGGT 1560 

GAAGCTTGAG GGACATGAAC AGCCACTTGA ATATAAATTA GAAGAAATAG AAACTATGCA 1620 

TTAAGGAGGC ATTATTACAT TTGGATCGCA ATGAAATATT TGAAAAAATA ATGCGTTTAG 1680 

AAATGAATGT CAATCAACTT TCAAAGGAAA CTTCAGAATT AAAGGCACTT GCAGTTGAAT 1740 

TAGTAGAAGA AAATGTAGCG CTTCAACTTG AAAATGATAA TTTGAAAAAG GTGTTGGGCA 1800 

ATGATGAACC AACTACTATT GATACTGCGA ATTCAAAACC AGCAAAAGCT GTGAAAAAGC 1860 

CATTACCAAG TAAAGATAAT TTGGCTATAT TGTATGGAGA AGGATTTCAT ATTTGTAAAG 1920 

GCGAATTATT TGGAAAACAT CGACATGGTG AAGATTGTCT GTTCTGTTTA GAAGTTTTAA 1980 

GTGATTAATC AAGCACACTC AAATAGTGTT ATAATTATAA ATGAATATGG TTTGGATAAG 2040 

TCTGAGACAA TGCATGTTTC AGGCTTTAAT TGTGTATAAA GTTTTGGTGA TTGCATAAGA 2100 

GATGGCGGTA CTAAATGTTA TTATTAAGTG TGCACGCAgT ATCaTTAGTT ATAAAATGTA 2160 

GCTGTTAAAA GTCAAAAATA CATCGAATGT AGTTAGGCAT ATAATATAAA AAGAGTTTTC 2220 

AATTACTCAA TAGAAAAAGG TTGTCTTCAT AGGAGTTAAA AATGTTAAAA GAGAATGAAC 2280 

GATTTGATCA ACTAATCAAA GAAGATTTTA GTATTATTCA AAATGATGAT GTTTTTTCAT 2340 
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TGGACTTATG 


TTCAGGCAAT 


GGGGTGATAC 


CCTTGTTATT 


GTTTGCGAAA 




2460 




ATATAGAAGG 


TGTTGAGATT 


CAAAAAACAC 


TTGTCGATAT 


GGCGCGACGC 


ACATTTP A A T 


2520 


5 


TCAATGATGT 


TGATGAATAT 


TTAACAATGC 


ATCACATGGA 


TTTGAAAAAC 


GTT ACTA A A CI 


2580 




TATTTAAACC 


TTCACAATAT 


ACTTTAGTAA 


CGTGTAATCC 


GCCTTATTTT 


AAAGAG AATP 


2640 


10 


AGCAACACCA 


ACATCAAAAA 


GAAGCACATA 


AGATAGCGAG 


ACATGAGATT 


ATGTGT A PA P 


2700 




TTGAAGATTG 


CATGATTGCA 


GCCCGTCATT 


TATTAAAAGA 


AGGTGGCAGG 


PTAAAPATf^P 


2760 




TACATCGTGC 


AGAGAGACTA 


ATGGATGTCT 


TGTTTGAAAT 


GAGAAAAGTG 




2 820 


IS 


CTAAGAAAGT 


CGTTTTTATA 


TATAGTAAAG 


TAGGGAAATC 


AG CACAAACG 


ATAGTAGT A ^ 


2830 




AAGGTCGAAA 


AGGTGGAAAT 


CAAGGTTTAG 


AAATCATGCC 


CCCATTTTAT 


■n X X X f\ X X v» 


2940 




AAGATGGTAA 


TTATAGCGAA 


GAAATGAAGG 


AAGTATATTA 


TGGATAGTCA 


TTTTCTATAT 

x X X IVjlrtlrti 


o rt ft /\ 
J 000 


20 


ATTGTAAAAT 


GTAGTGATGG 


AAGTTTATAT 


ACAGGATACG 


CTAAAGACGT 


TAATflPAPPT 


3060 




GTTGAAAAAC 


ATAACCGAGG 


TCAAGGAGCC 


AAATATACGA 


AAGTAAGACG 


*r p pnmY^r'' r t 


3 120 


25 


TTAGTTTATC 


AAGAAATGTA 


TGAGACAAAG 


TCTGAAGCAT 


TGAAGCGTHA 


RTATPH 2i n T**T 


3180 


AAAACTTATA 


CCAGACAAAA 


GAAATTGCGA 


TTAATTAAGG 


AGCGATAGTA 


TGnPTrtTIlTT 
iwL 1 Vj x /\ 1 I 


3 240 




ATATTTAGTG 


GGCACACCAA 


TTGGTAATTT 


AGCAGATATT 


ACTTATAGAO 


1 1 A XVj i 


3 300 


30 


ATTGAAACGT 


GTTGATATGA 


TTGCTTGTGA 


AGACACTAGA 


GTAACTAGTA 




3 360 




TCATTATGAT 


ATTCCAACTC 


CATTAAAGTC 


ATATCACGAA 


CATAACAAGG 


AT A & P. P 21 P lx r* 


3420 




TGCTTTTATC 


ATTGAACAGT 


TAGAATTAGG 


TCTTGACGTT 


GCGCTCGTAT 


PTP. A TP, cmn 


A O f\ 

J 4 8 0 


35 


ATTGCCCTTA 


ATTAGTGATC 


CTGGATACGA 


ATTAGTAGTG 


GCAGCCaGAG 


AAGCTAATAT 


-3 3*1 U 




TAAAGTAGAG 


ACTGTGCCTG 


GACCTAATGC 


TGGGCTGACG 


GCTTTGATGG 

AAA > T A 


PTAf^TTlP. ATT 

V» X n\J X X X 


*a c f\ r\ 




ACCTTCATAT 


GTATATACAT 


TTTTAGGATT 


TTTGCCACGA 


AAAGAGAAAG 


AAA A A AP.TPP 


*a ^ c r\ 


40 


TGTATTAGAG 


CAACGTATGC 


ATGAAAATAG 


CACATTAATT 


ATATACGAAT 


PAPPPP ATPP 






TGTGACAGAT 


ACATTAAAAA 


CAATTGCAAA GATAGATGCA 


ACACGACAAG 


TAT PA PTAfiG 


J / oU 


45 


G CGTGAATTA 


ACTAAGAAGT 


TCGAACAAAT TGTAACTGAT 


GATGTAACAC 


AATTAPA Af2P 




ATTGATT CAG 


CAAGGCGATG 


TACCATTGAA 


AGGCGAATTC 


GTTATCTTAA 

x x x x inn 


TTTZA Afl/nYiP 
x i\M\r\\j\s X\^\m. 






TAAAGCGAA P 




CGTGGTTTGA 


TGATTTATCT 


ATCAATGAGC 


ATGTTGATCA 


3960 


TTATATTCAA ACTTCACAGA TGAAACCAAA ACAAGCTATT AAAAAAGTTG CTGAAGAACG 4020 

ACAACTTAAA ACGAATGAAG TATATAATAT TTATCATCAA ATAAGTTAAT CACTTTATCG 4080 

ATTaTATGAA ATTTTAAACG ATTTTATAAA CGCAAGCTGT AATTTTAAAT GGTAAGTTAT 4140 
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GTTTTTTAAT GTAAAATAAA TACATTGAAA GTAATAAATA CCTTAACATT GAATAAGATG 426 0 

AAAATGAGAT GACGAGATAA ATGTTCGCGT CCGTTGAAAT GCATAGAAAT CTTAGATATT 4320 

5 ATTTGAAGTG AGACATTACG AGGAGGAACA GTTATGGCTA AAGAAACATT TTATATAACA 4380 

ACCCCAATAT ACTATCCTAG TGGGAATTTA CATATAGGAC ATGCATATTC TACAGTGG CT 444 0 

GGAGATGTTA TTGCAAGATA TAAGAGAATG CAAGGATATG ATGTTCG CTA TTTGACTGGA 4 500 

w 

ACGGATGAAC ACGGTCAAAA AATTCAAGAA AAAGCTCAAA AAG CTGGTAA GACAGAAATT 4 56 0 

GAATATTTGG ATGAGATGAT TGCTGGAATT AAACAATTGT GGGCTAAGCT TGAAATTTCA 46 2 0 

?5 AATGATGATT TTATCAGAAC AACTGAAGAA CGTCATAAAC ATGTCGTTGA GCAAGTGTTT 4680 

GAACGTTTAT TAAAGCAAGG TGATATCTAT TTAGGTGAAT ATGAAGGTTG GTATTCTGTT 474 0 

CCGGATGAAA CATACTATAC AGAGTCACAA TTAGTAGACC CACAATACGA AAACGGTAAA 4 800 

20 ATTATTGGTG GCAAAAGTCC AGATTCTGGA CACGAAGTTG AACTAGTTAA AGAAGAAAGT 4 86 0 

TATTTCTTTA ATATTAGTAA ATATACAGAC CGTTTATTAG AGTTCTATGA CCAAAATCCA 4 92 0 

GATTTTATAC AACCACCATC AAGAAAAAAT GAAATGATTA ACAACTTCAT TAAAC CAGG A 498 0 

25 

CTTGCTGATT TAGCTGTTTC TCGTACATCA TTTAACTGGG GTGTCCATGT TCCGTCTAAT 504 0 

CCAAAACATG TTGTTTATGT TTGGATTGAT GCGTTAGTTA ACTATATTTC AGCATTAGGC 510 0 

TATTTATCAG ATGATGAGTC ACTATTTAAC AAATACTGGC CAG CAG AT AT TCATTTAATG 5160 

30 

GCTAAGGAAA TTGTGCGATT CCACTCAATT ATTTGGCCTA TTTTATTGAT GGCATTAGAC 5220 

TTACCGTTAC CTAAAAAAGT CTTTGCACAT GGTTGGATTT TGATGAAAGA TGGAAAAATG 52 80 

AGTAAATCTA AAGGTAATGT CGTAGACCCT AATATTTTAA TTGATCGCTA TGGTTTAGAT 5340

GCTACACGTT ATTATCTAAT GCGTGAATTA CCATTTGGTT CAGATGGCGT ATTTACACCT 54 00 

GAAGCATTTG TTGAG CGTAC AAATTTCGAT CTAGCAAATG ACTTAGGTAA CTTAGTAAAC 5460 

CGTACGATTT CTATGGTTAA TAAGTACTTT GATGGCGAAT TACCAGCGTA TCAAGGTCCA 5520

CTTCATGAAT TAGATGAAGA AATGGAAGCT ATGGCTTTAG AAACAGTGAA AAGCTACACT 5580 

GAAAGCATGG AAAGTTTGCA ATTTTCTGTG GCATTATCTA CGGTATGGAA GTTTATTAGT 564 0 


AGAACGAATA AGTATATTGA CGAAACAACG CCTTGGGTAT TAGCTAAGGA CGATAGCCAA 5700 

AAAGATATGT TAGGCAATGT AATGGCTCAC TTAGTTGAAA ATATTCGTTA TGCAGCTGTA 5760 

TTATTACGTC CATTCTTAAC ACATGCGCCG AAAGAGATTT TTGAACAATT GAACATTAAC 5820 


AATCCTCAAT TTATGGAATT TAGTAGTTTA GAGCAATATG GTGTGCTTAA TGAGTCAATT 5880 

ATGGTTACTG GGCAACCTAA ACCTATTTTC CCAAGATTGG ATAGCGAcGG AnAATTGCAT 5940 
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AACCTCAAAT 


TGATATTAAA 


GACTTTGATA 


AAGTTG AAAT 


TA Af^tPAf^PA 


Mv_vj/\ i IATTG 


6060 




ATGCTGAACA 


TGTTAAGAAG 


TCAGATAAGC 


TTTTAAAAAT 


TPAAftTAfZAP 


1 X Auii Hi, L\3 


6120 




AACAAAGACA 


AATTGTATCA 


GGAATTGCCA 


AATT CTATAP 


A rr A CZ A AT 


Al 1A1 IvrG x A 


6180 




AAAAAGTAGC 


AGTTGTTACT 


AACCTGAAAC 


CAG CTAAATT 






6240 




GTATGATATT 


ATCTGCTGAA 


AAAGATGGTG 


TATTAACCTT 


t\\3 X /VlU XXX /A 


TV ft r P/~' f' TV TV 


6300 




TTCCAAATGG 


TGCAGTGATT 


AAATAACTGT 


ATTTITAAAA 


ATTAnnAHAP 


A1AA 1 lAlbl 


6360 




TAATCGATAC 


ACATGTCCAT 


TTAAATGATG 


AGCAATACC5A 


THATr: R TTTP 
X VJrt X tJLrt 1 i lu 


7A /""TV* ft ft /"* TV"» t* 

AO I vjAAGTGA 


6420 




TTACACGTGc 


TAGAGAAGCA 


GGTGTTGATC 


GTATGTTTGT 

V X rk X XXX X 


■rUj X X \3VJ 1111 


TA TV TV TV TV n*f^r^ » 

AAUAAATCGA 


6480 




CAATTGAACG 


CGCGATGAAA 


TTAATCGATG 


AGTATGATTT 


X X IrtlAlljOl. 


Al 1ATCGGTT 


6540 




GGCATCCAGT 


TGACGCAATT 


GATTTTACAG 


AAGAACACTT 


fif^ A A TYZfl 7X TT 


TV TV fp/'M 1 Ml MM TV f> 

t» AA rCTTTAG 


6600 




CTCAGCATCC 


AAAAGTGATT 


GGTATTGGTG 


AAATGGGATT 


AwA 1 Inl 


i LtWj A I AAA T 


6660 




CTCCTGCAGA 


TGTTGAAAAG 


GAAGTTTTTA 


a a a a nr* a & 21 t 


' 1 /"'T M 1 •!» TV furry 


AAGCGTTTGA 


6720 




AGTTACCAAT 


TATCATTCAT 


AACCGTGAAG 


PA A PTfA Af2 H 


\-l\a 1A1 vJVjAI 


ATCTTATTGG 


6780 


AGGAGCATGC 


TGAAGAGGTA 


GGCGGGATTA 


TG CATARCTT 


X M.Vj X Vjvj 1 ILi 


C uAGAAATTG 


6840 




CAGATATTGT 


AACTAATAAG 


CTGAATI'TTT 






G TG A CATTT A 


6900 




AAAATGCTAA 


ACAGCCTAAA 


G AAGTTG CTA 


AG CATfiTClTP 


nn X \j\j/\Lj\— Aj 1 


1 1 vjCTAGTTG 


6960 




AAACCGATGC 


ACCGT AT CTT 


TCGCCACATC 


CGT AT AG A RC1 




/""» TV TV r~*f*r~> O /~V» »V 


7020 




GAGTAACTTT 


AGTAGCTGAA 


CAAATTGCTG 


AATTAAAAnn 


v— X 1A1L1 x M. 1 


T\ TV /~ ' TV TV pfTV^T 

GAAGAAGTGT 


7080 




GCGAACAAAC 


AACTAAAAAT 


G CAGAGAAAT 


TGTTTAATTT 


AAATTCATAA 


AGTTAAAAGT 


7140 




GAGAAAGATC 


ACCGCCATAA 


ATGTAAACGA 


TGCTATATTC 


GTTTAATATG 


CTATGGTTCT 


7200 




TTCTCACTTT 


TTTAAATTAA 


AATATCGTGC 


ATGTGGAATA 


CGTGCGATAG 


AGATGGTTAG 


7260 




AGCTTTGAAA 


TTAAGAATTG 


TAGGAAGGCG 


TTTTAAATGA AAATCAATGA GTTTATAGTT 


7320 




GTAGAAGGAC 


GAGATGATAC 


TGAGCGTGTT 


AAACGAGCTG 


TTGAATGTGA 


TACGATTGAA 


7380 




ACGAATGGTA 


GTGCCATCAA 


CGAACAAACT 


TTAGAAGTAA 


TTAGAAATGC 


TCAACAAAGT 


7440 


CGAGGCGTTA 


TTGTATTAAC 


AGATCCAGAT 


TTCCCAGGAG ATAAAATf AG 


AAGTACAATT 


7500 






X LhA/\VjO X Vj I 


TAAACATGCG 


TATATTGATA 


GAGAAAAAGC 


TAAAAATAAA 


7560 




AAAGGGAAAA 


TTGGTGTTGA 


ACATGCCGAC 


TTAATTGATA 


TTAAAGAAGC 


GTT AATG CAT 


7620 




GTTAGTTCAC 


CCTTTGATGA 


AG CTT ATG AA 


TCAATTGATA AATCTGTGCT 


AATAGAGTTG 


7680 




GGGTTAATTG 


TTGGGAAAGA 


TGCAAGGCGC 


CGTAGAGAAA TTTTAAGTAG 


AAAATTGCGA 


7740 
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GCGGATGTAA GGCAAGCTTT AGAAGATGAA TGAGGAAGTG AAAATGTTGG ATAATAAAGA 7860 

TATTGCAACA CCATCAAGAA CGCGAGCGTT GTTAGATAAA TATGGCTTTA ATTTTAAAAA 792 0 

AAGTTTAGGA CAGAACTTTT TGATAGATGT GAATATCATT AATAATATCA TTGATGCAAG 7980 

TGATATTGAT GCACAAACTG GGGTGATTGA AATTGGTCCA GGCATGGGGT CATTGACAGA 804 0 

ACAATTGGCC AGACATGCTA AAAGAGTATT GGCATTTGAA ATTGATCAAC GTTTAATACC 8100 

TGTATTAAAT GATACACTAT CAC CTTATG A TAATGTGACG GTGATTAATG AAGATATTTT 816 0 

AAAAGCGAAT ATTAAAGAAG CTGTTGAAAA TCATTTACAA GATTGTGAAA AAATAATGGT 822 0 

TGTTGCAAAC CTGCCGTACT ATATTACGAC GCCAATTTTA TTAAATTTGA TGCAACAAGA 8280

TATACCAATT GATGGCTACG TGGTGATGAT GCAAAAAGAA GTGGGCGAAC GCTTAAATGC 834 0 

TGAAGTAGGT TCAAAAGCAT ATGGTTCGTT ATCAATTGTC GTACAATACT ATACAGAGAC 84 00 

TAGTAAAGTA TTAACGGTAC CTAAATCTGT ATTTATGCCA CCACCTAATG TTGATTCAAT 84 60 

AGTTGTAAAA CTGATGCAGA GAACTGAACC GTTAGTAACA GTAGATAACG AGGAAGCATT 8 520 

CTTTAAGTTA G CAAAAGCAG CATTTGCACA AAGAAGAAAG ACAATTAACA AT AACT AT CA 8 580 

AAATTATTTT AAAGATGGTA AACAACACAA AGAAGTGATT TTACAATGGT TGGAACAAGC 8 64 0 

AGGTATTGAT CCAAGACGTC G CGGTGAAAC GCTATCTATT CAAGATTTTG CTAAATTGTA 8 700 

TGAAGAAAAG AAAAAATTCC CTCAATTAGA AAATTAAATG ATTGACAAAG CAAAGCACTA 8760 

TTGTTAAAAT TTAAATTTTG TTTGACGAAA ACG TTGCAAA TATGGTATTA TGTAACTTGT 8 82 0 

AGCGAGGTGG AG CAATATGC CAAAATCAAT TTTGGACATC AAAAATTCTA TTGATTGTCA 88 8 0 

TGTAGGAAAT CGTATTGTAC TGAAaGCCAA TGGAGGCCGT AAGAaAACAA TAAAACGTTC 8940

TTCAATTTTA^ AAAGAAACAT ATCCGTCAGT TTTCATTGTT JSAGTTAGATC _AAGACAAACA 9000 

CAAC?TTGAG AGAGTATCTT ATACATACAC TGATGTGTTA ACTGaAAATG TTCAAGTTTC 9060 

ATTTGAAGAG GATAATCATC ACGAATCAAT TGCACACTAA ATAAGACATA TAGAGATGTT 9120

AGACGTTTCT TAGTATAAGA AGTAAATATT ATGATAATTA TTTGAGTGTT GGGcATTATG 9180 

TTCAATACTC TTTTTATTTA CAAAATGTTT AACACTGATG TTTCGCTTAT AGATTTTTCA 924 0 

GTAAATGGAT AATTGTATTT ATAAACACAA ATACAAGTAA ATACTAAGTA ATTAGATGGA 93 00 

GAAAATTACT TTTTTATTAA AAAAACACTA AAAAACAAAT TAAAATGTCA AATATTAATT 9360 

CTCTTTATGT TAAAATCATC ATATTAAGAT AACGAAAAGA GGGCGGAAAA TGATATATGA 94 20 

AACGGCACCA GCCAAAATTA ATTTTACGCT CGATACACTT TTTAAAAGAA ATGATGGCTA 94 80 

TCATGAGATT GAAATGATAA TGACAAGAGT TGATTTAAAT GATCGTTTAA CTTTTCATAA 954 0 
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AAATCTCGCA TATCGTGCAG CGCAACTATT TATTGAGCAA TATCAACTAA AGCAAGGTGT 
AACAATTTCT ATCGATAAAG AAATACCTGT TTCTGCTGGC TTAGCTGGAG GTTCGGCTGA 


TGCAGCAGCA ACGTTAAGAG GATTGAATCG ACTTTTTGAT ATAGGGGCGA GTTTGGAAGA 
ATTGGCTCTA CTAGGCAGTA AAATCGGGAC AGATATTCCG TTTTGTATTT ATAATAAAAC 
TGCACTATGT ACTGGAAGAG GAGAGAAAAT CGAGTTTTTA AATAAACCAC CTTCAGCTTG

GGTGATTCTT GCTAAACCAA ACTTAGGCAT ATCATCACCA GATATATTTA AGTTGATTAA 
TTTAGATAAG CGTTACGACG TACATACGAA AATGTGTTAT GAGG CCTTAG AAAATCGAGA 
TTATCAACAA TTATGTCAAA GTTTGTCTAA TCGATTAGAG CCAATTTCTG TTTCAAAACA

CCCACAAATC GATAAATTAA AAAATAATAT GTTGAAAAGT GGTGCAGATG GTGCGTTAAT 
GAGTGGAAGC GGACCTACTG TGTATGGGCT AGCACGAAAA GAAAGCCAAG CAAAAAATAT 
TTATAATGCA GTTAACGGTT GTTGTAATGA AGTGTACTTA GTTAGACTAT TAGGATAGAA 
GGGTTGAAAA GATGAGATAT AAACGAAGCG AGAGAATTGT TTTTATGACG CAATATTTGA 
TG 

(2) INFORMATION FOR SEQ ID NO: 99: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5614 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

GATTGATTAA ATGTTTTAAT CCACTTCAAT GCCTTCGATA AACTCTACAA TCGCGCTATT 60 

CATATAATTA TTCGATTTCA TTTGTTCAGC ATATGTCTCA TTAAATCCAG ACATAACTTT 120 

TTTAAAwGCG AAAATTGAAA TTGGTATCGT TACTAATAAG GCACTAGCCA TACGCCAATC 180 

AATG AG CATT ATGTATAAAA AGATAGCAGC TGACAAAAGT AAGTTTCCTA TAACTTCAGG 24 0 

AATCATATGT GCTAAAGGTA ATTCTATTGT TTCAACCTTA TCGACAAATA TATTTTTTAA 300 

TTCACCTATT TTCTTAGATT CCaCTACGCC TAAAGGGAGA CGCATTAATT TTTGAGCTAA 360 

TTTTTTACGA ATTTCAGATA AAATTTCATA TG CCGTAATA TGTGATAGCA TCGTTGACGC 420 

TCCAAAACAA CACACTTGTG AAATATAAGC GATTAAAGCA ATAAAGATAT AAACCATAAT 4 80 

CGAATTAATC GTATATGTAT TGTTAATCAT CATTAAAATA ATTTTAAATA CTGCCCAATA 54 0 

AGGAACTAAT CCAGAAAAGA CACTGATGAT AGACAACAAA ATTGATAACA TAATTTTCCA 600 
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ATATGTAACT CCTkTCAATT AATAATCTAA ATTAAGCCGC TTATATTATT TATTTCACTG 720 

GATGATATAC ATAATATAAA TTTGTTATTT GTTAAAAATT AATACTTATT ACAAGTACAT 780 

CATATATTAG TTGATAACGA TTATCAATGT CGCGTGGATT TGTGACACAT TTCTTTTAAA 84 0 

AATTCACAAG GTTATGGGGC AGAAATGATA AAGAGCCACT AATGATTTAT TATGTAGTGG 900 

TTCTGGGAGT GGGACAGAAA TGATATTTTC ACAAAATTTA TTTCGTCGTC CCACCCCAAC 96 0 

TTG CATTGTC TCTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGTCCCT GAATATAGCC 102 0 

TTGTAGAGTC TAGTACATTG ATTTGTATCC CAATGTCCCT ATAATTGATT ATTCGCTTTA 1080 

TCTAATGATC CTATGACTCA ACTATTAAAT CATTTTTCGA AATACTTAAT TCTAATATAA 1140

TTAAATTCAT TTATTGTAAT ATTGCAAAAA TACATTGCAC ACCTTGTTCA TCAATGCTAT 1200 

AATTAATTAC ATAATAAATT GAACATCTAA ATACACCAAA TCCCCTCACT ACTGCCATAG 1260 

TGAGGGGATT TATTTAGGTG TTGGTTATTT GTCACCTTTT TTATTGTTGC GCGTTCGTAA 1320

CCAATGTGCA AAAAACGCAA CAAGACAGCC G CTTAT AGCT GAAGTCATGA TGTTAATTAA 13 80 

TAAATTGAAC ATCCGTCATA CACCTCCTCT CTGCGTTAAA GTAACGCCCG AGATGTTAGG 144 0 

CGACCATCAT ATT AT AT CAT TTATTTATTA TATTTCACGC AATATTAAGG CTTAAGTAAA 1500 

GTTTTTTTTA GTGGTTTACG CTACTTTAAT TGCTATCTTT TAAAATCCAT TTAGATAATA 1560 

TAAATGTGAT GGGTATCGTA ATAATTAAAC CAGCAAATGG TGCAATTTCT GCTGGCAAAT 1620 

TTAGCCAGGA TACAAATACA TATAATAAAA CTGTTTGTAA GCTTACGTTG ACAATCTGCG 1680 

TAATTGGAAA ACTAATGAAT TTTCTCCAAG TAGGTTTTAC CCTGTAAACA AAATAACAAT 174 0 

TCAAATAATA TGAAATCACA AAAGCGACTA GAAATCCGGT AATATGACTA ATCATATATT 1800 

CAATGTGTAA TAATTTTAAC AGCAATAAAT AGACAACATA ATAATTTAAC GTATTAATGC 1860 

CGCCAACAAT GATAAATTTT AAAATTTCAG CATGCGTTTG TGTTAGTTTC ATATGTGTAc 192 0 

TCCTCAACAT CAAAATATAT GCATAACTAC GTTCTCGAAC ATACTCGAAT ATGCGAGCCA 1980

ATCCGCTTCA CTTCAAATAT GCTTATTTCA AT CTTT AT AC CCTTTCACAG CAAATTTAGT 204 0 

CTCTTTCCCC TCATCCTTAT ACGCCATTAT AATGTAACTG ATTTATCGCG TGACTCATTA 2100 

GCACTATAGA GATTACTTTA GTTCACTAGT AATTTTATAT ACAATAAGAG CGACAACAGT 2160 

AATGAGAGGA TGTCTACTAT GCAATTACAA AAAATTGTCA TCGCTCCTGA CTCATTTAAG 2220 

GAAAGTATGA CCGCACAGCA AGTTGGCAAT ATTATAAAAC AGGCTTTTAC TAATGTTTAT 2280 

GGGAATACCC TTCATTATGA TATCATTCCG ATGGCTGATG GTGGTGAAGG TACCACAGAT 2340 

GCTTTAATGC ATGCAACAGG TGCCACTAAG TATACAGTCA TCGTTAATGA CCCTTTAATG 24 00 
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GCGGCAGCGT CAGGTTTGGA TTTATTAGAA AAAGAGGAAC GTAATCCTTT ATACACATCA 2520 

TCATATGGTA CCGGTGAACT AATTAAAGAT GCATTAAATC ATGGTGCTAA GACCATTATT 2 580 

TTAGGGATTG GTGG CAGTGC AACAAATGAT GGTGGTACAG GTATGCTAAG TGCACTAGGC 264 0 

GTAAAGTTTA CTGATGTAAA CGGGGACTTA TTACAAATGA ATGGTGCTAA TCTTGCTCAC 2700 

ATTGCACAAA TCGATATAAC CAATCTAGAT TCGCGATTAA AAGAGGTGAC CTTTAAAGTG 2760 

GCCTGTGATG TTTCAAATCC TTTATTGGGT GAAAATGGTG CT A C CT AT AT TTATGGTCCT 2 820 

CAAAAAGGCG CTGATGCAAA GATGATACCA AAGTTGGATT TCGCAATGTC GCATTATCAT 2 88 0 

GATAAGATAA AAATGTG CAC AGGAAAGTCC GTTAATCAAA TACCAGGTTC TGGTGCAGCT 2 94 0 

GGCGGTATGG GCGCAGCATT ATT AG CGTTT TGTGAGACAA CTTTAACAAA AGGTATTGAT 3000 

GTCGTCTTTG ACATTACAGA TTTTCATCAA AGAATTAAAG ATGCAGACCT CGTTATTACT 3060 

GGAGAAGGAC GCATGGATTA TCAGACCATC TTTGGTAAAA CACCCGTAGG CGTTGCGTTA 3120

GCTGCAAAAC AATATCATAT TCCTGTCATC GCGATTTGTG GCAGTCTAGG CGAAAATTAT 3180 

CAACATGTTT ACGATTTCGG TATTGATAGT GCCTATTCTA TAATCTCTTC ACCTAGCACT 3240 

TTAGAAGATG TCCTACAAAA TAG CGAACAA AATTTATTAA ACACTGCAAC TGACATTGCT 3300 

CGTATTCTGA AATTACAATA ATGTCAAAGT AAATCATCAG CTTTATTATT TGCAGTTAAA 3 3 60 

ACTTGAATGA GGTGAAACCC ATGAAAAGAA CTGATAAATA CCGTGATTCA TATCAATACG 3420 

ACAATCAAAA CCAAAATCAT CGTCGTCAAT CTGAAGACGC ATCGTATAGA CAACAATATG 34 80 

CTAAAGGCGA TCCTGAAGAA CACCCGGAAC GATACTATAA TGGTAGAGAT TATCGAAGAG 3 54 0 

AACAAATTCT TGAAGAAGAA AACGAGAAAT CCCGCCGTTC AAAAAAATGG TTATATATCA 3600 

TTATTGCCAT TCTCTTAATT ATTGTCGCTA TTTTTGTCAC ACGCGCCTTA CTTAACAATG 3660 

ATAGCGATAA AGTTAGTAAT GACCCTAAAG TCTCTCAAAA TTATAAAAAA CAAGTTGAAA 3720 

ATCAAGACGG CCAAATTAAC CAGCAAGTAG ATAATGCTAA AGAAAATATT AAAAACAACC 3780

AAAAAACTGA TGACATTATT AAAAATTTAC AAAATCAAAT CGACAACTTG AAGCAGCAAG 3 840 

AACAAAACAA AGCTGATTCT AAGCTAACTC AATTTTATCA AGACCAAATC AACAAATTGA 3 900 

CAGAGGCAAA TAATGCACTT AAAAACAATG CAAGCCAAGG TAAAATTGAA AGCATGTTAA 3960

ATGATATTAA TACAAAATTC GACAGTATTA AATCTAAATT AGAAAGCTTA TTTAAAGATG 4020 

ACAATGGTGG CGCTAATTAA TTATTACACC TGCTTTGATG ATAAACATTA ATTCCCTATA 4080

CTTTATCTGT ATCACTACGT TATT CGTGAT GATGCATTAA GAGTATAGGG ATTTTTTATA 414 0 

TAAACTTGTA TTCTAACTAC ATACAAATAC ACACAAAACG TATATAATTT ATATAATTAT 4200 
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TTATTGCTAA TTACGTTAGG CGTCATGACC GCTTTTGGCC CACTAACTAT AGATATGTAC 4320 

GTACCATCAT TACCTAAAGT GCAAGGTGAT TTTGGTTCTA CTACATCAGA AATTCAATTA 43 80 

ACATTATCAT TCACAATGAT TGGTCTTGCA CTAGGCCAAT TTATCTTTGG ACCTTTATCC 444 0 

GATGCTTTTG GTCGCAAACG GATTG CTGTA TCCATTTTGA TCATTTTCAT TTTGGTATCA 4500 

GGTTTGTCTA TGTTTGTTGA TCAATTGCCA TTATTCTTAA CTTTACGATT TATTCAAGGT 4560 

TTAACTGGTG GTGGCGTCAT CGTGATTGCA AAAGCCTCTG CTGGTGATAA ATTTAGTGGC 462 0 

AACGCACTCG CTAAATTTTT AGCATCTTTA ATGGTAGTTA ATGGCATCAT CACTATTCTT 46 80 

GCACCATTAG CCGGTGGATT AGCTTTATCC GTAGCAACAT GGCGTTCTAT TTTCACAATT 474 0 

TTAACTATTG TGGCACTCAT CATTTTAATT GGCGTCGCTT CTCAATTACC TAAAACATCT 4800 

AAAGATGAAT TAAAGCAGGT GAATTTTAGT AGCGTCATTA AAGATTTTGG AAGTCTTTTG 4860 

AAAAAACCAG CATTTATTAT TCCAATGCTA TTACAAGGwT TAACTTATGT AATGCTATTT 4 92 0 

AGTTATT CAT CTGCATCGCC ATTTATTACT CAAAAATTGT ATAATATGAC ACCCCAACAA 4 980 

TTTAGTATCA TGTTTGCTGT TAACGGTGTA GGTTTAATCA TTGTCAGTCA AGTCGTTGCT 504 0 

TTATTAGTAG AAAAATTACA TCGCCACATA TTATTAATCA TTTTAACTAT TATACAAGTG 5100

GTAGGTGTTG CTTTAATTAT CCTGACACTT ACATTCCATT TACCACTTTG GGTCTTACTC 5160 

ATCGCATTCT TCTTAAATGT GTGTCCTGTG ACGTCAATTG GACCGCTTGG TTTCACAATG 522 0 

GCTATGGAAG AACGAACAGG TGGCAGTGGT AACGCATCAA GTTTACTTGG CTTATTCCAA 52 8 0 

TTTATCTTAG GTGGCGCTGT TGCACCATTA GTTGGCTTAA AAGGCGAATT TAATACATCA 534 0 

CCATATATGA TTATTATCTT CATTACAGCC ATTCTATTAG TCAGTCTACA AATCATTTAC 54 00 

TTTAAAATGA TTAAAAAGCA ACATGTCGCA TAACACTTCA ACATAATTAG AAC CCTAGCA 54 60 

AAGA^AT'CTA* TCrtTGTCAG ^TTCITCTT" TATGAATTAT^ GAGATCGAAT CTTCAACTAA 552 0 

AATTACGCCT TCATAGCAAG GACATTT CTA TTCAATCACC CTTTAACAGG CATCCAAATT 5580 

TcTGTAATAT ATTTTTCACT TGTAGTATCA CCAT 5614 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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AAAGACAATG 


ATATGAAGTA 


TATGfiATATP 


ALAGASAAAG 


ToLUiATGTC 


GGAATCTGAA 


120 




GTTAACCAAT 


TGCTAAAAGG 




I lAbAAAATC 




TTTTCTAGAA 


180 




GCTCAAGAAA 


AATATGAGGT 


TAATGTPATT 


T 21 TPTTYITT TA 




AGTAGAAACA 


240 




GGTAACGGCA 


AATCAGAATT 


AGCAAAAGGP 


ATTH Zi IflTV 


A TV TV 7\ TV 7\ ^V* 


CTATTACAAC 


300 




ttttttggta 


TAGGAGCATT 


CGATAGTAGT 


GCTGTTCGTA 


GTGGGAAAAG 


TTATG CTG AA 


360 


AAGGAACAAT 


GGACATCACC 


AG AT AAPp, PP. 


ATTATTGGTG 


GTGCAAAGTT 


CATTCGTAAT 


420 




GAATATTTTG 


AAAACAATCA 


ACTGAATTTA 


TATCAAATGC 


GATGGAATCC 


AGAAAATCCT 


480 




GCGCAACATC 


AATATGCGAG 


TG ACATTPG P 


TGGGCAGATA AAATTGCCAA 


ATTAATGGAT 


540 




AAATCCTATA 


AGCAGTTTGG 


TATAAAttAA A 


GATGATATTA 


GAP A A AP AT A 


TTATAAATAA 


600 




GACATCGGTG 


CTTAAAGGAG 


rrnnAArnT 

V— x vjvj/v\\»-r\/\ x 


TTATTGTTTP 

* X X X VJ XXX W 


P 2v p fTPPTTT 


AGCGCATTCT 


660 




GAGTGTGTTA 


GTTAAATGGA 


xxx innLLln 


ACAAAAAAPG 


PTATATAHrA 


TCAAATATGC 


720 




TATATCCCAC 


ATCATTGTTA 


P A A A TT2T 21 P 2i 


TGATGTAAAT 


fiA ATATTP.PT 

>JAn X /A X X VjV X 


GTCTAAATGT 


780 




GCATGTAATA 


TACAATGGTG 


PAPJATAATIVP 


ACTTAAGTCC 


TTAAAAATOA 


AACGTTAgTT 


840 




CCAAGAGTCA 


TTTTTAAA PA 




TGATAAAATA 


GAAAAG A ATP, 

vJ**J**^V**VJ^TJ^ X VJ 


AAAAATATAG 


900 




AGGTGACAAT 


ATGAAGATAG 


PAATTATAiV 


TGCAGGCATC 


GGTGGATTAA 


CAGCTGCTGC 


960 




ATTATTACAA 


GAACAAGGTC 


ATAPTATTi 21 
X n V. X X Inn 


AGTCTTTGAA 


AAAAATP,AP,T 


CAGTTAAAGA 


1020 


AATTGGCGCT 


GGGATTGGTA 


TPf^r; a n at a a 


TGTG CTTAAA 


AAACTAGGTA 

****** x *vnj\j x *r%, 


ATCATGACTT 


1080 




AGCTAAAGGT 


ATTAAAAATG 


PT(^(Tf3P A AfiT 


CTT AT CTACA 


ATGACAGTGT 


TAGATGACAA 


1140 




AGATCGCCTG 


TTAACTACTG 


TTAAATTA A A 

X X J~VVWA X X Mrt/i 


AAGTAATACA 


TTGAATGTGA 


x*^^^*h jhj*. 

CGTTACCACG 


1200 




CCAAACATTA 


ATTGACATTA 


TTAAATfTTA 

x xn-rvrt x X Xx** 


TGTAAAAGAT 


GACGCAATAT 


TTACAAATCA 


1260 




TGAAGTCACG 


CATATAGATA 


ATGAGAPAfSA 


TAAAGTTACC 


ATACATTTCG 


CGGAACAAGA 


1320 




AAGTGAAGCA 


TTTGATTTAT 


GTATTGGTGP 

w X -F^ X X ^J r \J X vv 


TGATGGAATT 


CATTCTAAAG 


TGAGACAATC 


1380 




TGTAAATGCT 


GACAGTAAAG 


TATTATATPA 


AGGGTATACA 


TGCTTTAGAG 


GTTTAATTGA 


1440 




TGATATTGAT 


TTAAAGCATC 




AAAAGAATAC 


TGGGGaAGAA 


AAGG aAG AGT 


1500 




AGGTATTGTT 


CCGTTATTAA 


ATAATPA Anr 


ATATTGGTT C 


ATTACAATTA 


ACTCGAAGGA 


1560 




TV TV X f* * TV rprt » m 


AAATATAGTT 


CGTTTGGTAA 


ACCTCATTTG 


CAAG CAT ACT 


TTAATCACTA 


1620 




TCCAAATGAA 


GTTAGAGAGA 


TCTTAGACAA 


ACAAAGTGAA 


ACAGGTATCT 


TATTG CAT AA 


1680 




TATTTATGAT 


TTGAAACCAC 


TCAAATCTTT 


TGTTTATGGT 


CGTACTATTT 


TACTAGGAGA 


1740 




TGCAGCACAT 


GCGACAACGC 


CTAATATGGG 


GCAAGGTGCT 


GGACAAGCAA 


TGGAAGATGC 


1800 






EP 0 786 519 A2 

TAAAATACGT GTCAAACATA CTGCAAAAGT AATTAAGCGT TCTAGAAAAA TCGGTAAAAT 1920 

TGCCCAATAT CGTAGTCGTT TATTTGTTGC AGTTAGAAAT CGTATTATGA AAATGATGCC 1980 

AAATGCATTA GCAGCTGGAC AAACTAAATT CTTATATAAA TCGAAAGAAA AATAATACAA 204 0 

CAATATGAAA ACCCCCGTAT GTTGAAACGA GAGCTCAACA TATGGGGGTT CTTGTTTTTA 2100 

TAATGTTATT ATAATAAATT CAATTATTAG TTAACGACAA ATTGTGGTTT CTCACCTTGA 2160 

ACGGCACTAA TTG CAGCATT AGCAACAATT TTAGACATCA TGTCACGTGC TTCAAATGTA 2 220 

GCATTACCAA TATGCGGTGT TAATACTACA TTATTAAGTG ATTTTAAGT C ATCGGTAATA 2280 

TCTGGTTCAA ATTCATATAC ATCAAGTGCA GCACCTTCAA TTTCATT AT C TTTCAATG CT 234 0 

TGCACTAGTG CTTGTTCGTG CACGATTGGA CCACGAGAGG CATTGATTAA ATACGCCGTA 24 00 

GATTTCATCA TTTTAAATTG TTCTGTATCA ATTAAATGAT GCATTTTAGG ATTATAAGCA 24 60 

GCGTTGATAG TGATAAAATC TGCATTCTTT AATAGTGTAT CTAAATCTAC ATATTTTGCA 2520

CCGATTTCTC G1TCTTTTTC TTCTTTG CGA TTAGGTCCAG TGTATAGCAC ATCCATGTCA 2 580 

AATGCTCTTG CACGACGAGC TACTGCACTA CCAATTTCAC CTAAACCGAT AATGCCGATT 2 64 0 

GTTTTCCCAG ATACTTCTCT ACCTCTGAAA AATAAAGGTG CCCATCCATC AAATCCAGTT 2700

GTACGTGATA ATTGGTCCCC TTCAACAATA CGACGCGCTA CTGCAAGTAC TAATCCAATT 2760 

GTTAAATCAG CAGTCGCGTT TGTTGATGCT TTAGGTGTGT TTGTAACATC TATACTTTTT 2820 

TCTCGGGCAT ACTCGATATC AATATTATTA AAACCAGCGC CATAGTTGGC AATGATTTTT 2 8 80 

AAGTCTTTAC CAGCATCGAT AACATCTTTA TCAACGTTTG TAGATAATAA ACTAATTAAG 294 0 

GCAGTCGCGT TTTTAACACC TTTAATTAAA GTGTCTTTAT CGACTAATCC TTTACCTTCA 3 00 0 

TACATTTCAA CTTCAAAATG TTCTTGTAAA AGTTTTAAAC CTACTTCTGG TATtGCACCA 3060 

gCAACATAAm CTTTTtCCAT AAAAGACCAC TCCTTTTATC TTAGTATAGT AGAAGATTAG 312 0 

ACAGTATACA ACTATGTCAT GATGTCTTGT GTATCAATGA TGTAAGCGCG TACTTTTGAT 3180

GGAGGCGATA TAACTTAGGC ACTGTAGAAC TATGAATATT GTAATGTGGA AAAACTGGAT 324 0 

CAATTAAATT AGATAACGTA GTTTTAAAGT TAATAGTATT AGAAAAAATT AATATTTTGA 3300 

ATATGGGAGG AAATATAAAT AAGTAGGTGG CAACGAAAAA TAGCAAAAAA AGAGCTTCTC 3360

CTATAAAGGA AAGCTCAAAG TTTTTTGATG ACATATGTAC TAGAATTAAG TTTCAAGACA 3420 

ATATGTATCA TCGTGTTTAT ATTAAATATG GATGTAGTTG TAGTTACCTG CTTCACTTGC 34 80 


AGAAATAGTT CTAGAACTTA CTGAGAAAGG TCCGCCACTA TAATTCATTT CTGAAATTGT 354 0 

AACTGAACCA TCACTGTTTA CACTTTCTAC AT ATG CAACG TGACCAAATG GTCCTTCAGA 3600 
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AGCAGCAGCC CAATTATTAG CATTTCCCCA AGTAGAACCG ATTTCTCCGC CAACTTTATC 3 720 

ATATACATAC CAAGTACATT GTCCTGCAGT GTATAAGTTA CCAGAATGTG AAATTGATGA 3780 

TGTAGTTGTC GTAGTTGTCG TAGTCGTTGT AGTTTGAGTC GTGTTGTAGT TATAGTTGTT 384 0 

GTAATTTGTA TAATTTTCAG CAGCATCTGC ATGATGTGCT TGACCTACTA ATGCTGTGCC 3900 

GATTCCTGCT GTTAACGTAG TTGCTGTTAC TAATTTTTTC ATGAATAAAG TCCTCCAAAG 396 0 

TTCTATATCT TTTTTTATAA ATAAAACGTA GCGACTGTTT TATTCTCACA TCTCGAATTG 4 02 0 

ATGACAATAG TTACTTTAAC AAAATtAATG CTTCTTGTGG GGAATGTTAT TGATTTGTAA 4 08 0 

AAGAATAAAA AAACTTTGAC TAATTTTGTA ATAAAAATTA GTCAAAGTTA CAATGAGATT 414 0 

AACAGATAAT TAATAGGAAA TATTTATTTG TAATATGTTT AAATAAATCG AATTGTTAAA 4200 

GGTATTATAT ATTCTTGGCC ATTATAATAT TTGACACACG CAATAATTGT GAATACAAAA 426 0 

GATAATATTG AGAAAGCGAA TATGGATAAA ATACCGATAA ACGTAATGAT GAAACCTATA 4320

ATAATAATGA AATCAATATC TGTAGCAATT AGGAAAACGC CTATTAAAGT GATAACGACT 43 8 0 

AAAACGATAG ACCAAATAAT ATAAGAAATC GTATAGTTAA GATAATTTTT TCCAGCACGA 44 4 0 

TCAACTAGTT TCGATTCATC TTTTTTCAAT AACCATATTA TCAGTGGACC AATAATAGAT 4500 

GTGAATAAAC TTAATAAATA GATAAGCATC GCCATAATGT TCTCATCATT GGATTTGCGA 4 560 

TTCGGTTGAT GATTTGTTAC GTCGTTCATT TCAGTTGTCA TATTAGACAC TCCTTTGAAA 4 620 

ATTGTAATAT TATCTTTAAC TATAACAAAA TATAATCAAA AATAAACATG TTTATTAAAC 4 6 80 

AATTATTAAA AATAAAAATA ATTGGTGGAC GTCGGCGTTT AAATAGGTTA ATTTAAGGTT 474 0 

ATATATACTT AACATTTATA ATGATGCGTA ATGAATTCGC ATCATTTTTA TATTGTCTTA 4 800 

CGTATAATTT GTTTTTAATT TTAACCAAAG ATAGAAAGAG GGTTGTTTAT GAAAATAGCA 4860 

ATTGTAGGAT CAGGAAATGG CGCAGTTACG GCAGCAGTAG ATATGGTGAG CAAAGGCCAC 4920 

GATGTTAAAT TATATTGTCG TAATCAATCT ATAAGTAAGT TTCAAAACGC AATCGAAAAG 4980

GGCGGATTTG ATTTTAATAA TGAAGGTGAT GAACGTTTCG TAAAATTCAC TGATATTAGT 504 0 

GATGATATGG AATATGTTTT AAAAGATGCT GAAATTGTTC AAGTGATTAT TCCATCTTCA 5100

TACATAGAGT ATTATGCTGA TGTAATGGCA GAGCATGTAA CTGATAATCA GTTGATATTC 5160 

TTCAACATGG CTGCAGCAAT GGGGTCAATT CGTTTTATGA ATG TTTT AG A AGATAGACAT 5220 

ATTGAAACAA AACCACAACT AGCGGAAgcT AATACGTTGA CGTATGGTAC GCGTGTCGAT 5280 

TTTGAAAATG CAGCAGTTGA TTTATCTCTA AATGTACGTC GTATCTTCTT TTCAACATAT 5340 

GATAGAAGCT GTCTAAATGA TTGTTATGAC AAAGTTTCAA GTATTTATGA TCATTTAGTA 5400 
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CCAACATTAT TGAATGTCGG TCGCATTGAT TATGCTGGCG AGTTCGCTTT ATATAAAGAA 552 0 

GGAATTACTA AACATACAGT TAG ATT A CTT CATGCAATCG AATTAGAACG TTTGAATTTA 5580 

GGCCGTAGAT TAGGTTTTGA ATTATCAACA GCTAAAGAAT CACGTATTGA ACGTGGTTAT 5640

TTAGAACGTG ATAAAGAAGA TGAACCATTA AATCGTTTGT TTAATACAAG CCCAGTATTT 5700 

TCACAAATTC CAGGACCAAA TCATGTAGAA AGCAGATATT TAACTGAAGA TATTGCATAT 576 0 


GGTTTAGTAC TATGGTCAAG CTTAGGTCGT GTTATTGATG TACCGACACC AAATATAGAT 58 2 0 

GCAGTAATTG TAATTGCATC AACCATTTTA GAGAGAGACT TCTTTGAGGA AGGCTTAACA 588 0 

GTTGAAGAAA TTGGTTTAGA TAAGCTTGAT TTAGAAAAAT ATTTAAAATA AATGATGGCT 594 0 


TGAAGATAGA AAAGGATATA GCATTATGCA AAAGCAATAA ATTGAAGAAA AGAGGTTTCT 6000 

CATCAATAAG CGnAGGGGAC GATAGATGAT GAAAAGAAAA CCCACCTTTT TAGAATCAAT 606 0 

TTCGACAATG ATTGTAATGG TTATTGTTGT TGTAACAGGC TTTGTGTTTT TTGATATTCC 6120

AATTCAAGTA TTATTAATTA TTGCCTCAGC ATATGCCACA TGGATTGCAA AACGTGTAGG 6180 

CTTAACATGG CAAGATTTAG AAAAAGGCAT TGCAGAACGT TTAAATACTG CAATGCCTGC 624 0 

AATTTTAATT ATACTAGCGG TAGGAATTAT AGTAGGCAGT TGGATGTTTT CTGGCACAGT 6300

GCCAGCCTTG ATTTATTATG GCTTAGATTT ATTGAATCCA AG CT ATTTTT TAATATCAGC 63 60 

CTTTTTTATA AGTGCTGTTA CATCTGTAGC AACTGGTACA GCATGGGGCT CTGCATCAAC 64 20 

TGCAGGGATT GCACTTATTT CTATTGGTAA TCAATTGGGG ATTCCTCCAG GGATGGCAGC 6480

GGGTG CT ATT AT AG CAGGGG CTGTGTTTGG CGATAAAATG TCACCATTAT CAGATACAAC 6 54 0 

TAATTTAGCG GCG CTTGTTA CTAAAGTTAA TATATTTAAA CATATACATT CGATGATGTG 6600 


GACGACGATA CCTGCATCAA TCATAGGTTT ATTAGTATGG TTTATTGCTG GATTTCAATT 6660 

TAAAGGGCAT TCAAATGATA AACAGATTCA AACTTTGTTA TCAGAG CTTG CACAGATTTA 6720 

TCAAATTAAC ATATGGGTCT GGGTTCCCTT AATTGTGATC ATTGTTTGTT TGCTATTTAA 6780 


AATGG CT ACA GTGCCAGCTA TGCTAATATC . AAGCTTTTCT GCCATTATAG TGGGGACTTT 6840 

TAATCATCAT TTCAAAATGA CAGATGGTTT CAAAGCAACA TTTAGTGGTT TTAACGAATC 6900 

AATGATACAT CAGTCTCATA TTTCATCCAG TGTGAAAAGC TTGTTAGAAC AGGGTGGTAT 6960

GATGAGTATG ACCCAAATAT TAGTAACGAT ATTTTGCGGA TATGCATTTG CAGGTATTGT 7020 

AGAAAAAGCA GGATGTTTAG AAGTCTTATT AACTACTATT TCTAAAGGCA TCCATTCTGT 7080 

AGGAAGTTTA ATATGTATTA CTGTTATTTG TTGTATTGCG CTTGTATTCG CTGCAGGTGT 7140

TGCTTCGATT GTAATTATTA TGGTCGGTGT GTTAATGAAA GATTTGTTCG AAAAATACCA 7200 
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AATACCATGG 


GGAACATCAG 


/"IT* A" f w i - 1 T A f """1*21 


xACGAATCAA 


CTTCATGTCT 


CTGTTGAAGA 


7320 




ATTTTTCATA TGGACAGTAC 


PATPTTATTT 
v>^V X VJl 1 r\ XXX 


ft. r TV , /"V , ^ft ft^*T» 


ATAGCAATTA 


TCTATGGTTT 


7380 




TACAGGGATA 


GGTATTAAAA 


AGTPATPPA A 


X X v_>\V_\j 11 1A 


ft ^"«*T"i» ft ft tiviy^ 

ACTTAATGTG 


AGCGTGGAAT 


7440 




ATATATAATA 


TGTTGAAACA 


CTTTAATCAT 

V» X X A /W X V— rt X 


X inlii/ii, A\»j X 


AvjCLtOx xAxA 


ft rryinrv ft ft ft ft 

ATTTGAAAAG 


7500 




GTTTTAACTT 


AGAATAAATA 


TCCTCTATYir 

* V-v 1 ^» X f\ X VJi— 




TATGTTTTGT 


AGCGGAACAT 


7560 


GTTGATATAT 


GTAATGTAAG 


TTTT A Tf^Tr A 
A X X X r\ X O X 


Tf"^ fi. T - "l "1 V' ,r P ft yv 
IvjAI J. 1 vj 1 AA 


rj-»/-i » pni A ft ft 'T**T* 

I UACTAAATT 


AATTGAGAAT 


7620 




TTGAAGGCAA 


GTATATTTGT 


AAHTArTTTIV 

«VlV3 X A\V_ XXX >\ 


& ( " T ' ft ft. ft ft ft T**'l * 

1 AAAAA1 1 


I ATCAATGTA 


TAGCCGATTT 


7680 




GACATGCCTA 


AATTTGGGTG 


TG TP A A TPP P 


lv3 lAlb I 1 1 


TTATTCTTTA 


TTACAGAGTG 


7740 




AATCGGATTG 


GTGAAAATCG 


X X X X Vj/iVJ 


nilll 1 A\-VwA 


ATT CGATTTT 


TTTCATAGAA 


7800 




ATTAAAAAAG 


CCAACAAGGC 


Tf^TTr* A A & 


1 X Vj I xVjOCGT 


AAA CAT AG CC 


AT CACTAATT 


7860 




AGTGAATGAA 


GTTATAACCA 


wv„/VjV», X X 


1 AbL 1 vjACiA I 


rGTACGTGAA 


GTTACAACAC 


7920 




CTGGGCCATA 


ACCATAGTTC 


a TTTPm zv n & 

■rt X 1 1 V_ 1 Kxr\MJ\ 


1 AC- 1 CsA 


AC CATTG CTG 


TTAACACTTT 


7980 




CAACGTATGC 


AACGTGACCG 


TATPPAPPTT 
X A X vj\ — nk»L X x 


\-sAvj 1 Ibll I vj 


CATAATTGCA 


ccag ci^rixs 


8040 




GTGTATTGTT 


CACTGTGTAA 


ft. f " I 'f™**! T'f"* 
v- v — M.vjV_ i L 1 ib 


t-ACjCTG CG I I 


AG CCCAGTTA 


CTTG CATTG C 


8100 




CCCAAGTTGA 


ACCGATTTTA 




A TV"* ft ft ft T» IV /~» 

VjA 1 LAAA I AC 


GTAGTATGTA 


CATTGACCAG 


8160 




AAGTGTATAA 


GTTACGTCCT 


P A 2iPTATA RP 


LHL I 1 IjAvjA I 


TGAACGGC CA 


TTTGATGATG 


8220 


GAGCCATAGT 


TGTAGTTACT 


TP A APATTVT* 


ioLl xvjAAvjI 


G CTGT AG CTT 


GCACCTAAAC 


8280 




CACCAGTACG 


GTAGCTGTTT 


PTPTTPT A Af 
VJ X Vj X X W x /vrt\». 


* I ■ ft ' 1 M 1 ' ft. T 1 ft /™*'T V "P 
XAX XAXAoi i 


ATTGTAGTTA 


TATGATTGAT 


8340 




TATTATTTGA 


GTAGTTGTTG 


T A A PPP PTY^T 
x nnv.ouv. 1 vj. X 


Avj x 1 A 1 1 Vj I A 


/**/*"T'ft T'ft ft f~*/~*(~"* 

G CT AT AAC CG 


TTGTTGTAAT 


8400 




TGTTATAGTT 


ATTGTAACCA 


T"TPT APT A rtT 
x x vj x rvvj x rv«? X 


An 1 AvjC 1 Vj 1 A 


Qj I AGCCATTA 


TCTTGGTTTA 


8460 




ATTGACTTGG 


ATGCCAGTTA 


fPTTTP P ft TV" 1 


1 Vj 1 AM IvjAj 1 A 


GTTACCTTGT 


GCATCAATAG 


8520 




TGTAAGTATA 


GCTATATGAT 


GTTPPPTPPT 

W X X VJXTVJ X Va-VJ X 


1 x Vj\j/\1 XA1A 


ACCIjTAGTTA 


TCTTGCTCAG 


8580 




AAGCATGAGC 


TTGATTTCCT 


P ATPPA ATTP 


V-VjAX ITjIAuv. 


/"* ft ft T*/^/ *I /^» TV 

vjAAi LCTGCA 


GTTGCGATAG 


8640 




TAGCTGTAGC 


\Jf* XXX IV. 1 1 L 


ATTTTAAAAA 


1A1LL1LL IA 


AAAATTTTAA 


ATCTAAAATA 


8700 




TTTTCGTAAT 


GTCCGTGTGA 


PA A A ATT A 2VT 


ft Tft ft 

bl lAi AAVj I X 


ATCTCTCGTA 


ATTAAACGAC 


8760 




AAGAAAGACT 


ATAACAGAAA 


TTAGCGTCCT 


TGTGTGCTTT 


GTTAACGTTT 


TGTAATTTTT 


8820 




TGCTAATATC 


TTGACACAAT 


AGAATTTTAA 


AAGTATAGAA 


ATTTGCATTT 


TGCAAAACTT 


8880


ATAACTACGG 


CATTCTTTGT 


GAAAACTGAA 


TGTTTCGAAA 


ATAAGTCTGT 


TACAAATTTG 


8940 




TAATATTACT 


GAAAATTCTA 


AATGTATATT 


TTGTGCATAA 


TATAGGACTT 


TTAATCAGAA 


9000 
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GGATGAAAAT GTATATTTAA TGGATAAAAT ATCCTAATTT AGCATAAAAA AATGTTTTAA 
TAAAAGTATT ATTTGATATA ATCGATTTAT GTTTTGTTAC TGCTAAAAAA CATGTGGCG 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:
CCTTCAGCCA TTTGACTTCG ACATGAGTTG CCTGTACATA TAAAATAAAT TGTTTTTTTA 
GTCATAACAA TCTCCTAATT AATTAAAATA TGATAAGTGT TAGATACAAC CCTATGAGGG 
TTATAAATAG TACTGGAATT GTAATGATGA TACCAGTTTT AAAGTATGTG CCCCAAGAAA 
TCTTAACATC TTTTTG t GTT AAGACGTGTA AC CACAGTAA TGTAGCTAAA GAGCCTATCG 
GTGTAATTTT TGG AC CT AAA TCAGAACCGA TAACATTCGC ATAAATTAGG CCTTCTTTTA 
ACATG CCATG GACATTTGAT TGACCAATAG CAATCGCATC TATTAAAACT GTAGGCATAT 
TATTCATTAT TGATGATAAA AACGCTGAAA TGAAGCC CAT TCCCAAAATA GTGCTAAATA 
GACCGTAATT GGAAATATAT TCTAATATTT TAGCCAATAT TAAAGTAATG CCAGCATTTC 
TTAAGCCGAA TACGACGATA TACAT AC CAA TTGAAAATAA TACTATATTC CAAGGTGCGC 
CCTTAATGAC TTGCTTAATA TTT ACAG CAT TTGATTTACG AGCCAACATT AGAAAAATAA 
AAGCAATGAT TCCAGTGAAA ATTGATACCG GAATTTTAGT AAATTTACTG ATT AGAT AG C 
CGAAAAGTAA TATAACTAGA ACAATCCaTG AAATTTTAAA TAGCTTTAAA TCATTAATGG 
CATCTTTAGG ATGCTTTATA TTATTATCAT CAAACGTTTT AGGTATCGCT TTTCTAAAAT 
ATAACCACAA TACTATAATA CTTGCTAAAA GCGAGAATAA ATTAGGTATA ATCATTCTAC 
TAAAATATCG AACGAATCCT ACATGAAAAT AATCAGCAGA TATAATATTC ACT AGATTG C 
TCACGATTAA AGGTAAAGAA GTTGTGTCAG CTATAAAACC ACTCGCAATA ATnAAAGGGA 
ATATGGCCCG CTTACTAAAA C CT AT ATTTT TAACCATCGC TAATACAATA GGCGTTAAGA 
TTAAcGTGCG CCATCATTTG CGAAAAATGC AGCAACAATG GCACCCAATA ATATGATATA 
AACGAACATT TTTAAACCAT TGCCTTTTGA AGCATGAAGC ATGTGAATAG CTGACCATTC 
GAATAATCCA ACTTTATCTA ATATTAATGA AATAAGAATG ACTGAGACAA AAGTCAAAGT 
AGCATTCCAA ACAATACCTG TTACTTCGAA AACATCGGAA AAACTTACAA CACCAGTAAT 
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TAATACAAAT 


AATAAAGTTA 


CTAGAAAAAT 


GAGTGTCGCT 


AAAGTTGTCA 


T CATT AG CAT 


1380 




TCACCAGTCT 


TAAGGTTATG 


ACAAATACAT 


CGTTGGTTAG 


AGGTATGAAC 


CTTAGACAAG 


1440 




TTATTAATTA 


CGGACTCAAA 


AATATTATGA TTg AG CTGGT ATAAATGTTT ATTTCCGATT 


1500 




TTTCGTGTCG 


TAACTAAGTT 


GGTTTTTACT 


AATGCTTTCA 


TATGrTAGCT 


AAGTGTAGGT 


1560 




TGAGAGAATT 


GAAAATGTGC 


TAACAAATCA 


CAAGCGCATA 


ACTCTCCACA 




1620 


TCTAGTATTT 


CTAATCTGCT 


TGAATCTGAT 


AAAACTTTTA 


AAAATGTTGC 


TAG TT CTTT A 


1680 




TACGTCATAA 


CATACCTCCT 


AGACGTTAAA 


TAGATTATCA 


TCTATATAGA 


TGAATGTCTA 


1740 




TGTTCCTTTG 


GTATATTACA 


CGATATGACT 


ATGTAATTTA 


AATTTGGTTT 


TAGTATTAAA 


1800 




AGGGTATTAA 


AGATAAATTA 


TAGATATTGA 


TTTTGCAAAA 


TATACTCTTT 


GTTCTGCATT 


1860 




GAAAAAGG 












1868 




(2) INFORMATION FOR SEQ ID NO: 102: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15249 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:






ATTTATGAAA 


TCCATAG CnA 


TAAACATTAT 


TCTTGCATCG 


GCTATACAAA 


X X AVv \ Kj 


60 




AAGCAAATTT 


GTATATCAAC 


CTGGAATTGT 


GTTCACGTCA 


ATGGCaAATG 


CCGATGATGT 


120 




GTTATCAGGC 


GATAGTTATT 


TTATGGCTGA 


ATTAAAATCT 


ATTAAGCGTA 


TTGTTGAAA T 

X X w X X \J/Wi J. 


180 




TCCAGATAAT 


CAAAAAATAT 


ACTGCTTTAT 


AGATGAAATT 


TTTAAAGGTA 


CCAACAPA A r* 


240 




TGAACGAATT 


GCCGCTTCAG 


AATCAGTACT 


ATCATTTTTA 


CATGAAAAAT 




300 




AGTTATTGCA 


GCAACACATG 


ATATTGAGTT 


AGCTGAACTC 


TTAAAACAAC 


GTTATGAAAJV 


360 




TTACCATTTC 


AATGAGGTAA 


TAGAAAATAA 


TAACATACAT 


TTTGATTACA 


AAATTAAGCC 


420 




TGGCAAAGCA 


AATACACGTA 


ATGCCATCGA 


ATTATTAAAA 


AT CACTT CAT 


TTCCAGCAAA 


480 




AATATATGAA 


CGAGCAAAAG 


ATAATGTCCC 


GAAAATTTAG 


CATTTAACTT 


TAAACATAAA 


540 




AACGTCAGCT 


ATCACATGAC 


AGAAGACTAT 


GAACAGTTTC 


AATAATGTTC 


ATAGTAATCA 


600 




TGTTAATAAC 


TGACGTTTAT 


TTTATTCTGC 


AGAATACTCT 


TCTAAATCTA 


TATTGCTGTG 


660 


CCCATTTAAT 


GCTAAATCAG 


CAAATCGACC 


TTGCTGATAC 


AAATAGTGGC 


CGGCAACGCC 


720 




TATCATTGCA 


GCATTATCTG 


TGCATAATTT 


AGGACTTGGG 


ATAGTTAATT 


GAATGTCATT 


780 
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AACAATTAAT CGCTGAACAC CATATTCTTT ACAAG CTTG A ATAGCTTTAA ACGTGAGCAC 900 

CTCTACAACA CTGTTTTGAA AGCTCGTTGC TACGTTAGCT TCAATGATTG GaATATTTTT 960 

TTGTCGTTGA TTGTGAAGTT GATTGATTAC GGCACTTTTC AACCCACTAA AACTAAAATC 102 0 

ATAACTATCT TTATCCAACC AAACACGAGG GAATGAATAA GTATCTTCAC CTTCAGCAGC 108 0 

CAACCGATCA ACTTGTGGAC CACCTGGATA ATTTAAACCA ATTGTTCGTG CCACTTTATC 114 0 

ATAAGCCTCA CCTACTGCGT CATCTCGTGT TTCACCAATG ACTTCAAATG ATAAATGATC 12 00 

CTTCATATAA ACTAATTCAG TATGTCCACC TGAAACAATA AGTGCAATTA GCGGGAATGT 126 0 

TAATGGCTCT TCTATGTGAT TAGCATATAT ATGTCCTGCA ATATGATGAA CAGGAATAAG 1320 

TGGCTnATCG TAAGCAAATG CCAATGCTTT GGCTGCATTA ACACCTATTA GTAACGCACC 13 80 

AATTAGTCCA GGGCCTTCTG TAACCGCTAT GGCATCAATA TCTTCTATTG ATACATCGGC 144 0 

ATCCCCTAGA GCCTCGTTTA TTGTTGCTGT TATACCTTCA ACGTGATGTC TACTTGCCAC 1500

TTCGGGAACG ACACCGCCAA ATCGTTTATG ACTTTCAATC TGACTTAAAA CTGTATTTGA 1560 

TAAAATATCT CTGCCATTTT TTATAACACT AACGCTTGTT TCATCACAAC TTGTTTcAAC 162 0 

AGCTAGTATT AATATATCTT TAGTCATTTA AATTCACCCA CATAACCATT GCGTCCTCAC 1680 

CTTCACCATA ATAATTTTTA CGTTTACCAC CATATTGAAA TCCTAAATTT T CAT ATA CAT 174 0 

GTTGTGCCAC TTTATTATTA ACTCTTACTT CTAAACTCAT CACATCACAA GTGTGACTTG 1800 

CATAGTTTAT TCCGTATTTT AAAAGCATTT GACCTAAACC ATAGCCTCTA TAATTATCAT 186 0 

CGATTGCAAC TGTTGTAATT TG AG CTTG AT CGATAACAAT CCATAAACCT AAATAACCAA 1920 

TAATTTGTTG TTCAAATTCt AAGACAAAAT ATTTCGCAAA GTTATTTTGC TCTATTTCAT 1980 

GATAAAATGC GTCAATTGTC CAAGAACTGT CATTGAAACT CCGACGCTCA AGATCAAAGA 2040 

CTTQTGGCAC ATCTTCTTTA GTCATCTCTC TAATGTTTAA TTGTTCTTTT GACTGTTGAT 2100 

CCAATTTCGT TCCGCCTCAG CTAATTTATG GTATTTAGGA GTAAATGTAT GTACGTCTGA 2160

AGGTTTATCT AGCAATTGAT ACATGACTGA TGCATTTGGT AGctGCGCAA TCACTTCACC 2220

TTGTAATTCA TCTTGTAATT TTACAGTATC TTTCCCAATA TAAATAAATG GTTGGTTTAA 2280

ATCTTCTAAA AAAGCTCGCA ATGCCTCTAT CGACATATAT TGATCTTCTA AAATAGTCAC 2340

TAATTGACCA TTTTGCCACT GGAATATGCC TGTATAAACT GCTTGTCGTC TTGCATCAAA 2400

CACAGGAACC AATAATTTAT CAGTATGATC GATTGTTGCT GCCAATGCCT TTAATGATGA 2460

AACACCATAT AATTTAACAT CTAACGCATA CGCTAATGTT TTAGCAACAG TAACACCGAT 2520

ACGTAAGCCA GTATATGAAC CAGGACCTTC AGCAACAATA ATCGCATCTA ATTGCTGTTT 2580 
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TTGTTTAGAA 


TrrnT a niT i\ 

AV.^.\3lKul 1A 


1 i ICAGCTAA 


AACTTCATCG 


TTTTGCATCA ATGCTACTGA 


2700 




TAATGGTTGA 


TTPP. A TP.T h. T 


A A TV"*"* A /""*/""• a /*"» 

LM IVjAG CAG 


CGAATTCATG 


\Jf\ X 1 1 vtV. v- 


1 1 X MA x 1 1 


2760 


5 


GTTCATAATG 


TTP'PPPT^TYl r^ 1 


PPP A A P A A 


CAATTTGTCT 


x vj x f\ x xxx 




2820 




AAATGTTAAT 


AG AT AAA TP, P 


P.*PPPP r PP/" , « A iv 


GTAAATCTTT 


TAT AAATTn A 

X f \ X nnn X X Vjx\ 


PTPPATTP A a 


2880 


10 


TAACAGTAAT 


TCrPTTJATrT 

X VJV_ V»» X 1 L 1 


TTPP A A A A A T^T* 


CATCAAATCC 


TAAATPTTPA 


1 V—rVvj AA 1L1 1 


2940 


CTAAGCGATA 


APAATPPATA 


T*P A T 1 /"* /~" TV tv tiwp 


TTAAA 1 i'l'l'l 


APPPPTATAT 

"■V_ v,v,Ul >\ X >\ X 


wAl 1 IAAUjA 


3000 




TGTTAAATGT 


PGGGGAATTA 


A l l»<j 1 ACGTC 


TTACACCAAG 


"UV, XXX 1 LL 1 


TATB T\ A TTP /~T^ 


3060 


15 


TTAACGTTGT 


TTT A PPTfZ C^V 


t-t_ 1 AAATCTC 


CGTTAAGTAA 


AATPAAATPA 
•«-rvx Wvull 


v-vAL 1 l"i"rCA 


3120 




ATTGCTCAAC 


TAAAAATAT7A 


P P A A A '1 wi v*i » rr» 

Ov-AAATrTGAT 


TCATTT CATC 


TA A ATTATTT 


HTPTTTR TV* A 

A 1 u 1 1 1 ATCA 


3180 




ATGTTGATTP 


TPPT A T A TT A 


1 CaU ill l'CAT 


TCATAAAAAT 


fl&TTATrffiT 
VJAI 1A1V.LA1 


Ivjli t— AATCG 


3240 


20 


TAT CT AACTT 




(_ 1 1 1ATATTG 


TAACAAATTT 


C TV R^TTS RAT 
v-rtAL 1 i AAA 1 


1 I CTTATCTT 


3300 




TG AAA p AH A T 


liilLlAl 1 L_A 


AAGTTAATTG 


TAAGaaAATT 


*T7i 7A 7A A *"P A ' 1 " t*M' 
lAflrtMlAl 1 1 


bTTGACATAC 


3360 




TAAAP,PAP.AT 


A x 1 AAA 1 I 


AAATTTAT CA 


AATTTTTAGA 


P A A'PTP*T'2i BP 
V— ni\ 1 1 CI AAU 


I AT. TAAAGTG 


3420 


25 


AT AT AT A P P A 


x 1 CAI_Ij<jAA(j 


GAGTATAATA 


aaatgcttaa 


TCAATATACT 


GAACATCAAC 


3480 






AAA Tp A 'I **T* TV *T*^P 


ATTTTATTAT 


ACTCTTTAGG 


ACTCGAACGT 


TAgTAAATAT 


3540 


30 


TTACTAAAPn 


"1 M 1 •TV A P TPP 


TATTTCTGTT 


tgaatgggac 


TTGTAAACGT 


CCCAATAATA 


3600 


TTGGG A CGTT 


1111 iAILii 1 


TTATCTTTCA 


attacttatt 


TTT ATT A CT A 


TAAAACATGA 


3660 




TTAATCATTA 


AAATTTACGG 


P/"**/"* TV TV riHinn tv « 


TATGCGAa CG 


AgcATGATCA AAAAAGGAGA 


3720 


TCACCAAGCA CCAGCAAGAA GTCTTTTACA TGCCACGGGC GCGCTAAAaA GTCCAACTGA 3780

TATGAACAAA CCATTTGTAG ^ i • a n m i ■ i v^^« ■ < tx * CTCTTATATT GATATTGTTC CTGGACATGT 3840

TCACTTGAGA GAGCTTGCAG ATATAGCTAA AGAAGCAATT AGAGAAGCCG GTGCCATTCC 3900

ATTTGAATTC AATACAATTG GTGTTGATGA TGGAATAGCT ATGGGACATA TCGGAATGCG 3960

ATATTCTGTA CCATCACGTG AAATTATTGC AGATGCAGCT GAAACTGTAA TTAACGCTCA 4020

TTGGTTTGAC GGCGTATTTT AUAl 1C CTAA TTGTGACAAG ATTACACCCG GTATGATTTT 4080

AGCAGCCATG AGGACAAACG TACCAGCTAT CTl^-l-GCTCT GGTGGACCAA TGAAAGCTGG 4140

CTTATCTGCA CATGGAAAAG CATTAACACT TTCATCAATG TTTGAAGCAG TCGGCGCATT 4200

TAAAGAAGGA TCGATTTCTA AAGAAGAATT TTTAGATATG GAACAAAATG CCTGCCCTAC 4260

TTGTGGTTCA TGTGCTGGGA TGTTTACTGC AAATTCAATG AACTGTTTGA TGGAAGTTTT 4320

AGGTCTAGCA TTACCATACA ACGGTACTGC ACTTGCAGTC AGTGATCAGC GACGAGAAAT 4380

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TATCGTTACT CGCGAAgCAA TTGATGATGC ATTTGCACTT GATATGGCTA TGGGTGGTTC 4 500 

AACAAACACG GTACTGCATA CGTTAGC CAT TGCCAATGAA GCTGGTATTG ATTATGACTT 4560 

AGAGCGCATT AATG CTATTG CCAAACGCAC GCCATATTTA TCAAAAATAG CACCTAGTTC 4620 

ATCGTATTCA ATGCATGATG TGCATGAAGC TGGTGGCGTC CCAGCAATTA TTAATGAATT 4 680 

GATGAAGAAA GATGGCACGT TACACCCAGA TAGAATCACA GTTACTGGCA AAACGTTACG 4 74 0 

TGAAAATAAC GAAGGCAAAG AAATTAAGAA CTTTGATGTC ATTCACCCTC TTGATGCACC 4800

ATATGATGCA CAAGGCGGTT TATCTATCTT ATTTGGTAAT ATCGCCCCTA AAGGCG CAGT 4 860 

TATTAAAGTT GGCGGCGTTG ATCCATCTAT CAAAACATTT ACTGGGAAAG CAATTTGTTT 4 92 0 

CAATTCGCAT GATGAAGCTG TTGAAGCAAT AGACAATCGT ACCGTTCGTG CAGGCCACGT 4 980 

CGTTGTCATT AGATATGAAG GACCTAAAGG TGG AC CAGGT ATG CCTG AAA TGTTAGCACC 504 0 

TACTTCCTCT ATTGTTGGTC GCGGCTTAGG TAAAGATGTT GCATTAATTA CTGATGGGCG 5100

TTTTTCCGGT GCCACAAGAG GTATTGCAGT TGGTCATATT TCCCCTGAAG CTGCATCTGG 5160

TGGACCAATT GCCTTAATTG AAGATGGTGA TGAGATTACT ATTGATTTAA CAAATCGTAC 5220

ATTAAACGTA AACCAGCCTG AAGATGTTCT AGCGCGTCGC CGAGAATCTT TAACACCATT 5280

TAAAGCGAAA GTAAAAACAG GTTATCTAGC TCGTTATACT GCCCTAGTAA CTAGCG CAAA 534 0 

TACAGGTGGC GTCATGCAAG TCCCTGAGAA TTTAATTTAA TTTATTTTTA TATTGGAGAT 54 0 0 

GGTTAAAATG TCTAAAACTC AACATGAAGT AAACCAAAAT ATTGAC CCTT TAAAAATGGC 54 6 0 

TGAATCACTT G AAC CTG AAC AACTAAATGA AAAAACTTTA AATGATATGC GTTCAGGATC 5520 

AGAAGTGCTA GTAGAAGCTC TACTTAAAGA AAATGTGGAT TATTTATTCG GTTATCCTGG 5580 

TGGTGCCGTA CT AC CTTT AT ATGACACGTT TTATGATGGT AAAATCAAAC ATATTTTAGC 5640 

AAGACACGAA CAAGGTGCTG TTCATGCTGC AGAAGGTTAT GCACGTGTAT CTGGTAAamT 5700 

GGCGTCGTTG TAGTTACAAG CGGTCCaGGT G CAACT AATG TAATGACAGG TATTACGGAT 5760 

GCACATTGCG ACTCTTTACC TCTAGTTGTA TTCACTGGAC AAGTTGCTAC ACCAGGCATT 5820 

GGTAAAGATG CATTCCAAGA AG CGG AT ATT CT AT CT ATG A CTTCACCAAT TACAAAACAA 5880 

AATTATCAAG TGAAACGTGT TGAAGATATC CCTAAAATCG TACACGAAGC TTTCCATGTA 5940

GCTAATTCTG GACGCAAAGG TCCTGTAGTG ATTGATTTTC CAAAAGATAT GGGTGTTTTA 6 000 

GCTACAAATG TGGATTTATG CGACGAAATC AATATTCCAG GTTATGAAGT TGTTACAGAA 6060 

CCAGAAAATA AAGACATTGA CACTTTCATC TCACTTTTAA AAGAAGCGAA AAAGCCTGTC 6120 

GTATTAGCCG GCG CAGGT AT TAATCAATCA AAATCAAATC AATTATTAAC ACAGTTTGTT 6180 
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GATACACTAT TTTTAGGTAT GGGAGG AATG CATGGTTCTT ATGCTAGTAA CATGGCATTA 63 00 

ACTGAGTGTG ATTTACTCAT TAATTTAGGT AGCCGCTTCG ATGATAGATT AGCAAGCAAA 63 60 

CCTGATGCCT TTGCACCTAA CGCCAAAATT GTACATGTAG ATATTGATCC TTCAGAAATC 64 20 

AATAAAGTTA TTCATGTAGA TTTAGGTATT ATTGCAGACT GTAAAAGATT TTTAGAATGT 64 80 

TTAAATGATA AAAATGTTGA GACTATAGAA CACAGTGACT GGGTTAAACA TTGTCAAAAT 6 54 0 

AATAAGCAGA AACACCCATT TAAACTTGGT GAAGAAGATC AAGTATTTTG TAAGCCACAA 66 00 

CAAACAATCG AATATATCGG CAAAATTACA AATGGTGAAG CAATTGTTAC TACAGACGTG 66 60 

GGACAACATC AAATGTGGGC AGCTCAATTT TATCCATTTA AAAATCACGG ACAATGGGTT 6720 

ACAAGCGGTG GTTTAGGAAC AATGGGATTC GGTATTCCTT CGTCAATTGG TGCCAAATTA 6 780 

GCTAATCCTG ATAAAACAGT CGTATGTTTC GTCGGTGACG GTGGTTTCCA AATGACAAAC 6 84 0 

CAAGAAATGG CACTTTTACC CGAATATGGT TTAGATGTCA AAATCGTACT AATCAATAAT 6900

GGAACATTAG GTATGGTTAA ACAATGGCAA GATAAGTTCT TTAATCAACG CTTCTCACAC 6 960 

TCAGTATTTA ATGGTCAACC TGATTTTATG AAAATGGCAG AAGCATATGG CGTCAAAGGT 7020 

TTCTTAATCG ATAAGCCAGA ACAACTGGAA GAACAATTAG ATGCAGCGTT TGCTTATCAA 7 080 

GGAC CAGCTT TAATTGAGGT TCGTATTTCC CCTACTGAAG CTGTAACCCC AATGGTTCCG 714 0 

AGTGG CAAAT CAAATCATGA AATGGAGGGC TTATAATGAC AAGAATTCTT AAATTACAAG 7200 

TTGCGGATCA AGTCAGCACG CTAAATCGAA TTACAAGTGC TTTTGTTCGC CTACAATATA 7260 

ATATCGATAC ATTACATGTt ACACATTCTG AACAACCTGG GATTTCTAAC ATGGAAATTC 7320 

AAGTCGATAT TCAAGATGAT ACATCACTTC ATATATTAAT TAAAAAATTA AAACAACAAA 73 80 

TTAATGTTTT AACGGTTGAA TGCTACGACC TTGTTGATAA CGAAGCTTAA TTTTAAGACA 744 0 

AAGGfiAATGA TGCGCTAATT AGTTATAGAT ATATCATAGG CTGCTAGTTA ACATCTGCCA 7500 

CTATTACAAA GTTATATTTC AGAATTTTCG AAACACAAAA TATTTAATTA TTTGGAGGAA 7560

TTTATTATGA CAACAGTTTA TTATGATCAA GATGTAAAAA CGGACGCTTT ACAAGGCAAA 7620

AAAATTGCAG TAGTAGGTTA TGGATCACAA GGTCACGCGC ATGCACAAAA CTTAAAAGAC 7680

AATGGATATG ATGTAGTCAT CGGCATTCGC CCAGGTCGTT CTTTTGACAA AGCTAAAGAA 7740

GATGGATTTG ATGTGTTCCC TGTTGCAGAA GCAGTTAAGC AAGCTGATGT AATTATGGTG 7800

CTATTACCTG ATGAAATTCA AGGTGATGTA TACAAAAACG AAATTGAACC AAATTTAGAA 7860

AAACATAATG CGCTTGCATT TGCTCATGGC TTTAACATTC ATTTTGGTGT TATTCAACCA 7920 

CCAGCTGATG TTGATGTATT TTTAGTAGCT CCTAAAGGAC CGGGTCATTT AGTTAGACGT 7980 


CAAGCACGTA ATATTGCTTT AAGTTATGCA AAAGGTATTG GTGCAaCTCG TGCAGGTGTT 8100 

ATTGAAACAA CATTTAAAGA AGAAACTGAG ACAGATTTAT TTGGTGAACA AGCAGTACTT 8160 

TGCGGTGGTG TATCGAAATT AATTCAAAGT GGCTTTGAAA CATTAGTAGA AGCGGGTTAT 8220 

CAACCAGAAT TAGCTTATTT TGAAGTATTA CATGAAATGA AATTAATCGT TGATTTGATG 8280 

TATGAAGGCG GTATGGAAAA TGTACGTTAC TCAATTTCAA ATACTGCTGA ATTTGGTGAC 834 0 

TATGTTTCAG GACCACGTGT TATCACACCA GATGTTAAAG AAAATATGAA AGCTGTATTA 84 00 

ACTGATATCC AAAATGGTAA CTTCAGTAAT CGCTTTATCG AAGACAATAA AAATGGATTC 84 6 0 

AAAGAATTTT ATAAATTACG CGAAGAACAA CATGGTCATC AAATTGAAAA AGTTGGTCGT 8520 

GAATTACGCG AAATGATGCC TTTTATTAAA TCTAAAAGCA TTGAAAAATA AGATAGACCT 8580 

ACAATGAGGA GTTGTTAAAT ATGAGTAGTC ATATTCAAAT TTTTGATACG ACACTAAGAG 864 0 

ACGGTGaACA AACACCAGGA GTGAATTTTA CTTTTGATGA ACGCTTGCGT ATTGCATTGC 8700

AATTAGAAAA ATGGGGTGTA GATGTTATTG AAGCTGGATT TCCTGCTTCA AGTACAGGTA 8760

GCTTTAAATC TGTTCAAGCA ATTGCACAAA CATTAACAAC AACGGCTGTA TGTGGTTTAG 8820

CTAGATGTAA AAAATCTGAC ATCGATGCTG TATATGAAGC AACAAAAGAT GCAGCGAAgC 8880

CGGTcGTGCA TGTTTTTATA GCAACATCAC CTATTCATCT TGAACATAAA CTTAAAATGT 894 0 

CTCAAGAAGA CGTTTTAGCA TCTATTAAAG AACATGTCAC ATACGCGAAA CAATTATTTG 9000 

ACGTTGTTCA ATTTTCACCT GAAGATGCAA CGCGTACTGA ATT AC CATTC TTAGTGAAAT 9060 

GTGTACAAAC TGCCGTTGAC GCTGGAGCTA CAGTTATTAA TATTCCTGAT ACAGTCGGCT 912 0 

ACAGTTACCA TGATGAATAT GCACATATTT TCAAAACCTT AACAGAATCT GTAACATCTT 9180 

CAAATGAAAT TATTTATAGT GCTCATTGCC ATGACGATTT AGGAATGGCT GTTTCAAATA 924 0 
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GTTTAGCTGC AATTGAAGGC GGTGCGAGAC GAATTGAAGG CACTGTAAAT GGTATTGGTG 93 00 

AACGAGCAGG TAATGCAGCA CTTGAAGAAG TCGCGCTTGC ACTATACGTT CGAAATGATC 9360 

ATTATGGTGC TCAAACTGCT CTTAATCTCG AAGAAACTAA AAAAACATCG GATTTAATTT 9420 

CAAGATATGC AGGTATT CGA GTGCCTAGAA AT AAAG CAAT TGTTGGCCAA AATGCATTTA 94 80 

GTCATGAATC AGGTATTCAC CAAGATGGCG TATTAAAACA TCGTGAAACA TATGAAATTA 9540

TGACACCTCA ACTTGTTGGT GTAAGCACGA CTGAACTTCC ATTAGGAAAA TTATCTGGTA 9600

AACACGCCTT CTCAGAGAAG TTAAAAGCAT TAGGTTATGA CATTGATAAA GAAGCGCAAA 9660

TAGATTTATT TAAACAATTC AAGGCCATTG CGGACAAAAA GAAATCTGTT TCAGATAGAG 9720 

ATATTCATGC GATTATTCAA GGTTCTGAGC ATGAGCATCA AGCACTTTAT AAATTGGAAA 9780 
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AAGAGGGTCA 


X r\ XXX /Av»V~AvJ 


GATTCAAGTA 


TTGGTACTGG 


TTCAATCGTA 


GCAATTTACA 


9900 




ATG CAGTTG A 


x X rt. X x x x 


^* A Tt Tt Tt m Tt T4 

wVGAAAGAAA 


CAGAATTAAT 


TGATTATCGT 


ATTAATTCTG 


9960 


5 


TCACTGAAGG 




/"^ K »/^p m « Tt ft f-m 

UWVvjCAGAAG 


TACATGTAAA 


TTTATTGATT 


GAAGGTAAGA 


10020 




CTGTCAATGG 


^ X X X \7V? X4"V X X 


O K Tf* TV TP TV m -m 

uA I v_A TG AT A 


TTTTACAAGC 


CTCTTGTAAA 


G CAT ACGTAG 


10080 


10 


AAGCACATGC 


TAAATTTGPA 


rz/Tv a a a a »ty"» 


*1 Klip Tt P" Tt Tt rrt 

TTG AG AAGG T 


AGGTAATTAA 


TTATGACTTA 


10140 


TAACATTGTT 


GCCCTACCT("5 


Vj i\jA IVjVjAA I 


CGGTCCAGAA 


ATTTTGAACG 


GATCTCTATC 


10200 




ATTGCTTGAA 


ATTATAAfiTL 
« X X r\ X x\r\K3 X -f\ 


A 1 AAA I Al AA 


CTTTAATTAT 


CAAAT AG AG C 


ACCACGAATT 


10260 


IS 


TGGTGGTGCC 


TfTATTnATA 
A V— x/"V x luftin 


Uil 1LGGCUA 


GCCTTTAACT 


GAGAAAACCT 


TAAATGCGTG 


10320 




TAAAAGAG CA 


("2 2xTY^ fT 21 *T"T*T"» 


IACTGGGTGC 


AATCGGTGGA 


CCTAAATGGA 


CAGATCCTAA 


10380 




CAATCGACCA 


OA acta Rn/^ 7i t 

\3AA\— xviuuri X 


T* J\ 'I"!* it n It n fun 

XATTAAAATT 


GCGTAAATCC 


TTAAATTTAT 


TTGTAAATAT 


10440 


20 


ACGCCCCACT 


Av»\^\j 1 1VJ1 k_A 


J\ 7* /~»p* f~*r~* /""I* 7\ p^ 

AAviGLGCTAG 


TTCTTTATCA 


CCTTTAAAGG 


AAGAACGCGT 


10500 




TG AAGG CA PA 


O TV TTT 71 pmq 1 » 
\J A ill >\\J 1 1A 


TAGTCCGTGA 


ATTGACAAGT 


GGTATTTATT 


TTGGAGAACC 


10560 




T Ad APATTTT 




TV p» pi p«r<rn Tt vr tv 

AGGCCTTAGA 


TTCTCTTACT 


TATACAAGAG 


AAGAAATAGA 


10620 


25 


A CC5 CA TTfZTT 




TTAAATTGGC 


CGCTTCAAGA 


CGAGGAAAAC 


TAACATCAGT 


10680 




Td A T A A A f2 A L. 


A A TV"P Tt ' 1 " I ' T\ /-t 

AA 1 V» 1A1 1 Ats 


CTTCTAGTAA 


ATTGTGGCGC 


AAAGTCGTAA 


ATGAAGTAAG 


10740 


30 


T CAA TTAT AT 


V-v-AVjAAvj 1 AA 


/■** A /*"• T TV JV Tt rr\ »— 1 7* 
\-Avjr I AAATCA 


CTTATTTGTT 


GATGCTTGTA 


GTATGCATTT 


10800 


AATCACAAAT 


CP* 21 & L. Zi C 2S. A T* 


I 1GACGTCAT 


CGTATGTGAA 


AACTTATTTG 


GCGATATTTT 


10860 




AAGTGATGAA GCTTCAGTGA I CTGGTTC ACTTGGTTTA TCACCTTCTG CTAGTTTTAG 10920

TAACGATGGT CCAAGATTGT A I VjAGCCTAT TCATGGATCA GCACCAGATA TTGCAGGTAA 10980

AAACGTTGCC AATCCATTTG vjAATGATTCT ATCTTTAGCG ATGTGTTTAC GTGAAAGCTT 11040

AAATCAACCA GATGCTGCAG A 1 (j AATTAG A ACAACATATT TATAGCATGA TTGAACATGG 11100

GCAAACGACA GCAGATTTAG bu^CAAATT GAATACTACT GATATTTTCG AAATTCTATC 11160

TCAAAAATTG AATCACTAAG uUObAGATGT AAATGGGTCA AACATTATTT GACAAGGTGT 11220

GGAACAGACA TGTGTTATAC f~*f~*f' It Tt It rrwryio GCGAACCGCA ACTATTATAC ATTGATTTAC 11280

ACCTTATACA TGAAGTTACT TCTCCTCAAG CATTTGAAGG ACTTAGGCTT CAAAACAGAA 11340

AATTAAGACG CCCAGATTTA ACATTTGCAA CACTCGATCA CAATGTTCCT ACTATTGATA 11400

TATTCAATAT TAAAGATGAA ATTGCAAACA AACAAATCAC AACATTACAA AAAAACGCCA 11460

TAGATTTTGG GGTGCATATT TTTGATATGG GTTCTGATGA ACAAGGTATT GTTCACATGG 11520

TAGGACCTGA GACAGGACTT ACACAGCCTG GCAAGACAAT CGTTTGTGGT GACTCTCACA 11580

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ATGTTTTCGC AACTCAAACG CTATGGCAAA CAAAACCCAA AAACTTAAAA ATCGATATTA 11700 

ATGGTACCTT ACCAACAGGC GTCTATGCTA AGGACATTAT TCTGCATTTA ATTAAAACGT 11760 

ATGGTGTTGA CTTTGGTACA GGCTATGCTT TGGAATTTAC TGGCGAAACA ATTAAAAACC 11820 

TTTCAATGGA TGGTCGAATG ACTATTTGTA ACATGGCTAT CGAAGGTGGT GCCAAATACG 11880 

GCATAATCCA ACCTGATGAT ATAACATTTG AATATGTTAA AGGGAGACCA TTTGCCGATA 11940 

ACT t CGCTAA ATCAGTTGAT AAGTGGCGTG AgCTATATTC TGATGACGAC GCGATATTTG 1200 0 

ATCGTGTAAT TGAACTTGAT GTTTCAACAT TAGAACCACA AGTGACATGG GGAACTAATC 12060 

CTGAAATGGG TGTTAATTTC AGTGAACCAT TCCCTGAAAT CAATGATATC AACGATCAAC 12120 

GTGCGTATGA TTATATGGGG TTAGAACCAG GTCAAAAAGC TGAAGACATC GACTTAGGGT 12180

ATGTTTTTCT CGGTTCATGT ACAAATGCTA GACTATCAGA TTTGATTGAA GCTAGTCATA 1224 0 

TTGTTAAAGG AAATAAAGTT CATCCAAATA TTACAGCTAT TGTCGTACCA GGTTCTCGTA 12300 

CAGTAAAAAA AGAAGCAGAA AAATTAGGTC TAGATACTAT CTTTAAAAAT GCAGGATTTG 12360 

AATGGCGTGA ACCAGGATGT TCAATGTGTT TAGGCATGAA TCCTGACCAA GTACCTGAGG 12420 

GCGTACATTG TGCATCTACA AGTAATCGAA ACTTTGAAGG ACGACAAGGC AAAGGTGCAA 124 80 

GAACACATTT AGTATCCCcT GCTATGGCAG CAGCAGCAGC TATTCATGGT AAATTTGTGG 1254 0 

ACGTAAGAAA GGTGGTTGTT TAAATGGCAG CAATCAAACC TATTACAACA TATAAAGGTA 12 6 00 

AAATAGTCCC TCTCTTCAAC GACAATATCG ATACAGACCA AATCATTCCT AAGGTACACT 12660 

TAAAGCGTAT TTCAAAAAGT GGCTTTGGTC CATTTGCTTT TGATGAATGG CGGTACTTAC 12720 

CTGATGGTTC AGATAATCCT GATTTCAATC CTAACAAACC ACAATATAAA GGGGCTTCTA 12780 

TTTTAATTAC TGGAGATAAT TTTGGATGTG GTTCAAGTCG TGAACATGCT GCTTGGGCTC 12840 

TTAAGGACTA TGGTTTTCAT ATTATTATTG CA<3GAAGT^~CAGTGACATA TTTTATATGA 12900

ATTGCACTAA AAATGCGATG TTGCCTATCG TTTTAGAAAA AAGTGCCCGT GAACATCTTG 12960 

CACAATATGT TGAAATTGAG GTCGATTTAC CAAATCAAAC TGTGTCATCA CCAGACAAGC 13020 

GTTTCCATTT TGAAATTGAT GAAACTTGGA AGAATAAACT TGTAAATGGC TTAGATGACA 13 080 

TTGCAATCAC C CT ACAAT AT GAATCATTAA TAGAAAAATA TGAAAAATCa CTTTAAGGGA 13140 

GTTGAATATT ATGACAGTCA AAACAACAGT TTCTACGAAA GATATCGATG AGGCATTTTT 13200 

AAGACTTAAA GATATTGTCA AAGAAACACC TTTACAATTA GACCATTACT TATCTCAAAA 13260 

GTATGATTGT AAAGT CTATT TAAAACGAGA AGATTTACAA TGGGTACGTT CTTTTAAATT 13320 

AAGAGGTGCT TACAACGCTA TTTCTGTTTT ATCAGATGAA GCTAAAAGTA AAGGTATTAC 13380 
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AAACGCTGTT 


ATCTTTATGC 


CAGTCACTAC 


ACCTTTACAA 


AAGGTAAATC 


AAGTAAAGTT 


13 500 




CTTTGGAAAT AGTAACGTTG AAGTTGTACT CACTGGTGAT ACATTTGATC ACTGTTT AH P 13560

TGAAGCTTTA ACTTATACAA GTGAACATCA AATGAACTTT ATAGATCCAT TCAATAATGT 13620

TCATACAATT TCTGGACAAG GTACGCTTGC TAAAGAAATG CTAGAACAAG CAAAGTCTC5A 13680

CAATGTTAAC TTTGATTATC TATTTGCCGC AATTGGTGGT GGCGGTTTAA TTTCAGGTAT 13740

TAGTACTTAC TTTAAAACCT ATTCACCTAC CACGAAAATT ATAGGTGTTG 13800

TGCAAGTAGT ATGTATGAAT CTGTTGTGGT AAATAATCAG GTAGTCACAT TGCCTAATAT 13860

CGATAAATTT GTGGACGGTG CATCTGTAGC TAGAGTTGGC GATATTACAT TTGAAAT'TYSr' 13920

AAAAGAAAAT GTAGATGATT ACGTTCAAGT AGATGAAGGT GCAGTTTGTT N — x *»««-v7x\ x X x X 13980

AGATATGTAT TCAAAACAAG CAATTGTAGC AGAACCTGCT GGCGCATTAA GTfJTA AHTnr 14040

GCTTGAAAAC TATAAAGATC ATATTAAAGG TAAAACAGTG GTTTGTGTCA X X AO X VJVj X VJVJ 14100

TAATAATGAT ATTAATCGAA TGAAAGAAAT TGAAGAACGT TCATTACTAT apha acia a &t 14160

GAAGCATTAC TTTATCTTAA ATTTCCCTCA ACGTCCAGGT GCATTGAGAG 7\ TV 'I'M-l'f^T* J TV J\ 14220

TGACGTATTA GGACCTCAAG ACGATATTAC TAAATTTGAA TACTTAAAAA *v\x v, x x v_ x \_rt. 14280

AAATACAGGT ACTGTCATTA TTGGTATTCA ACTTAAAGAT CATGATGATT T* A A T 21 P A A PT 14340

CAAACAACGT GTAAAtCATT TCGATCCTTC CAATATTTAT ATTAATGAAA « X X vj X 1 14400

ATATTCATTG TTAATTTAAC ACATAGTAAG AAAAACAGTC ATAAATTGAT X A \_ X /A/A X X Uln 14460

AATCATCTTA TGACTGCTTT TTATTATACT TTACATTTCT CGTTTCGTCA G ATTP A A A PP 14520

TTTTCACTTC GCCAAGCCAT CTTTCTTTGT GTTTGCTTTT aTTTTGACGT TTTAGAPATA 14580

AAAAAaGAGA CCTTGCGGTC TCAATGCGGC TCATCGCATC CACTTTTTGC CTGG CAA PP^T 14640

TCTACTCTAG CGGAACGTAA GTTCGaCTAC CATCGACGCT AAGGAGCTTA ACTTf 'PTY^TnT 14700

TCGGCATGGG AACAGGTGTG ACCTCCTTGC TATAGTCACC AGACATATGA ATGTAATTTA 14760

TACATTCAAA ACTAGATAGT AAGTAAAAGT GATTTTGCTT CGCAAAACAT TTATTTTP*AT 14820

TAAGTCTTCG ATCGATTAGT ATTCGTCAGC TCCACATGTC ACCATGCTTC CACCTCGAAP 14880

CTATTAACCT CATCATCTTT GAGGGATCTT ATAACCGAAG TTGGGAAATC TPATPTTP.AP. 14940

GGGGGCTTCA TGCTTAGATG CTTTCAGCAC x 1ATCCCGTC CACACATAGC TACCCAGCTA 15000

TGCCGTTGGC ACGACAACTG GTACACCAGA GGTATGTCCA TCCCGGTCCT CTCGTACTAA 15060

GGACAGCTCC TCTCAAATTT CCTACGCCCA CGACGGATAG GGACCGAACT GTCTCACGAC 15120

GTTCTGAACC CAGCTCGCGT ACCGCTTTaA TGGGCGAACA GCCCAACCCT TGGGACCGAC 15180

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(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GTGGCAATAT TTCTAGTTCT CGTTTTGATA AGATTTTAAA AGGATCTGTT GTGTTTGCAG 60 

TGTCCTGATT TGAATTAGAT ACAAATTCAT TCACTAAAGA TGTTGTAAGT TTCATATCTA 120 

CATATGTTTC ACCTTTATAT ACAGTTCGAA TAGCTAACAA TAATTGTTCA TCAGGTG CAT 180 

TTTTCAATAT GTAACCTTTC GCACCATTAC GCAACACATG GAACAAATAC TCCTCATCAT 24 0 

CAAACATTGT TAATATTAGT ATTTTAGTTT CAGGAAAACT GTCAGCAATT TTACTCGTAG 300 

CGATAAGACC TGACTCACCT GGTGG CAT AC TTAAATCCAT TAGTAACACA TCAGGTTTAt 3 60 

ATTCCATTAC TTTTTGGTAA GCTTCGACGC CATCTGCAGC CGTTGCAACA ACTTCCATAT 420

CATTTTGATA ATTTAAAATC ATAGAGAACC CCGTACGGAC AACAGCGTGA TCATCGGCAA 4 80 

TGACTATTTT CAATTTTATT CCCCCAATGT ATGTTTCAAA TTGGAATGTT CAATGTAACA 54 0 

TTGGTACCCT CACCAATTTT CGTTTCAATA TTGACGCTAC CGCTGACTAA CTCAGCTCGC 6 00 

TCATTCATTC CAT AT AAAC C GAGTCCAGAA CCTTTAGGCT TAGAACTTGG ATCAAAACCA 6 60 

TTTCCCGCAT CTATCACTTC TGCTACCAAA TGGCGCCCAG TTTGACGGAT ACCTACATTT 720 

ATTTCATTTA CATCAGCGTA TTTCAACGCA TTTAAAATAG CTTCTTGCAC TACTCGATAA 780 

ACAACCGTTT CAATATCACT ATCAAAGCGA GTATTTTTAA TATTTGATGT ATATATGATT 840

TTTATTCCAT AATTTTCTTC AAACTGTTTA AAATATGATT TAAAAGCTGC TTCAAGGCCT 900 

AGATCATCCA AAGAAGCGGG TCTTAATTCA ACCGACATAT TACGTATATC ATCAATTAAT 960 

TTAGCGACAA TATATTCAAT ATTTTCTGCG TCTTCCAAAA GCTTAGTTGT ATCTTCTTGA 1020 

TATTTTAATA ATCTCAATTG AACATCTACA TTGAGCATTT CTTGAATCAC ACTATCATGT 1080

AACTCTCTAG AAATTCGCTT TCTTTCATTT TCTTGGGCTG AGATTGTTTT ACGCATCATA 1140

CGTTGTTGAT GCAATTTCTC TTGCTGTTCA ATTTGTGATG AAACATTTTG AAGCGTAAAT 1200

GCATGAATTC CCCTGTCTTG ATCAATCAAC TGATATGTTG CTGTAAATGG CATCACTTTT 1260

TGATCTTTCG TCTTCATAAA TACTTGGAAA TTCGTAGCTT GTACTTGCAT CGATTCTAAG 13 20 
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ATCGCATTCG CCACAGCACT GTAATTATCT TCTTCAGATA AT AT AT CTTT AG CAG CAT CA 1440 

TTCATTGCAA TAATTTTACC GTTATCATCA GCAAAAACTA TCTTTTCGAT TGAATGCTCA 15 00 

TAATATTTTT TCAATAAAGT ATCTAACTGT ATACTGTCCT CATTAATCAT GACTTACACC 1560 

CTAATTCATC TCATTATTTA T CAT CATTG A AAATACCAAA CTTACGTTGA ATATCATCAT 1620 

TATCAAATAT TTTTGGTAAA GGACGACCAT CTCTTTGACC AAATAATAGT ACGCCATACA 16 80 

CTTGATTCTT ATACCAAAGC GGCACTGCTA AAACTGCTGT TAATGATTCG CTCAATAAAA 174 0 

TTGGATAGTC AATCTTTTCT TCAGGCCCTA AAGCTAAACC AACATTGGCT ATT AC CAT AC 18 00 

GCTTTCCTGT TTTCATAACA GTTCCAGCTA ATCCACGACC TTTTCTTAAA ATAATCAATT 186 0 

TAAATCGATT ATTTTTATTA CCTGAAACAT AGTGCCATTT TATTGGAGAT GATGGTTTGT 1920 

TAGATTCATA GAAAG CGATT GCCGCAAAAT CATAACCCTC TTCTTTGCGT ATTTT AT CT A 1980 

ATGTCTCTTG AAATCTACGA TCTTCAATTA TTGCTTCTGG TGTCAAATCC TTTCACCTCT 204 0 

TATGCTTACA CTTTATTCTT ACGGTAAATA ATATATCTGC GATTTATATA TGTCAAAGGT 2100 

ACACTCCAAA CATG CACCAA ACGTGTAAAT GGCCAACAAG CCATAATAGT GAAACCTAAC 216 0 

AATATATGCA TTTTAAATGC AATCGGCACA CCACTCATCA ATGACGCATC TGGTTTTAAC 2220

ATAAATAATT GTCTAAACCA AATTGATAAT GAAGTTCTGT AGTTAAAGTC TGGATGTTGT 22 80 

ATATTTGTTA CTAATGTTGC GTAACATCCC ATAAATACGA TAAGTAATAA TAAGAAATTT 2 34 0 

ACAAATATAT CCGACGCTGA ACTTAATCTT CGAATACTTT TCGTAGTAAC ACGTCTCGCT 24 00 

GTTAATAAAA ACATCCCTAT CAAAGTTATT ATACCAAAGA TGCTACCAAT ATAAACAGCG 24 6 0 

CCTATATGAT ATAAATGCTC AGACACACCC ACTGCATCCA TCCATGGTTT CGGTATTAAC 2 52 0 

AATCCAACTA CGTGTCCAAA AAACACTGGA ATAATACCTA AGTGAAATAA TAAACTTCCC 2580 

CACATCAACC TTTTTCTTTC TATTAATTCA CTAGATTTAG CTGTCCAAGA AAATTTATCA 2640 

TAACGATAAC GTGCAATATG ACCTGCGACA AAGACAACTA AACATAAATA CGGAAATATA 2700 

ACCCATAAAA ACTGATTAAG CATGATGTTT CACTCCTTTT GGTGATGTCA AACATAATTT 2760 

CAATGTTTTT CTAAGTGCTT GAATCACATA GG CAT ATGGA TTGTTATCTT CACCAAGTGC 2820 

ATTCGCCATC ACATATGTTC CATCCTCAAT AATCATAATG ATTAATTGAA TATTCTCTTC 2880

AGCTCTTGGA TCATTTCGCC ATTCTGCCAC TTGCAAAAAT TGAAGCATCA ACGGTAGATA 2940

ATCAGAAAGT TCATTATCTA CCATTTCTAG TCCAAACATT TCATATAATA CCTTTAATTT 3000

AGCTAACATT TGCCCACGTT CTTTTTGCGT ATCAAATTTG TTATACGTCA TATATAATGG 3060

TGCTTTTTTC GTAAAATCAA ATGTATCTGT ATAAATCGCT TTGATTTCTG ATAATGAAAA 3120 
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TGTTTCTTCA AAAGTTTTTG GATGAAAAGT TAATTTTTCT GGAAAACATA ACTGTTGTGC 324 0 

CATATATCCA AAACTTTCTT GATATTTTTT AAAATTATOG AAATTAATCA CGGAAAATCC 3 300 

CTCCATAGAA ATTCTCATTA TAAATTTCTT GACCAGTTTT CCCTGAACCT ACTGCAACGC 3 360 

CACAGCCTTC ACAGTTATCT CCAAAATGCT CGCCGCCGTA ATTGTATCCT GTACTAC CTT 3420 

GTGCGTGATA CGTATCTAAA TAGGTTTCTT TGTGTGATGT TGGAATAACA AATCG AT CTT 34 80 

CATATTTGGC TAGTCCTAAT AAACGATACA TGTCTTTAGT TTGGCGCTCG GTTATACCTA 3 54 0 

ATCGCTCTAA TCG AG A CGTG TCAAATGGCT GTTGAGTAAC TTGAGATCTC ATATAACTTC 3 6 00 

TCATCATTGC CATACGTTGT AGGGCTCCTT TTACTGGCTC TGTATCTCCT GCAGTGAAAA 3 660 

TATTAGCTAA GTATTCAATA GGTAAACGCA TTTCTTCAAT GGCTGGGAAA ATCGCATCTG 372 0 

GATTTTGAGT TGTATTTTTA CCTTCAAAAT AGCTCATAAT TGGGCTAAGT GGTGGGCAAT 3780 

ACCAAACCAT CGGCATCGTT CTAAATTCAG GATGTAACGG AAATGCAAGT TTATATTCAA 3840

TTGCTAACTT ATAAATTGGA GAGTTTTGTG CAGCTTCAAT CCAATCGTAA CCAATACCAT 3 900 

CTTTTTCAGC TTGAGCAATG ACTTCTTCGT CAAATGGGTT TAAGAATATA TCTAATTGTT 3 96 0 

TTTCATATAA ATCTTTCTCG TCTACTGCTG AAGCTGCTTC ATGAACTCGA TCTGCATCAT 4 020 

ATAATAAAAC ACCTAAGTAA CGCATACGTC CTGTACAAGT TTCAGAGCAT ACCGTAGGCA 4 08 0 

TACCCGCCTC GATT CTCGGG AAACAGAAAG TACACTTTTC AGCTTTGTTC GTTTTC CAAT 414 0 

TGAAGTAAAC TTTCTTATAT GGACAACCTG TCATACAGTA ACGCCATCCA CGACATGCGT 4 2 00 

CTTGGTCAAC TAATACAATG CCATCTTCAT CACGTTTATA CAT AG CAC CT GAAGGACACG 4 260 

ATGCAACGCA ACTTGG ATT C AAG CAATGTT CACATAAACG TGGTAAATAC ATCATAAAAG 4 32 0 

TTTCGTCAAA TTGGAATTTA ATATCTTCTT CTATTTTTTG GATGTTAGGA TCTTTTGGAC 4380

CTCT^CATG ACCACCTGCT AAGTCATCTT CC^GTTAGG TCCCCATTCA ATTTCAATGT 4440

TATCCCCCGT AATTTCTGAA TACGCTCTAG CAACTGGCGA ATGCTTCCCT GATTTCGCAG 4500

TTGTTAAATG TTCATAATTA TAGTTCCATG GCTCATAATA ATCTTTAATT AATGGCATAT 4560

CTGGGTTATA AAAAATTTTA CCTAAAGCAA TTTTTGAAAT TCTACTTCCA GATTTTAATT 4620

CAAGTTTCCC TTTACGATTT AGTACCCAAC CACCTTTGTA GTGTTCTTGG TCTTCCCAAC 4680

GTTTCGGATA CCCTACACCT GGCt TCGTTT CTACGTTGTT GAACCACATG T ACTCAG CAC 4 740 

CTGGACGATT TGTCCaAGTG TTTTTACATG TCACACTACA CGT ATGG CAT CCTATGCATT 4 800 

TATCTAAATT TAATACCATC GCAAcTTGCG CTTTAATCTT CAAGCCAATT AACCTCCTTC 4 86 0 

ATCTTTCTAA CTGCTACATA TAAATC C CTT TGGTTCCCAA TTGGTCCATA ATAATTAAAG 4 920 
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GGCGCGTTGT 


GTGAACCACC 


ACGTGTATCT 


GTAATTTCTG 


ACCCAGG CGT 


TTGAATATGT 


5040 




TTATCTTGTG CATGATACAT AAACATTGTA CCTTTAGGCA TACGATGCGA AATAACTGCT 5100

CTTGCCGTTA CAACACCATT ACGGTTATAC ACTTCTAGCC AATCATTATC TTGGATATCG 5160

TGTTTTTCAG CATCTTCATT TGATATCCAA ACCGTTGGAC CACCTCTAAA TAGTGTCAAC 5220

ATATGCTTAT TATCTTGATA CATTGAGTGT ATATTCCATT TTCCATGAGG CGTTAAATAA 5280

CGCAgTACCA AAGCATCTGT ACCACCTTTA ATTTTCTTAT CTCTATTCCC AAATACCATT 5340

GGCGGCAATG TCGGTTTATA TACTGGTAAG CTCTCCCCAA ATTGTTGGAA AACTTCGTGA 5400

TCCACATAAT AACTTTGACG TCCTGTTAAT GTTCTAAAAG GTACTAGACG TTCTATATTC 5460

GTTGTAAATG GTGAATATCG TCGACCTTGT TTATTTGAAC CTGGGAATAC TGCTGTCGGT 5520

ATTACTTCTC GTGGTTGTGA AGTTATATTT AAAAACGAAA TTTTCTCAGC AGCGCGTTCG 5580

CTAGAAATAT CTTTTAACGG CATTCCAGTT TGTTCTTCGA GATCTTCATA TGATTTTTGT 5640

GATAATTTAC CATTCGTAGC AGATGAAATA CTTAGTATTG CATCAGCTAC ATTACGTGCT 5700

GTATCAATAC GTGGACGATT CGCTCTCACA GAATCATCAT TTGTATCACT CCACGTACCT 5760

AACATACTTT TTAATTCTTC ATATTGTTCA CTGACACCGA AACTTACACC ATGTGCTCCA 5820

ACTTTCCCTT TTTCAAGTAC AGGACCAAGC GTGACATATT TGTCGTAAAT TTTAGTGTAG 5880

TCGCGTTCTA CAATTGCAAA GTTAGGCATT GTACGTCCAG GTACCGCTTC AATTTCACCC 5940

TTCGACCAAT CTTTCACTAC GCCGTATGGT GTTGAAATTT CTTGCTTTGT ATCATGACTA 6000

AGTGGAGTTG TCACAACATC TTTAAACGTT CCAGGTAAAT AGTCTTITGC CATTTCTGAA 6060

AATGCTTTTG CCAACGTTTT ATAAATATCC CAGTCTGAAC GCGATTCCCA TAACGGATCA 6120

ATGGCAGGAT TGAAAGGATG TACATATGGA TGCATATCCG TTGATGATAA ATCATGTTTT 6180

TCATACCAAG TCGCTGCCGG CAAAACAATG TCAGAATATA ACGGTGTTGC CGTCATTCTG 6240

AAGTCTAAAG AGACCACTAA ATCTAACTTA CCTGTTGTTT CTTCACGCCA CGTAATTTCT 6300

TCTGGCTTTT CATCTTCATT TGGTGTAGCT AATAACCCTG ATTTTGTGCC AAGTAAATGC 6360

TTCATAAAGT ATTCTTGACC TTTTGCAGAA CTTGAAATTA AGTTTGAACG CCATATAAAT 6420

AATGATTTTG GATGATTCTT TTTCAAATCA GGATCTTCTA TTGCAAATTG TGTTTGTTTT 6480

GATTTCACTT CATCAATTGC ACGTTGCAAA ATCGCTTCAT TTGAATCTAT ACCTTCATCT 6540

TTAGCTTCTT CTGCAAACAA CAAACTATTT TTATTAAATT GTGGATATGA TGGTAACCAA 6600

CGAAGTCTAG CTGCTAAAAC ATTATAATCA GCTGGATGTT GATGCTTTAA CTCCTCTGTT 6660

TTAGCTAATG GAGATTTTAA ACGATCTACA TTTGACTCTT CATATTTCCA TTGGTCTGTT 6720

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AATGCGACAG TACTCCATCC TTCAATCGGA CGACATTTTT CTTGTCCCAC ATAGTG AG CC 6 84 0 

CAACCGCCAC CATTCACACC TTGACAGCCA CATAACATAA CTAAGTTTAA GATTGAACGA 6 900 

TAAATCGTAT CTGAGTTAAA CCAATGGTTA ATACCCGCAC CCATGATAAT CATTGAACGC 6 960 

CCTTCAGTAT CGATAGCGTT TTGCGCAAAT TCTTTCGCTA CTTGAATGAC AACACTTTGT 7 020 

TTTACGCCTG AAATGGCTTC TTGCCAAGCA GGTGTATATT TTGATTCTGC ATCGTCGTAT 7080 

CCTTTTGATT CTAATTTATG ATCAAAACGA CGCACGCCAT ATTGACTTGC CATTAAGTCA 714 0 

AAAATTGTAG CAATACGGAC TTTGTCACCA TTTGCTAAAG TGACTTGTCG AGTTGGAATT 7200 

GGACGATTGA ATATCCCATC TCCATCACTA TCAAAGTATG GGAATTGAAT TGTTTCTAAT 7260 

TCGTATCCAC CTTCTGTCAT TGATAATGTA GGGTTAATTT TAG AAC CATC TTCTGTTTCT 7320 

AGTTTTAAGT TCCACTTCTT ACCTTCTTCC CAACGTTGAC CCATTGTGCC ATTAGGTACT 73 8 0 

ACTAAACTAT CGCTGATTGC ATCATGAATA ACTGGCTTCC ATTCGCCTTG CTCTGTTGTT 7440

TGACCTAAGT CACTCGCTCT TAAAAATCGA CCCGCTTTAT ATCCATTTTC ATCTTCATCC 7500

AGCATGATAA GAAACGGCAT ATCTGTATAT TGTTTAGCGT AATTTATAAA GCGTTCATTA 7560

GGTTGATTAA CATAATGTTC TTGTAAAATA ACATGCGTCA TTGCTTGTGC AATTGCAGCA 7620

TCTGAACCAG GATTCGGTGC TAG CCAGTTA TCTGCAAATT TCACATTTTC TGCGTAATCT 76 80 

GGTGCTACTG AAATGACTTT TGTACCTTTA TAGCGGACTT CAGTCATAAA ATGTGCATCC 774 0 

GGAGTACGTG TTAAAGGTAC ATTAGAGCCC CACATAATAA TGTATGATGC GTTATACCAG 7 8 00 

TCACTTGATT CAGGCACATC TGTTTGCTCT CCCCAAATTT GTGGAGAGGC AGGTGGTAAA 7 860 

TCTGCATACC AGTCATAAAA ACT AAG CATT TCACCACCAA GCAAATTGAT GAATCG AG CA 7920 

CCTGCTGCAT AACTAATCAT TGACATCGCT GGAATAGGTG TAAATCCTGC G ATTCG AT CT 7980 

GGAC<^TATT 8040 

40 CAATTTGAAC GCACGTGCCC TCCCATACCT CGGGCTTGCT TATATTGTTT GGCTTTGTCT 8100 

TCATTTTCAA CAATAGACGC CCATGCAGCA . ACGCGATTAC CATT G TTTTC TTCTAATGCT 8160 

TCAGTCCATA AATCCCAGAG TTTTCCACGA ATATATGGAT ATTTGATTCG AAGCGGACTG 8220 

45 TATTCATACC AAGAGAATGA CGCACCTCGT GGACATCCTC TCGGTTCATA TTCAGGCATA 82 80 

TCCGGACCAC AACTTGGATA GTCAGTTTGT TGATTTTC CC AGGTAATCAC ACCATTTTTC 834 0 

ACAAATACTT TCCAAGAACA TGAGCCTGTA CAGTTAACAC CATGTGTTGT TCTTACTTCT 84 00 

TTATCGTGGC TCCAACGTTC TCTGTACATT TTTTCCCATT CTCTACTTTT ACTTTCTAGG 84 60 

ATCGACCAAT TCC CATT AAA TTTTTCTGTT GG CTTAAAGA AATTCAATCC AAATTTTCCC 8520 
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TAAAATGCCC AAGACTATTG CTTTAATTAG ATTGTACATT TTTTCACAAA CATAAAATAT 8640

TAGGGAATCA CCTAATTACT TAAGGAATTT CCCTATCAAT AACGGGATTT CATTGAAATA 8700

ATACACAATC ATGTATGGTC ATGCTTATTG CCAATCTAAA TCGTTCAAAT TTGGCACAAC 8760

GACAAATAAG GCTTCAACAC GAATATATTC TCTCGGTTGA AACCTTACTT ATTCATTTAT 8820

TTTTTATAAA TTAGTGACAT AACACTGTAT TAGCATCTGC ACGATCGGTT GAAATATATG 8880

TTACATTTTC TTGCTGCTTA ATAAATGCAT CATAGTAATC ATATTGCGAC GAATGATATG 8940

TGCCATTCGA TGTATCATTT GGGTTTAGCA AACAGCCATA ACCTTCGTCA TATAAATGTT 9000

CACAGAGCAT AAGGGCGTCA TGTTTAGAAC CACTTACTAC ATAAAATTGC TTCATAGGAT 9060

CATATGATTT AGGAGTGTTT TCAGTATAAT CAACAACTTC CCCTATAATA CATATACCTG 9120

GTTTCGCCTC AATTGAATAG TGTTGCAATT TTGAAATAAT ATTACTTAAA CGCCCCTTAA 9180

CAACAAACTC GTTAAAACAC GATGCTTGAA AGACAATCGC TATCGGGTAA TCAATATCTG 9240

TGTATTGTTG TATCTGTGTG ATAATTTTCC CTAAACGTTT TACCCCCATA TAAATTGCTA 9300

ACGTGCCACC ATTCACTAAG GAATTGACAT CCACTTCATT TTCTTCTGAA TCTTTAAAGT 9360

GACCTGTAGA AAATGTCACA CTTTTAGCAA CTGTACGCAT TGTCAAACCT GTCTGCATAG 9420

TAGCAACTGc tGCGCTCGCT GATGTCACCC CTGGTACAAT TTCAAACGCA ATATGATGTT 9480

CATTTAGTAT GTCGACTTCT TCTTGCACAC GACCAAATAT CGCTGGATCG CCACCTTTAA 9540

GTCTAACAAC CTTGTTATAT CGACGCGCTG CTTCCACGAT ACAGTCATTT ATTTTTTCTT 9600

GCTGAATATG TTTTGCATAC GGCTTTTTAC CAACATCGAT AATTTCAGTA GTCAAATTCG 9660

CATATTGTAA AATTAACGGA TTCACTAATC GATCATATAG AATGACATCC gCTTCACGTA 9720

TTAAACGCTC AGCCTTTTTC GTCAAATAAT TCGGATTACC TGGACCCGCA CCTATCAAGT 9780

AAACCTTGCC ATATTCCTCT ACAGACATAT ATATACGTTC CCGTCTGTAA CTTCTACCTC 9840

ATAAACATCT ACACAACCTT CATCAGGTTC TTGAACAATA CCTGTATTTA AATCAATTTT 9900

TTGATCGTGG AGCGGGCAAA ATACATATTC CCCACTCACT GTCCCTTCAG ACAATGGTCC 9960

TTGTTTGTGT GGACAGATAT TGTGAATCGC ATGAATTTTG CCACTTTCTG TTAAAAACAA 10020

CCCTACCTCT TTGCCTTTGA CAATAACCTT TTTTCCAATT AGGGGTGTTA ATTCATCTAT 10080

AGTTGTCACT TTAATTTTTT CTTTTGTTTC CATGTATTAC ACCTTCTCCA CTTCAAAAAT 10140

TCTACGTGCT TGAGCATTGC TAGTTATTGC TTCCCAAGGT TCAGCTTCGA CTGCTTTTTT 10200

AGCATCCATA ATGCGTTCAA ATAGTTCATT TTGTCTTTCT GGGTCAAGTA AGACTTCTTT 10260

TACATTTTCA AATCCAAGTC TTCTTAACCA TGGCGCTGTT CTTTCAGCAT ATATACCTGT 10320
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AGTTGTTAAA AATTCAGCTT TTTCAACTTC TGTACCACCA TTACCACCGA TATAGATTTG 10440

GAATCCATTT TCAACTGAGA TAATACCAAA ATCTTTAACA CCTGATTCAA CACAACTTCT 10500

TGGGCAGCCT GATACACCCA TTTTGAATTT ATGAGGTGTA TCGATGTATT CAAATGTTTT 10560

TTCTAAACGA ATGCCAAGTC GTGTCGTGTA TTGCGTACCA AATCGACAAA ACTCTTTACC 10620

AACACAGCTT TTAACTGAGC GTGTTTTCTT ACCATAAGCT GATGcTGAAC GCATACCTAG 10680

GTCTTCCCAT ATATTTGGTA ATTCTTCTTT TTTAACTCCA TACAAACCAA CACGTTGTGA 10740

ACCTGTCACT TTAACTAGTG GCACATGATA TTTCTTAGCC ACTTCTCCTA GACGAATCAG 10800

TTGGTCTGCA TCTGTAACAC CCCCACGCAT TTGAGGTATA ACAGAAAATG TACCATCATT 10860

TTGAATATTC GCATGGTAAC GTTCGTTAGC AAATCTTGAT TCTCTTTCAT CTTCATGATC 10920

ATGTGGATAA ACCATGTTTA AATAATAGTT GATTGCTGGT CGACATTTTG GACATCCACC 10980

TTTATTTTTA AAGTTTAAAA CATGTCGAAC TTCTTTAGAT GTTTTTAAAC CTTTCGCTCT 11040

TATTTGCGTT ACTATTTGAT CGCGTGTCAA ATCAGTACAA CCACATATAC CAGCAGGTTT 11100

TGCGGCAACA AAGTCATCTC CTAAGGTGTG CTGCAATATT TGAGCAATTT GCGGTTTACA 11160

TTTACCACAT GAATTCCCCG CTTTTGTTTT AGCCGTTACT TCTTCAACTG TTGTAAAGCC 11220

ATTTTCCGTA ATCGCATTTA CTATAGTACC TTTATCAACA CCATTACAAC CACAAATTGT 11280

TTCATCATCA GCCATATCAG CAATTGATAG CGATGCCTCT TCTCCACCTT TAGTAAGCAA 11340

TGATACAAGT GTGTAATCTT CAGTGGATTC ACCTTTTTTC ATCATGTTAT AAAAGCGTGA 11400

ACCATCATCG ATATCACCAT ATAGTACTGC ACCAACTACA TTACCGTCTT TTAAAAAGAT 11460

TTTTTTATAG TTATTATCAA CACTATTAAA TATTTCAATA CCTTTAATTT CTGCATTTTC 11520

TACAATTTGA CCAGCACTAT ACAAGTCACA CCCAGAAACT TTTAATGACG TAAATGTTGT 11580

TGATCCCTTG TATCCGTTCG TTTCTTTATT TGTTAAATGAT TCAGCTAATA CTTTACCTTG 11640

TTCATATAGT GGTGCAACGA GTCCATAAAC TTTGCCGTTA TGTTCTGCAC ATTCACCAAC 11700

TGCATATACA TTGCTATCAC TTGTTTGCAT CACATCATTG ACAACAATAC CACGATTAAC 11760

ATCTAGACCT GATTCTTTGG CTACTTCTGT GTATGGTCGT ATACCTACTG CCATAACAAC 11820

TAAGTCTGCC GGAATCTCGC GTCCATCAGC CAATTTAACA CCCTCAACAT CATCTTCTCC 11880

TAAGATTTCA GTTGTGTTGG CTTGCATTTC AAACTTCATA CCTTGCTTTT CTAGATCTGC 11940

TTTAAGCATA TTTCCAGCTT TACGGTCTAG TTGCATTTCC ATCAACCATT CAGCTAAATG 12000

TAACACCGTT ACTTCCATAC CTTGATCTAA TAAACCACGT GCACACTCTA AACCTAGTAA 12060

TCCTCCACCA ATTACAATTG CTTTCTTTTT AGTCTTAGCA ATGTTCATCA TTTGTTCAGT 12120
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GAATGCTTTA GAACCTGTCG CAAAAATCAA TTTATCGTAT GATACTTCAA TACCATTTGC 12240 

AGTAGTAACT GATTGATTTG CTCTATCTAC TTCAATTACA GGATCATTTG TAATTAACTC 12300 

GATACCATGT TCCTCATACC ACTCATATGG ATTCATAATT GTTTCTTCAA CTGTCATTTT 123 60 

ATTTTGTAAA ATATTTGAAA GCATGATGCG GTTATAGTTT GGATAAGGTT CTTTACCTAT 124 20 

TACCGTAATA TCATATAAAT CGTTGGCGCG CTCTAATATT TCTTCGATTG TTCGAATGCC 12480 

CGCCATACCG TTACCAATCA TTACTAGTTT TTGCTTTGCC ATAAAATATG CCCCTTTACT 12540 

CCATAATATT TATTTCAAAA AAAGGTATTA ATTTTTCGTT AGTGCTTT T A TATTTTCATT 12600 

GGAATCATTA AGCTTTCTAA TCTATCGTTA ATGATTTGCT TTAAAATTGG GTCGAAGTTA 12660 

ATTGAAGGTG TGAAGTGTAT ATCTGTATTA ATAACCATGT CATTCATTTG CTGCTTCACT 12720 

TTGTTAACAA GTCTTCCGTC ATATAAAAAT AATGGTACGA CAATCAATTT TTGATACCGT 12780 

20 TTCGAGATGC TTTCTAAATC ATGTGTAAAA CTAATCT CTC CATATAGCGT TCTCGCATAT 12 840 

GTCGGCTTGC TAATTTGCAA ATTTTGAGCG CAT ATTTG T A ACTCTTCGTG TGCCTTAGTA 12900 

AACTTTCCAT TAATATTGCC GTGTGCAACA ACCATAACTC CAACTTGTTG TTCGTCACCT 12960 

GCTAATGCGT CACAAATACG TTGTTCAATT AATCGTCTCA TTAAAGGATG TGTGCCAAGT 13 020 

GGCTCGCTTA CTTCTACCTT TATGTCTGGA TACCGTCGTT TCATTTCATG AACGATATTC 13080 

GGTATATCCT TGAGATAATG CATTG CACT A AAGATTAGCA ATGGTACAAT TTTAAAATGG 1314 0 

TCAACCCCAC TTTGAATCaA CGTCGTCaTT ACCGTCTCTA AATCCtGATG CTCACTTTCt 132 00 

AAAAACGCAA TATCATAGTG ATGTATATCA TCTTTTACTA ATTCAGAAAT AAATGCTTCT 13260 

AACGCTTGaT TCTGTCGTCC GTGCCTCATG CCATGTGCAA CAATGATATT CCCATTCACA 13 3 20 

TTTACCAACC CTTTCACACG TATTGTATAC CAAATCATTT TGTTTTTGTG AAAAGAATCA 13380 

CATTATAATG TAAAATCAGG GAATTCCCTG ATGCCTGTAG TCATG CAT AT TCCTTATACA 1344 0 

40 TTTTCCCTTT TTGTTAAATC AAAAAAAGCG ACCGATATAT GAATCCCTAC TCAACATTTA 13500 

TTTGAGCAAG CATTAATATA TCGGTCGCTT QTAGTGTATA TTATTATCTT AAAATGGTGG 13560 

TTGGCCTAAT ATTGTTTCGT CAAAGCGCTC GGGTATCAAT ACTTTGCGCA TGATCACACC 1362 0 

45 TAAATCGCCA TCATCATTTT CATGTTCGCT GTATATTTCA TAACCTCTTT TTTCATAAAT 13680 

TTTAAGTAAC CACGGATGCA ATCTTGCAGA TGT AC CT AAA GTAACTGCCG CTGA C TTTAA 1374 0 

CGTATCTCGC AAAAATGCTT CTTCAACATA AGTAAGTAAT TGGCTACCAT AGCCTTTCCC 13800 

TTCATACTCA GGATTTGTCG CAAACCACCA GACAAAAGGA TAACCCGAAA TACTTTTCAC 13 860 

ACTTCCCCAA GGATATCTAA CCGTAATCGT AGATATAATT TCATCATCAA TTGTCATGAC 13920 


EP0 786 519 A2 



CCAATCAATA CCTAGTTCTC TTAGAgGCGT AAATGCTTCA TGCATGAGTT CTTGCAATTT 14040

TTCTGCATCT T 14051

(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1885 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

TAATCCTCAA CTTnGATTAT ATGGCTTGGG CGCATATGAA CTGCTTAGTT TAGTGTATGA 60 

CATTCATACA GTTCGCATGA CTATCATACA ACCTCGAATA GATAACTTTT CTACTGAAGA 12 0 

GTTACCAATC TCAAGATTAC TTCAATGGGG AACCGATTTT GTTAAACCCT TAG C CAG ACT 180 

TGCTTATAAC GGTGAAGGTG AGTTTAAAGC AGGTAGTCAT TGTAGATTCT GTAAGATAAA 24 0 

GCATTCATGT AGAACACGTG CAGAATACAT GCAAAATGTG CCTCAAAAGC CACCACATTT 300 

25 GTTGAGTGAT GAAGAGATTG CAGAACTTTT ATATAAACTG CCTGATATCA AAAAATGGGC 360 

TGATGAAGTA GAGAAATATG CGTTAGAACA AGCGAAAGAG AATGATAAAA CGTATCCAGG 420 

TTGGAAGCTA GTCACGGGAC GTT CAAGG AG AGTGATAACT GATACAAAAG CAGTCCGAGA 480 

CAGGTTAGTT GAAGCGGGTT ATAAACCTGA AGATATTACA GAAACCAAGT TACTTAGCAT 54 0 

TACGAATTTA GAAAAATTAA TCGGCAAAAA AGCATTTTCT AAAATTGCAG AAGGCTTTAT 600 

AGAAAAGCCG CAAGGTAAAT TAACACTTGC TACCGAGTCT GATAAACGAC CAGCTATAAA 660 

GCAATCTGCT GAAGATGATT TTGACAAACT ATAAAAATTA AAAAGGACGG TATATAAACA 720 

^TCAJ^GCAAA ^TATTAAAT AAAACTA^G TGATTACAGG ^AAAAC^AAGA GCMC^T^TCT 7 80 

CACaTATTTT TGaACCTCAC AGTATG CAAG AAGGGCAAGA AGCAAAGTAT TCAATCAGTT 84 0 

TAATCATTCC TaAATCAGAT ACAAGTACGA TAAAAGCCAT TGAACAAGCT ATAGAAGCTG 900 

CTAAAGAAGA AGGAAAAGTT AGTAAGTTTG GAGGCAAAGT TCCTGCAAAT CTGAAACTTC 960 

45 CATTACGTGA TGGAGATACT GAAAGAGAAG ATGATGTGAA TTATCAAGAC GCTTATTTTA 1020 

TTAACGCATC AAGCAAACAA GCACCTGGTA TTATTGACCA AAACAAAATT AGATTAACGG 1080 

ATTCTGGAAC TATTGTAAGT GGTGACTATA TTAGAGCTTC AATCAATTTA TTTCCATTCA 1140 

50 ACACAAATGG TAATAAGGGT ATCGCAGTTG GATTGAACAA CATTCAACTT GTAGAAAAAG 1200 

GCGAACCTCT TGGCGGTGCA AGTGCAGCAG AAGATGATTT TGATGAATTA GACACTGATG 1260 
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TTGAGGTGTC AAGAATTTGA AATTTATGAA TATAGATATT GAAACATACA GCAGTAACGA 13 80 

TATTTCGAAA TGTGGTGCCT ATAAATACAC AGAAGCTGAA GATTTCGAAA TTTTAATTAT 144 0 

AGCTTATTCG ATAGATGGTG GAG CGATT AG TGCGATTGAC ATGACTAAAG TAGATAATGA 1500 

GCCTTTCCAC GCTGATTATG AGACGTTTAA AATTG CTCTA TTTGACCCTG CTGTAAAAAA 1560 

GTATGCATTC AATGCTAATT TCGAAAGAAC TTGTCTTGCT AAACATTTTA ATAAACAGAT 16 20 

GCCACCTGAA GAATGGATTT GCACAATGGT TAATTCAATG CGTATTGGCT TACCTGCTTC 16 80 

GCTTGATAAA GTTGGAGAAG TTTTAAGACT ACAAAGCCAA AAAGATAAAG CAGGTAAAAA 174 0 

TTTAATT CGT TATTTCTCTA T AC CTTGTAA ACCAACAAAA GTTAATGGAG GAAGAAC r AG 1800 

AAACCTACCT GAACATGATC TTGAAAAAtG GCAACAATTT ATAGATTaCT GTATTCGAGA 1860

TGTAGAAGTA GAAATGGCGA TTGCT 1885 
(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2656 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

TAATCCTTAG TTCACTGnCA AATTTCAAAA CACCAGTTCC CTCTATCTGC ATCCATAGAA 60

ACTGnATGTT TGTGTCAATA ACCGGATTAT ATTGTGATGn TGTTTGTAAC TCGATTAAGT 120

TATCATCTTT CGAAAAATTA TCTACTACCA TTATTCAACC ACCTTTCCTT CGAATAAACT 180

CCATTTACCA ACkCCACCAG TACCAAAGTT TCTAACTAAA AATTGATGTG CAGACGGGAA 240

GTTATTACGT CTTAATACTT GTGTTGTATT ACCTGGTGTA TTCGATTTTA CTTCTAATAT 300

CCAACCTGCA ATACCTTTAA AGTCTTTAGG AAAATCAGTA AATCGGTTTG ATTCTTCAGT 360

AGTGATATAG AAATCTAAAC CAACGATTTT TAAATCTGAT AATTTTGTAA TACTCTTAGG 420

GATATGTTCC CAATAACCGG CGTTTTGCGG GCAGAAATTC CATGCTCCGT TGTTTTTCTT 480

ATTGAAAATG TCAATGACAC GTTCGAATTT AAGCATATTT CTACCTGTGC TGTTTCTGGt 540

AAGTACTTGT CTTAGAGCAC CATTATAGTG TCCAGGCAGT ACATCCAAGA ACCACCCTGC 600

ATCTCTAAAC GCTTTCGGTA ACGGGAAATC TAATGCATTT TGTGTGTCTT GaCGTATAGA 660

TATAGTAATG ACCAACTTCC GTAATATCAC TTAGATATGC TGGGTTCTGT ATTGGTAACG 720

GTTTAACACG TCCGCCTGAA TCAGTCATTG ATACTTGAGG TGCGATGTTT TTCAAGAATT 780
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TAGTTACCCC GATTAGAAGT GCTTTACGTC CTGTTTCTAG ATCGTAATAC ATATCTAGAC 900 

CCTCAGCCTC TTGGAAATCT CCTTTAAAGT TGTTATTCAC ACCGCCTATA TCGATGCGAC 960 

GTTTAAATAA CAATTCTTTC GTTTTGATAT CGAAGCCTTG TAAGTAGTTA GGGTTGGCTG 1020 

TATTCGAATC ACCTGTATAC CAATATAAGA TACCTGCATC ATAAGTGATA C CTTG CAT AG 10 80 

GTTGTGTATC TGAAGTGTAT TCCATAGGTA TATCCATTTG ATACAATACT TTGTCTATAC 114 0 

CTTTATCAAT ATCGT CAGCA CTTCTAACCT CAACAAAGTT CAACGAATTC TTAAGTTGTC 12 0 0 

TTTCAGTGGG TTT AT ATT CA CGTCTAAAAA TCATTAAATT TTCTACCGGA TTATAAATCG 1260 

CTGACGTATA TCTGTCGTTA AATATATTCG GCATGACATC TTGCATTTCA TTACCATAAG 132 0 

TTATTTCTCC AGTTCTATAT TGGAAACGTA CAAACTTGTT GTTTTTGTTA CTGTCCAATA 13 80 

CAGCTGAATA AATCCATAAT TCTCCATCAA TGTATCTATA CGCATTGTGT GTACCGTGAC 144 0 

CGCCGTTTTT AACAAGCAAT CTATCAATAA ATTGTCCGTT GGGCTTCAAT CTAGATAACA 1500 

TGTAATGATT ACCTGGACGA GCTTGCGTCA TATAAATAAT TTTCGTTCTA GGGTCTACCC 1560 

AAAATGATTG CATTACTGCA TTTGTATATG GCGATAAATC AGTGATAAAT TCCGGTTCTT 162 0 

25 GCTCTTTTGG TTCGAATCGG TATTCTGTCG CTCGATATTC TTTATAGTGT TCATCTACAG 1680 

CTTTCTCAAC CTTTTTAGTG AAAACATCTA GTGTTGAATA AT CATGAT AC AAACGATCTT 174 0 

GCAATGTCTT ATGACCATAA CCTGTATTAT CAACGCGCGC GTCTTTTAcT TCGTTGATAC 18 00 

30 CGTCGCCGTT ATGACCTAGT ACCATGTTGC TAAATCGACC GTTTAAATAT GTTAAAAAGT 1860 

CAGAGACGTT ACTTGTAACA TTTAAATGTT CATACTTTAT TTGTTCTCCA TCATGTGCGA 1920 

ATACCTCTTT ATTTCTGTGG TATTCAAGAG AGAAATTAAA ATCCGTCAGC ATGTCTGAAA 19 80 

TAAGTTTAAA GTTATACTCA TTTTCATCTA CATATCTGTA GTCAAAGACT CTACTTAAAT 2040 

CTGTAATTAG TTTATTACTC ATGTTTTCCT CCTTTACTAT CCATAAAACT GATmATAATT 2100

TTTAATAAGC TCATACATAA TAACTTCATG ACCTCTTTCA TTAGGATGTA ATCCATCAGG 2160 

CATGCTAGAT TTTCTAAATG CTGGATTATA TGGTTTGAAA TAATCTGTGT GATAAGCATC 2220 

ATATACTGGT ACATCCAATT CACTACAAGC CAAT AT CTG A GCATTGACAT AATCCTCTAA 2280 

AGTTAACCCT AGTTTGTTTT TGTCCGTATC TTTACGGCGT ATCGTTGTAC CACT CAT AGG 234 0 

GCATTGCCTA GTAGCTGTCA TTACAAGTAT TTTTGAAGCT GGATTATTTT TCCTGATAAC 24 00 

TTCAATTGCA GAACAAAAGG CGCCGTAAAA CGTTTTAGTG TCGGTTTTAT CAGTGCCTAT 24 60 

CGGTACGCCT GCCCAATAAC CATGTAACCA GTCATCATCT GTACCTTGTA ATATGATTAG 2520

GTCTCCTCTT ATTTGCTCTG CTTGTCTaTA AATGCTGTTT TCTaCCGCTT CTTTACCTAT 2580 
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CTTGCCTAAC ATTTCT 2656

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4854 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

AAAATGAGGG TTCTAGCGGA AATTACCAAA AGCGTGGTTC ATACTATGGG CAGCGTAATC 60

GTATTTCAAA AGAAAAAACA CCTAAATGGT TAGaAAATAG AGATAAACCT AGTGAAGAAG 120

ATTCGGCTAA AGATAATAGC GTAGATGATC AACAATTAGA GCAAGATCGA CAAGCATTTC 180

TAGATAAATT ATCTAAAAAA TGGGAGGAGG ACAGTCAATA ATGAAGCAAT TTAAAAGTAT 240

AATTAACACG TCGCAGGACT TTGAAAAAAG AATAGAAAAG ATAAAnCAGA AGTAATCAAT 300

GACCCAGATG TTAAGCAATT TTTGGAAGCG CATCGAGCTG AATTmACGAA TGCTATGATT 360

GATGAAGACT TAAATGTGTT ACAAGAGTAT AAAGATCAAC AAAAACATTA TGACGGTCAT 420

AAATTTGCTG ATTGTCCAAA TTTCGTAAAG GGGCATGTGC CTGAGTTATA TGTTGATAAT 480

AACCGAATTA AAATACGCTA TTTACAATGC CCATGTAAAA TCAAGTACGA CGAAGAACGC 540

TTTGAAGCTG AGCTAATTAC ATCTCATCAT ATGCAACGAG ATACTTTAAA TGCCAAATTG 600

AAAGATATTT ATATGAATCA TCGAGACCGT CTTGATGTAG CTATGGCAGC AGATGATATT 660

TGTACAGCAA TAACTAATGG GGAACAAGTG AAAGGCCTTT ACCTTTATGG TCCATTTGGG 720

ACAGGTAAAT CTTTTATTCT AGGTGCAATT GCGAATCAGC TCAAATCTAA GAAGGTACGT 780

TCGACAATTA TTTATTTACC GGAATTTATT AGAACATTAA AAGGTGGCTT TAAAGATGGT 840

TCTTTTGAAA AGAAATTACA TCGCGTAAGA GAAGCAAACA TTTTAATGCT tgatgatatt 900

GGGGCTGAAG AAGTGACTCC ATGGGTGAGA GATGAGGTAA TTGGACCTTT GCTACATTAT 960

CGAATGGTTC ATGAATTACC AACATTCTTT AGTTCTAATT TTGACTATAG TGAATTGGAA 1020

CATCATTTAG CGATGACTCG TGATGGTGAA GAGAAGACTA AAGCAGCACG TATTATTGAA 1080

CGTGTCAAAT CTTTGTCAAC ACCATACTTT TTATCAGGAG AAAATTTCAG AAACAATTGA 1140

ATTTTAAAAT GATTGGTGTA TAATGAATAC AAATCTAAAT CGTTTAAATG ATTGAAGACA 1200

AGATGATCTA ATCAATATTA CACAGAAAGC CATTGTTTGA TGAGAATATG GTTAATAAAT 1260

TAGATGATTA CTACTTCATT TATGGTATTT GTAATGAATA CCCGGATCAA GACCGTTATC 1320
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CTCGTCCCTT GTATAGGGGC GGGATTTTTT GTTTTTTTCA GACATAAATG TTTGTTGGTG 144 0 

TCATAAATTC CCTGTTTATT GTTAATAGGT TTAATGTTAA AACGATGATT GTTGTTCAAT 1500 

TTTTTAACGA GGTCAGATAA AAGTATTTAT AAAGCAAATA GGAGGGTTTA ACATGGAACA 156 0 

AATTAATATT CAATTTCCAG ATGGTAATAA AAAGG CGTTT GATAAAGGTA CTACTACTGA 1620 

AGATATAGCA CAATCAATTA GTCCTGGATT ACGTAAAAAA GCTGTTGCCG GCAAATTTAA 16 8 0 

CGGGCAACTT GTAGATTTAA CTAAACCGCT TGAAACTGAT GGATCAATTG AAATTGTGAC 174 0 

ACCAGGTAGT GAAGAagcGT TAGAGGTATT ACGTCATTCT ACTGCACATT TAATGGCACA 1800 

CGCGATTAAA AGGTTATATG GTAATGTTAA ATTTGGTGTA GGTCCTGTAA TAGAAGGTGG 1860

ATTCTACTAT GACTTCGACA TTGACCAAAA CAT CT CAT CT GATGACTTTG AACAAATTGA 1920 

AAAAACAATG AAACAAATCG TTAACGAAAA TATGAAAATC GAACGAAAAG TGGTTTCACG 1980 

20 AGATGAAGTG AAAGAGTTAT TCAGCAATGA TGAATACAAA TTAGAATTAA TCGACGCGAT 2 04 0 

TCCTGAAGAT GAAAATGTAA CATTATATAG TCAAGGTGAT TTTACTGATT TATGTCGTGG 2100 

AGTTCACGTT CCATCAACAG CTAAAATTAA AGAGTTTAAA CTATTATCTA CAGCAGGTGC 216 0 

25 ATACTGGCGT GGAGATAGTA ACAACAAAAT GTTACAACGT ATATACGGTA CTGCTTTCTT 2220 

TGATAAAAAA GAATTGAAAG CACATTTACA AATGTTAGAA GAGCGTAAAG AACGTGATCA 22 80 

TCGTAAAATT GGTAAAGAGT TAGAACTATT CACAAATAGC CAATTAGTTG GTG CTGGTTT 2 34 0 

GCCATTATGG TTAC CTAACG GTGCAACAAT TAGACGTGAA ATTGAACGTT ACATTGTTGA 24 00 

TAAAGAAGTT AGCATGGGAT ATGACCACGT TTATACACCA GTACTTGCTA ATGTTGATTT 24 60 

ATACAAAACA TCTGGTCACT GGGATCACTA TCAAGAAGAT ATGTTCCCAC CAATGCAGTT 2520 

AGATGAAACT GAAT CTATGG TATTACGTCC AATGAACTGT C CACATCAT A TGATGATTTA 2580 

TCCCAATAAA CCACATTCAT ATCGTGAATT ACCTATCCGT ATCGCTGAGC TAGGAACGAT 2640

GCATAGATAT GAAGCAAGTG GTGCTGTATC AGGATTACAA CGTGTTCGTG GTATGACTTT 2700 

AAATGATTCA CATATCTTTG TTCGACCTGA TCAAATTAAA GAAGAATTCA AACGCGTTGT 2760 

AAACATGATT ATTGATGTGT ATAAAGACTT TGGTTTCGAG GATTATAGCT TTAGATTAAG 2820 

45 TTATAGAGAC CCTGAAGATA AAGAAAAGTA CTTTGATGAT GATGATATGT GGAATAAAGC 2880 

TGAAAATATG CTTAAAGAGG CAGCGGATGA GCTTGGCTTA TCGTACGAnG AAgCGATTGG 2940 

TGAAgCGGCA TTCTATGGTC CGAAACTAGA TGTTCAAGTT AAAACAGCGA TGGGTAAAGA 3000 

50 AGAGACATTA TCAACAGCAC AACTTGATTT CTTATTACCA GAACGTTTTG ATTTAACTTA 3060 

TATTGGTCAA GATGGTGAAC ATCATCGTCC AGTTGTTATT CATCGTGGTG TTGTATCAAC 3120 
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AGCGCCAAAA 


CAAGTTCAAA 


TCAnCCAGT 


TAACGTTGAT 


TTACATTATG 


ATTATGCGCG 


3240 




CCAATTACAA GATGAATTGA 


TA RTPT'O TA TV r^/^ 

AA 1 L. 1 LAAGG 


CGTTCGTGTA 


AGTATTGATG 


ACCGTAATGA 


3300 


5 


AAAAATGGGT 


TATAAAATCA 


f* ta/"" ft tv r+ r+T , /~ % i\ 
GAGAAG CTCA 


AATGCAAAAA 


ATACCTTATC 


AAATCGTAGT 


3360 




TGGGGATAAG 


GAAGTTGAAA 


RTR TV TV* A TA f'T 1 


GAATGTGCGT 


CAATATGGAT 


CGCAAGACCA 


3420 


10 


AGAAACAGTT 


GAAAAAGATG 


AA1 1 lAlLlVj 


GAATCTAGTT 


GATGAAATTC 


GTTTGAAAAA 


3480 




ACATAGATAG 


ACAGTTGTCG 


LttHlAAMAKj 


CTTTAAAACT 


TTTATTGCGT 


ATCAAGTTTT 


3540 




ACAGGGTTGA 


TTATGCGTGA 


1 G AA 1 V_ V» 1 G 1 


ATATTACAAG 


TTAGTTAAAA TATTAAATTG 


3600 


IS 


AGTTAGAGGT 


TGCATGTTTA 


ATTAG T AACT 


TGTCAGAAGT 


ATTTATGGTA 


CATAAGTTGA 


3660 




ACAAGTGAAA 


GGTAAAGATG 


r^r^r^ * mi « tv /•* iv 
v_ V_GAAATAGA 


TATAAACCAT 


TA TA TA T* r T» TV HH TV *T^f^ 

AAAHATATC 


TATTGGGACA 


3720 




GTTTTCGAAT 


AGGAACTGTA 


L 1 G 1 CACAGA 


ATGTGATGTG 


CT AC CTT AT A 


TAGATAATTG 


3780 


20 


CCAAAGTGGT 


TG CAT ATCTT 


ft ft TV /*^«T^ TV T , ^""T> 

AAAGGTATGT 


AGCCACTTTT 


r P r P IV ^W|W1WT*I' TV TV 
1 lAC-i 1 11 AA 


TATCACTATG 


3840 




TTCTGTAAAA 


AAGGGTATGA 


AAGTGAATAA 


AGGTTATTTA 


TTTCTTGGCC 


TCTAAAACAT 


3900 




GGAAAGGGAG 


CTTATATGTC 


AAAACsl T CAA 


AATGAAAGTA 


TA TV TA TV'TTO'P 


CAAAAGGGGA 


3960 


25 


CTTAAAGATC 


GTCATATTTC 


TATGATTGCG 


ATTGGGGGTT 


G1A1 ivjUlAL 


AGGl 1 iATTT 


4020 




GTAACTTCTG 


GTGGAGCAAT 


T CATG ATGCA 


GGTGCTTTGG 




AGGATACG CA 


4080 


30 


ATTATCGGAA 


TAATGGTATT 


ill C TT AATG 


ACGTCACTTG 




rp » ft/I rp t> mmnvi 

1 ACGTATTTG 


4140 


CCAGTATCAG 


GTTCATTTAG 


TA C AT ATGCT 


ACAAGATTTG 


X iviiii GGA1 L. 


TTTAGGGTTT 


4200 




GCGCTTGGTT 


GGAACTATTG 


GTTTAACTGG 


GTAGTGACTG 


T , TAr"*/~*TV^"»/'*'TV/"^JV 

1 AGCAGCAGA 


TATTACGATT 


4260 


35 


GCAGCACAAG 


TCATTCAATA 


1 I\?GACACCA 


TTGCAAGGCA 


TACCCGCTTG 


GGCATGGAGT 


4320 


GCGTTGTTCT 


TAGTTATAAT 


ill 1 AG 1 CTG 


AATTCGTTAT 


CAGTTCGCGT 


CTATGGTGAA 


4380 




AGTQAATACT GGTTGGCATT 


ft T* TA TV T\ TV /^"T*/** 

GA 1 AAAAGTG 


GTTACAGTTA 


TTGTTTTCAT 


TGCAATTGGT 


4440 


40 


TTATTAACGA 


TTGTCGGAAT 


LATGGGTGGT 


CATGTTGTAG 


GATTCGAAAT 


ATTTAATAAA 


4500 




GGTGAAGGTC 


CAATTCTTGG 


TT^r^r^ ft ft / wixi> j\ 

1 GO CAAGl 1 A 


GGAGGAAGTT 


TGTTATCAAT 


TCTAGGTGTA 


4560 




TTCTTAATCG 


CTGGTTTCTC 


ATTCCAAGGT 


ACTGAGTTAA 


TTGGTATTAC 


GGCTGGTGAA 


4620 


45 


TCAGAAAATC 


CTGAACGTGC 


TGTGCCGAAA 


GCAATTAAAC 


AAGTATTCTG 


GAGAATTTTA 


4680 




TTATTTTACA 


TTTTAG CCAT 


TTTTGTTATC 


GGTATGTTAA 


TTCCTTATGA 


TAGTAGTGCA 


4740 




TTAATGGGGG 


GTAGTGATAA 


TGTAG CAACG 


TCTCCATTCA 


CATTAGTGTT 


TAAAAATGCT 


4600 


50 


GGATTTGCGT 


TTGCAG CATC 


ATTTATGAAT 


GCAGTCATTT 


TAACGTCTGT 


GTTA 


4854 




(2) INFORMATION FOR SEQ ID NO: 107: 
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(A) LENGTH: 2488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



ATCAAAAATT GATTGTTTTC XlATTTTTTGT TTCAGCGCGG GATCTTTTAC GTCTTTTGTG 60

AAAACGaTTT TATTATTAAC TACTTTTACT GGATAACTTT TGTATGTCGA GTCAGTAGCA 120

TTTTTTCTAT CGTTTGTAGT TGTGTCATAT TCACCAgTTA TTTTATGTGT GTTCTTATCT 180

ACCTTTAACA ACATACGGTC TTCTTTTAAA AGCTCATCTG ATCCAACAAC TGAATAAGAG 240

GATTCTATAT ACCATGTGTC TTGATCATTA TTTTCATAAT GGGGATTATC GTGACCATCA 300

ATTTCATAAA GCGTTTCTAA GTTTTTAATA GGATACGTAC TTAGTACTTT TTTAAGACCA 360

TCTTTCAAAT GAATTTGTTC CCACTTCATT GCCAAAAACA TATCGCCACT GACTACAATT 420

GAAATAATAA TAATTGCTGC TAAGTTTAAC CAGAAAATTT TATGTGCTTT CATACATTCC 480

CACCGTTTCT CAAAATACTT CATTAACACT ATAATAATAT ATTTTGAAAA ATATTTACAT 540

CAGTATTAAA GTGAATATCA AATTTTAAAT TTATGAAAAT AATAGATATT TATAAAAAGC 600

GGAAAAGAGA TACAATAAAA AACTGCATGA CGTTTGAGAC GTCACACAGT GTAACTAAAA 660

ATTTAAAAAG TTGTTGCTAA TTTTTCAGCA TTATTAATAC TAGTTGCTTT AATTTCTTCA 720

GTCTTATGAG GTTCAGCATT GTGTCCTTCA ATAATGATTG TTTCATATGA TGGCACACCT 780

AAGAATGTCA TAATTGTTCT TAAATAACGG TCACCCATTT CAAAATCAGC AGCAGGTCCT 840

TCAGTATAAT ATCCACCACG TGATTGAATG TGTAATACTT TTTTGTCAGT TAGTAAACCT 900

TGTGGTCCTT CAGCAGAATA TTTAAAAGTT TTACCTGCAA TTGAAATAGC ATCAATATAT 960

GCTTTAACTA CAGGTGGGAA AGAAAGGTTC CACATAGGCG TTACAAATAC ATATTTATCT 1020

GCACTTAAAA ATTCTTCTAA AATGTCACTC AATCTTGAAA CTTTCATTTG TTCATCATCA 1080

GTTAACGTTT CGCCATTACT CATTTTTCCC CAACCAGTTA ATACATCTTT GTCAATAACT 1140

GGAATATAAG TTTCArATAA ATCAATATGT TTCACTTCAT CATCAGGATG TTGTTGTTGA 1200

TATGTTTCGA TAAATGCTTT ACCAGCCGCC ATAGAATTTG ATACCAGTTC ATTAAAAGGG 1260

TGTGCTGTAA TATATAATAC TTTTGCCATT TGAAAATTCT CCTCTGkTTC TGTTATTTTC 1320

TTAAGTATAA TTATTATACT CGATATAAAA TTTAATATCA ATCAAAATAT TCAAATTACC 1380

ATCATTTTCT TCATCTATAT nTGGCAGTAC TACTAAAGTA TGAGTGCATT TAATTATGAa 1440

ATAGTTGATT TaGAATAtAT ACTTAATACC CAAAATATAT GAAGGATGGA TGCCACTATG 1500
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ATTATTTATA TAGATGACAT TCAAAAATGG TTTAACCAAT ATACCGATAA ATTGACACAa" 

AATCATAAAG GACAAGGACA CTCAAAATGG GAAGACTTTT TTAGAGGGAG TCGGATTACT 

GAGACTTTTG GTAAATATCA ACATTCACCA TTTGATGGTA AG CATTATGG CATTGATTTT 

GCATTGCCAA AAGGTACACC AATTAAAGCG CCGACGAATG GTAAAGTAAC ACGTATCTTT 

AATAATGAAT TGGGCGGCAA GGTATTACAG ATTGCCGAAG ACAATGGAGA ATATCAC CAG 

TGGTATCTAC ACTTAGACAA ATATAATGTC AAAGTAGGTG ATCGAGTCAA AGCAGGTGAT 

ATT ATTG CAT ATTCAGGCAA TACAGGTATA CAAACGACAG GCGCACATTT ACATTTTCAA 

AGAATGAAGG GTGGCGTAGG TAATGCATAT GCAGAAGATC CAAAACCGTT TATCGATCAG 2040

TTACCTGATG GGGAACGTAG CCTATATGAT TTGTAGTTAT AGAAGGGTGC CCGCAGTCTA 2100

AAAAATTAAG CAATCATTGT GTGAGTATGA TACTTACATA ATGGTTGCTT TTTTCAATGA 2160

AAATCGTAAT GCTAAGTCAT ACTTGTTTGA TTTAGATATT ACTTAAAATG TAAGACAAGG 2220

TTGTTAGCAT TGGCAGTGAA ATATCGCACA TAAAAAACAT TATTGTCACA CTAGAAAATA 2280

GTTGTGCACT ATATCAATTT TCTGTATAAA AGTTTAATTC TGACAGTAAT GTAAACGTTT 2340

ACAATTTATG ATTGACATTA ATAATGACTG AATATATGAT TTATGTAAGT ATTTGTGCAA 2400

CGTTTTCACA AAGTGTATTG CACaAyCAAA CTGtAAACaA aGTATGGGGg GCCATAACAT 2460

GGCAGAACTA AGTTAGAGCn TATTAAAA 2488

(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4093 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

TTTTCTTTAT TTCAAmCTGT ATATTaATGA TGTCACTTCA TTTGATACGA TTCTTGATAA 60

CCTATTCAAA ATTCCGCCAA ATAACATAAA TATTATATAA ATGCCGATAC TTTTAATCAT 120

TTTCTACTTT TTCTTCGATA CGGAAACTTG TTTTCGAATT GAACACTTCA CCAGCTTTTA 180

AAATTGACGG TGCTTTTTCA CCATATAAAT TAATATCATT TGGTAAAAAT TGTGTTTCTA 240

ATGTAAAGCC AGAATGTGGT TTATAAATAT TAAATGGACT ATCCCACTCA TCAGGCTGGT 300

TAAAAGTAAA GAACACAACA TGAGGCATAT CTGTATCGAC CTCTAACATA AATTCATGAT 360

TTTCAACATA CATTTTATGT TCACCAACTG TAAATGGGTG ATCGAGACCA CCAAAACGTG 420

TATCTTCAAA CACTTCATGT AAATCTAGAA TATCACCTGT AACAATATTT CGCTCATCTA 540 

ATACATACAT ATCTAATTGA TTACTTGAAA TGCGATGATT ATCAACGACA TTATTATCTC 600 

GATTCAAATT GAAGTACACA TGATTCGTAG GACTAAACAA TGTGTCTTCT GATGCAACTG 660 

CTTCGTATTC AATCGACCAT TGGTGATCCG CATCATAAAT ATGTGTAATC GTCACATCGA 720 

TATCACCCGG GAAATGATCA TCAGCTGATT TCAACACCGT CTTAAATATA ACTTTAATTT 780 

GAGCAATTTC ATTTCTAATT TCATAATCAA ATAACTTATT GTCCAAACCA TGACATCCAC 84 0 

CATGTAAATG ATGTTCACCG TTGTTTTTTT CTAACTGATA TTCTTTACCT TTCAACTTAA 900 

15 ATTTAGCATT ATCAATTCTA CCG CTATATC TTCCTATAGA AG CACCAAAT TTAAAAGGAT 960 

TACTATGATa AAATTCATCC GCTTCAACAA CATTTCCAAG AACAATATTA TTATCATGAT 1020 

ATTTCCAAGA CACTACTCTT GCTCCATAAT TCGTAAAAAT AATTTTAGTT T CAT CATT AT 1080 

CAATTTTGAT TAAATCTACA CCTTGTCTTT GGTGCTCAAC TTCAACTATC ATTTTTACTT 1140 

CTCCCTTCTA ACCACAAGTG TTCAAGCTCT GCTGGGT AG C AACATTACTA AAACACCTAC 1200 

AATACAAATG ATTGCACCGA TAACATCATA TTTATCTGGC ATTTGTTTAT CTACGACCAT 1260 

CGCAAAAATC AAACTCATGA TGATAAATAC GCCACCATAT GCTGCATATA CTCTTCCGAA 1320 

TGATGGAAAT GATTGAAATG TCGCAATGAC ACCATATAAC ATGAGTATCG CACCGCCTAT 13 80 

TAGCCCAACA AGTGAAGACT GTCCTTCCCT AAGCCACAGC CAAATCAGGT ATCCCCCACC 144 0 

TATTTCACAT AAGCCAGCTA ATATAAATAT AAAAATCGGA TATAACATGA AATCACTCCA 1500 

TCACACATTT GCTATCAATA ATCTATCGGC TACATATCAT TTGTTTACAT TTCTTCTTAC 1560 

35 TTCACATTCC CATTTTAAAA AGTTCGTTTT CACATTCATA TTGTACACTT TTTTAGACAT 1620 

TATTCTATAG CTAAATATAA AAAAATAAGA GTAACACGCT TTCATCATCA TTTTATATGA 1680

TAAATGTGTG TCACTCTCAT CAATTTTATT TTTTAAATAC ACGTTTCATT GAATTAAATA 1740 

AGCCACGTTC AAATGTAAGT ACTGAATCTT TATATGTTTT AATTGCAATC CATATCAAGA 1800 

CAGCTACCAT TACAATTGAG ATTAAAGAAC TTAAGATGAC CT CAT AT ATT TGAAGCCCTG 1860 

AAGTTTGAGC GCGTACAACT AATTGAAATG GCGCTAAAAA CGGAATATAA CTTGTGATTA 1920 

AAGCAAGTTG TCCATCAGGA TTATTTATCG TGAATATCGC GATATAAAAT GCAATCATAC 1980 

CAAGTAATGT CAGTGGCATC AAAGATTGAT TTAAATCTTC TATTCTAGAT GTTAATGATC 2040 

CGAGGATGGC TGCAAGTAAT ACATACGCCG TAATTCCAAC AATACTACTT ATAATTCCGA 2100

CAATAATAAT TTGCCAAGAC AATTGATTCA TTTCCACGTT AAAACCTTGT AGCAAGTCTT 2160 

TTAAGTCAAA GGCAAAAATG CATATAACTG CCATCAATAC AATTAAAATA ATCTGAGTCA 2220 
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TAATAAT CAT TTCAATGACA CGCGATGTTT TCTCACTAGC AATTTCCATA GCTATTTGAG 234 0 

ATG CAT AATT TAAAACAATG AAGAACATTA GAAAGATAAT GCCATmaGcT AAAGCATAGT 2400 

TGAAAATCTT TTGTCCTTCT GATACTTTAT CGACTTCATC ATTAGAAATC ACCTTATTAT 2460 

CAACTTTACT TTGTGCTTGT AATTTTTGTA AGTCTTCTTT GTTGATATTT AATTCCCCGG 2520 

CTACCATATT TGTTTGAATA GCTGTAAGCA GTGCTTGTAC TTTTTGTGAA TCTTCATGAC 2580 

TTACTCGCTT CTCACTAATG ATTGTCCCTT GTAACGTGCG ATTTTGATTC ACCTTGATAA 264 0 

TATAAGCTTT ATCAAGTTTA TGTTTTTTTA CTTCTTTTTC AGCATCTTCT ATAGAAACTT 27 00 

TAGTAAACTT AG CAT CA CT A TGAAATGTAT TCGCCTGTTG CTTGAAAACC TTATAGATTT 27 6 0 

GTTCATTCGG TGCTGCTACA CCAATTTTAT CTGGACCATC ATCAAACATG TTAATAATCT 2820 

TATCAATGTT AG AT AGG C CA ATCATTAAGG CAGCAATAAT AATCATAAAA ATTACAAATG 2880 

20 ATTTAGCTTT AATTTTTTTG ATATATGTCA AAGTAAATGT CGCCCAAAAC TT ATG CATC C 2 94 0 

TTGCCACCAA CCTTCTCAAT GAATATATCT TGTAATGATG GTTCTACAAC TTGGAATCGT 3000 

TTAACATAAC CTTGATGTGC CACAACTTGA TAAATATCTT TGGCTACGTC TTCATTCTCA 3060 

ATCGTCAACT GAAGACCTTG CTTCATGTTT TCACTATGAA TGATGC CTCT AATGTTTGTT 3120 

AAATCTGGTA GTGTTGTTTC TGATTCAATG ACAACTTTCT TGTT AC CATT AGATGCACGT 318 0 

ACATGATTGA TATCACCAGA AACAACAAGT TGACCTTTAT CTAAAATACA AACATCATCA 324 0 

CAT AATT CTT CAACATGCTC CATACGGTGA GAACTATAAA CGATTGTACT GCCCCAATCA 33 00 

TTTAAGTCTT TAACTGCTTc TTTTAATAAC TCAACATTAA CTGGGTCTAG ACCACTGAAA 3360 

35 GGCTCATCTA ATATTAGTAA TTCTGGTTTA TGTAACATAC TTGCTAACAG CTGAATTTTT 3420 

TGTTGATTCC CTTTTGATAG ACTATCAATT CGTTTTTTGC GGTTTTCAGT AATATCAAAA 34 80 

CGCTCAAGCC AATACGATAT TTGCTGTTGT ATTTCTGTTT TTGACATTCC CTTTAAAGTT 354 0 

GCCAAATATT TCAATTCTTC TTCAACTGTC AATTT CC CAT GTAAACCGCG TTCTTCCGGT 3600 

AAATAACCAA TACGATTGTA CATTGTTTTA TCTAGTTTTT TACCGTTATA CGTrrTGTGT 3660 

CCTTCAGTTG GTTCACTTAA GCCTAAAATC ATACGAAATG TCGTT G TTTT ACmTGCACCA 3720 

TTTCTTCCTA GAAAACCTAA CATTTTACCT GATTCTAACT TTAATGAAAT ATCATTTACT 3780 

GCCGTCATCT TGCCAAAACG TTTCGTAACA TGTTCAATTA CAAGTCCCAT ACTTTGCCTC 384 0 

CTAAAAAnAT ATGTATTTAT CTTAATATAA CATTTCCATT CTCTATAAAT GCAATATTTT 3900

TAAAATGAAT TTATTTTTAA AATTTCTGAA ATTGAAAAAT TTAAATAGTG CCATTTTTGC 3960 

ATGTTAAGTA T CATT AG CAC TAGATATGTT TTTTCCATGC CTTTATTGCC TTATTTGTAA 4 020 
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CTTnCCGGTG TTT 4 093 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TGCCAAACTA CCTTTTGACA GTCGTTGCTG TACTTCAGGA TGATCAATCA CATATnTTAC 60

TTTATCAAAT AGGGCATCTT CATCATTTTT AGTAATTAAA TAACCATTGA AATCTGAAGT 12 0 

AATCAGTTCG TTAGGT C CAT ATTTAATATC ATAACTAATA ACTGGAACAC CATGTGCTAA 180 

20 AGATTCAAGT AGCGCTAAAG AGAAACCTTC CATGTTACTT GTTATTAAAC TCAAATAGGC 24 0 

ATCG CTATAT TCTTGGTCTA GATTGCTTAA AAAGCCGCGT AAGTAAACAT GATTTTCCAA 300 

TCCATATTTT TGTATCAATT CATTTAATTT TTTACTTTCA GAaCCAAAAC CATACATATG 36 0 

AaGCTCTATT TTTGGGACAT ACGATACTAA GCGTTTAATT AATTCAATTT GTTGATGTAA 420 

TTGTTTTTCA GGTGAATAAC GAGCAACGGA AATTAATTTA ACACTGCGCT GATCTAATGT 4 80 

TTGGACTGGT GTATCAATTG TTTCACTATA GCCGACAGGA ATATTAACAA CTGGAATAGT 54 0 

ATGGTTAATA CGTTTTTCAA CATCTAATTT TTGCTGCTCA GTAGAAACGA TAATTGCACG 6 00 

ATATCGAGAT AAATTTTCAA ACATCGCTTT ATATACATTT TTAAATGGCG ATGAATCTAA 66 0 

35 TGCATCAATA TTTTTAATGT GTGTACTGTG AAGCACAGCT ACTACTGGGA TTGACTCAGG 720 

CGTTAAGTTG AAAATAGGTG CTGTGTACAC ATTACGATCA CTGAAAAATA AATCCCCATG 780 

TTGATATAGT TGTTTAATGA AAAATGCGCC TAATTCCGTT TCATTATTAA AGAAATATTG 84 0 

TTTGTTAGCA TAGTAAACAA TAATTTTTTG TACTTCTGGT TTGCCATCCT TGTAAGAAAA 900 

ATACTTTTCT AATTTTGTGT CACCTTCTGG ATTATAGAAA AATTCACATA ATGTTTGTTG 960

TTTATCAACA AGAATCCTAC TACAACTTAA AAAGCCACGC ACATCATAAA AATCACGTTT 1020 

TACTTtTCGT CTTTGACTAT CAAAATGATT TACATAATCT AATATACGAT ATTTAGGATC 1080 

TTGAAAATGG GCATACATTA AGAAACG CTC TTGATCATAT ATTCTAAAGT CATGACTATT 1140 

TTCAACATGT TTTAAAGTAT AATGACATTC ATCAGTCCAA TACGACAACC AGTCAAATGG 1200

TTCATTGCGT TCTAAATATG TTGCTTCTTG GAAGAAATCA TACATATTAA TATAGTCAGA 1260 

ACTAGTAATA TAATTTTGGG CATTTCTATA TAAATATCTA TTCCATGACA GAAATACACA 1320 
CCCAGTTAAA TTAACACCTA AACTATTACC TACAAAATAA TTCATTTACA ACACCACTTA 1440

TATCTATTTT TTATAATTAT ATCACATAAT ATTTAATTAC TTCTTTTAAC TGGAAGATGT 1500

GTTTATTTAT AAAACAACAA ATTTTGATAT TTATAATGAT AGTAGTTATT CAATCAcTAC 1560

GACCcAATAT ATCATkGTAG AGCTTAGGAT ATTGATTTAT GACTCAGGCA CATCAAATGa 1620

GAgGATTTAT AAArGAGATA TACAACTCTA GAAGGTATAA TAAAAACGCG CAACTAATGT 1680

TACGCGTTTG AATTAATCAT ATGATATTAT TTGCGATACT TTAATTTAGC GAAAgcATCA 1740

TGTTGATGGA TAGACTCTTC ATTACGACAT TCGATATCGA AACCGTCTAA CCAATCAAAT 1800

TCAACTAAGT CCGCGGCAAT TAAACGAATT AAGTCTTCGA CAAAACGTGG ATTTTCATAT 1860

GCACGCTCTG TCACACGTTT TTCATCAGGA CGTTTTAAAA TAGGGTATAG AATTGAACTT 1920

GCATTAGCTT CCATTGCATC TAAAATTTTA TTTTTATAGT CATCAACTAT GTCTTGATCT 1980

TTATTAATAT ATGTTTTAAC AGTGACAACA CCACGTTGGT TGTGCGCTGA ATACTCACTT 2040

ATTTCTTTTG AACAAGGGCA TAGCGTTGTG ACAGTTGCTT CAATAGTAAG TTCTTTACGT 2100

GTAnCTTTAT CACCGTCAAT TGCTAATCCA TAAGTGACAT CGGCATTACC AACTGCTTTA 2160

ATATTTGTGG TTGGACTATA GCGATCAAAG AACCATTTCC CAGAAACATC AACGCCTGCC 2220

GCATTTTGTT TCATATTCGT TTGTAAAGTG CGTAACACCT GATAAAGTGT ATTAAATTCA 2280

AGTTCAATAC CATTATCATA GTGCTTTTCA ACACTTTCGA TTATACGGCT CATATTAATA 2340

CCTTTTTCGT CTTTTGTTAA ACTTGTTGAA AAACTAAATG TGCCAGCTGT TTGATACTGG 2400

TCAACAAGTA CAGGGTACAC TAAGTTTTTA ATACCAACTT CTTCTATTTC AAATAAAAAA 2460

TCTTTATGTG TACTTTGTAA ATCTGTCATT TCGTTCTTAG TAGTAGGTTT CGTGCCTTCA 2520

ATAGGATCTA CGGAACCAAA GTGTTTCCAA CGACCTTCTC GTGTCGATAA ATCAAATTCA 2580

GTCATTTTTT TCCTCCGTTA AGATTTAAAG TGATATGTCC AATATGGTTC GACTGTTAAA 2640

AAGCTGTGTT GTTTACCATC GATTTCAGGA CTTGCTAATT GTTTTAAAAA TGGACCTGTT 2700

TGAGAAGCAT GTGCTTCAAA TGCCTTAATT TTAAGTTCTT TAAAATCTGT AATATCATTT 2760

TGAATATCAG GTTCTCCAAG AGCTTCGGTT GCATGATTAC TGAACGCAAC TAAAGTTAAA 2820

CGAGGGCGTT CTTCTTTAGG CATGCGTTCA ACCGTTCGAA TTACAGCGTC TGCTGTTGCT 2880

TCGTGATCAG GATGTACTGC ATATCCAGGA TAAAATGAAA TAATCAATGA TGGATTTGTA 2940

TCATCGATTA AAGATTTAAT CATACCATCT ATATGTTCAT AGGGTTCAAA TTCGACAGTT 3000

TTGTCACGTA AACCCATTTT TCTTAAATCA GTAATACCGA TAACTTTACA AGCTTCTTCT 3060

AGTTCACGCT CACGAATACT TGGTAATGAT TCGCGTGTTG CAAATGGGGG ATTACCTAAA 3120
TAATTTGCTA ATGTGCCTGC AGATGAGAAG GTTTCATCAT CAGGATGTGG AAATATTACT 3240

AATACATGTC TTTCGTCAGT CATGTTGATG CCTCCTCTAT AAATTAAATG GTCGCTCACT 3300

AATTTGAAGT GCTGCAGCGA GTTGACCTTC GTAATTAAAA CCTGCAATTA AAAATTCATC 3360

ATGCTCATTG ACCTCAAAAT GCGTTAGACC TTGTACATAA ACCCAACCAC CATTTGATAG 3420

TTTAAGACCA ATGCGATAAG GTTCTTTATT ACCACCTTTT AGTTGTGCAT GCGTATATGT 3480

TATTTGTATG TTTCTTAAAA AAGTACCAGC ATTAAAAACA CGTTGATCGA AATGGTTCGC 3540

ATAGGCCCCA TTTGTCGTTT CAACATGCAG ATACACAGGT TTATGTTCAA AAGAAGCAAG 3600

TAAATCTATA ACTTCTTGTT CTTTAATTGG TTCCAACACG TTCACTCCTT ACACTATCAA 3660

TGTGTTTATC TTTCTATTTT ACTAAAAACT ATTCGATAAT TGTATACGAT TGCTCAATTA 3720

TTTATAAATT AATTTTCATG AAGGGTAATT ACTCAGGATT ACGTAATCAT ACAGCATTAG 3780

TTTTTTACTT TTAAAAATCA AAAATTTGTT GGAATTTGAA AAGTGTTAAA CATTAAAAAT 3840

GATGCTATAT TAATGGTGTA TGAATGAATT CATAAGTTTT TAAAATGTAT TAAATTTGTG 3900

GAGGCATGTA AACAATGAAA GTATTAAACT TAGGATCGAA AAAACAAGCA TCATTCTATG 3960

TTGCATGTGA GTTATATAAA GAGATGGCAT TTAATCAGCA CTGTAAACTA GGTTTAGCAA 4020

CTGGTGGTAC AATGACAGAT TTGTATGAAC AACTTGTTAA GTTGTTAAAT AAAAATCAGT 4080

TAAACGTAGA CAATGTATCC ACGTTTAATT TAGACGAATA TGTAGGTTTA ACCGCATCAC 4140

ATCCGCAAAG TTATCACTAT TATATGGATG ACATGCTTTT CAAACAATAT CCTTATTTTA 4200

ATAGAAAGAA CATTCATATT CCAAATGGAG ATGCCGATGA TATGAATGCG GAAGCGTgCA 4260

AAATATAATG ACGTTTTAGA ACAACAAGGT CAACGTGATA TTCAAATTTT AGGTATTGGT 4320

GAAAATGGTC ATATTGGATT TAATGAACCT GGTACGCCGT TTGATAGCGT TACTCATATC 4380

GTTGATTTGA CTGAAaGTAC TATTAAGGCT AATAGTCGAT ATTTTAAAAA CGAaGATGAT 4440

GTTCCAAAGC AAGCCATTTC GATGGGACTT GCTAATATTC TTCAAGCCAA ACGTATCATT 4500

TTACTCGCAT TTGGTGAAAA GAAACGTGCT GCTATTACAC ATTTATTAAA TCAGGAAATT 4560

TCTGTTGATG TTCCAGCCAC ATTACTTCAC AAACACCCGA ATGTTGAGAT ATATTTAGAC 4620

GACGAAGCTT GCCCGAAAAA TGTTGCGAAA ATTCATGTCG ATGAAATGGA TTGATTGCAA 4680

TGTTTAATTA AGAAATGCCT CGGGAAAGGT TCCAATAGAA AGATAAAAAG CATTGGAAGG 4740

ATGATTTTTA GTGGAATTAC AATTAGCAAT TGATTTATTA AACAAAGAAG ACGCGGCTGA 4800

GTTAGCAAAT AAAGTAAAAG ATTATGTAGA TATCGTAGAA ATCGGTACGC CAATCATTTA 4860

CAACGaAGGT TTACCAGCAG TTAAACATAT GGCAGACAAC ATTAGTAATG TAAAAGTATT 4920
ATTTAAAATA AATATTTATT AAACATTATG AATTTTTAAA GAGTAATGTC TGACTCGTTG 6840

ATAATTTATT TTTGTAAAAA TAAATTAAAG TAATGACAAA GTTATTGAAG TAAATTGAGT 6900

ATAAACATTT AAATACGATG TCGAAAATGG CGATAGCATA TCACTTACAT GAAGTTGTGT 6960

GCtATCGCTA TTTTTAGTTA TAATTCCAAA AAGTTAATCG TTCGATGATT TAAGAATTAT 7020

TATTGTTTAA TTCAAATGTA TGAGGGTATA AAATCATTGA ATTTAATTCG ATAAAGCGAA 7080

ATTTTTGAAC AAACATACTT TTGTATTTAT ATAAAAGTTT AAATTCTTAT AAATTTGACA 7140

AAACTAATTA ACTCCGTATA ATTATGAAAC ATACAAGAGG GAGTGTATGA ATTCATGGAT 7200

TTTAATAAAG AGAATATTAA CATGGTGGAT GCAAAGAAAG CTAAAAAAAC CGTTGTTGCA 7260

ACCGGTATCG GTAATGCAAT GGAATGGTTC GATTTTGGTG TCTATGCATA TAcAACTGCG 7320

TACATTGGAG CGAACTTCTT CTCTCCAGTA GAGAATGCAG ACATTCGACA AATGTTGACT 7380

TTCGCAGCAT TAGCCATTGC GTTTTTATTA AGACCAATTG GTGGTGTCGT ATTTGGTATT 7440

ATTGGTGACA AATATGGACG TAAAGTTGTA TTAACATCTA CAATTATTTT AATGGCATTT 7500

TCAACATTAA CCATTGGATT ATTGCCAAGC TATGATCAAA TTGGACTTTG GGCACCAATA 7560

CTATTATTGC TTGCAAGAGT ACTACAAGGG TTTTCAACAG GTGGAGAGTA TGCGGGGGCA 7620

ATGACATATG TTGCCGAATC ATCTCCAGAT AAGCGTCGTA ACTCATTAGG TAGTGGACTA 7680

GAAATTGGGA CATTATCAGG TTACATAGCT GCTTCAATTA TGATTGCTGT ATTAACATTC 7740

TTTTTAACAG ATGAACAAAT GGCATCATTT GGTTGGAGAA TCCCATTCTT ACTCGGTTTA 7800

TTCCTAGGAT TATTCGGCTT ATATTTACGT CGTAAGCTGG AAGAATCACC AGTTTTCGAA 7860

AATGATGTTG CAACACAACC AGAAAGAGAT AACATTAACT TTTTACAAAT CATCAGATTT 7920

TATTACAAAG ATATATTTGT ATGTTTTGTA GCTGTTGTAT TCTTCaATGT TAC^WVCTAT 7980

ATGGTAACTG CATATTTACC AACCTATTTA GAACAAGTTA TTAAATTAGA TGCAACGACA 8040

ACAAGTGTAT TAATTACTTG TGTCATGGCA ATAATGATTC CATTAGCATT AATGTTTGGT 8100

AAGTTAGCGG ATAAAATAGG TGAAAAGAAA GTATTTCTAA TTGGTACTGG TGGGCTAACA 8160

TTATTCAGTA TCATCGCATT TATGTTATTA CATTCACAAT CATTTGTTGT AATAGTAATC 8220

GGTATATTTA TATTAGGATT TTTCTTATCA ACTTACGAAG CGACAATGCC AGGGTCGTTA 8280

CCAACGATGT TTTACAGTCA TATAAGATAT CGAACTTTAT CAGTAACATT TAATATCTCT 8340

GTTTCGATAT TTGGTGGTaC GaCGCCATTA GTkGCAmCaT GGTTaGTTAC GAAAACTGGA 8400

GATCCATTAG CmCCTGCGTA TTATTTAACA GCAATCAGTG TTATTGGCTT TTTAGTTATT 8460

ACATTCTTAC ATTTAAGTAC AGCAGGAAAA TCTCTAAAAG GTTCGTATCC AAATGTAGAT 8520
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unn^o 1 AAu A 


ATTAGAGATT 


TTAATaAAAA 


GTATAAATCA ATCGTATATA 


AGCACTTTAA 


8640 






TTCTG CT AAC 


TTT AAAGTGC 


X X X X XAAAX X 


VjAVjAAU X vj X A 


ATTAGCCGTA 


8700 


5 


A X AAAWj X X X X 


1 GTATATACA 


TAAACCCCCA 


V» X X Van x 


TATfY!r*AATr* 
Ini^uLAAlu 


GGGGAAAGAG 


8760 




VjVjVjAv. X X AAA 


<j CATATGl 1 1 


AGCTTTGAAT 


AVwX XAAAAX X 


\-iv.i lvv,lAl 


TGAAATGTTA 


8820 




vjvj A ibi AAA 1 


A i G lull AG A 


GTATTTTGTC 


CAAmrAATT 

WviLULAnl X 


A AT HTTP A^ A 
AAX AX 1 


CTCTAACCTT 


8880 


10 


PZi ATA TT1TT 

AA X AX X AX 1 


TV TV ^ A /" * TV tv j^*, 

ATAGAGAACA 


CAAACTTAAA 


X /-iVj/"! X X VjVjO X 


vjAL X X A 1 1 1 G 


TGTCAGTTAT 


8940 




«A1 X uLu 


ATAACTTCTT 


TTCTCTATAT 


A. AT A T A. f2T a 
t\\~t\ X n X nu X A 


A r*!** T/^ r P r P 7A 
Al— .^j 1L1 1 A I L- 


TAATAAAAAA 


9000 


IS 


V^AXuOlAL X A 


CAGTATCAAA 


TTTATCTAGG 


wU X X AAVj X X X 


uAl l ITi ATA 


ATAGGCAGGT 


9060 




1 IALCTGATA 


AAAATACTTA 


TTCATTATAT 


A ATYJTTA aril 
AAXoX X AAL-A 


ATATGTATTT 


TAAAGTTTAC 


9120 




TV »prp/*i J% /"'"TV » 7V P 

A I 1GAGTGAG 


GGATATTGAT 


GAACGTAATT 


TT An A A C* 7X /""T 
X X AuAALAu X 


XGAAAACACA 


TACTCAAAAT 


9180 


20 


Tt Ml f~*t~ m 1 1 A TV TV^ 

AAACCJTAATG 


ACATAGCATT 


ACATATCGAT 


P ATr* A A Af*ft A 
vj A X unAALAA 


1 1 Aua 1 A TAG 


TCAACTAAAT 


9240 




G (_ <_ L. G CAT CA 


CTAGCGCAgT TGAATCTTTG 


vJiuAAAiAi 1 


LAL1TAACCC 


TGTCGTTGCT 


9300 




ATTAATATGA 


AATCAC CGGT 


/""* AAA /"*T^ A T"? 

G LAAAGTATT 


All X Lj 1 1 A 1 1 


TAGCTTTGCA 


TCGTTTACAT 


9360 


25 


AAAGTGC CTA 


TGATGATGGA 


AGGTAAATGG 


PA A APTA/T JV 
I— AAAG 1 AC 1 A 


HTA ^**A 'T^/^/'^' rr^/^ TV 

1 ACATCGTCA 


ATTGATTGAA 


9420 




AAA r T^ TV fTV ^O 1 * TV 

AAATATGGTA 


TTAAAGATGT 


AATTGGAGAT 


A\ — Avjtaj 1 1_ X V_A 


rpf TV TV TV rr-l TV Ti 

1 GCAGAATAT 


AGACTCACCG 


9480 




Al G 1 xTAnG 


ATTCAACGCA 


ATTACAGCAC 


lALLtLftAl I 


A 'I'M* A TV rrt Tv m 

TA1 1ACATAT 


TGGTTTTACT 


9540 


30 


' A /"■*/■*• TV Z'"' "TV TV 

TCAGGGACAA 


CTGGACTGCC 


AAAAG CAT AT 


X A X VwVj X VjA X VJ 


AAGA I 1 CATG 


GTTGGCTTCT 


9600 




111 bAAGTTA 


ATGAAATGTT 


GATGTTAAAA 


AATf2A A A A*m 
*"w"V X \ J AAAA X Vj 


LAAiAbv^vjv, 


CCCTGGACCA 


9660 


35 


Vw X AX LbLAL 1 


CGTTAACATT 


ATATGCGTTA 


x lull i X X 


X AAvj X XCvJvtvj 


TCGTACTTTT 


9720 


ATAGGACAGA 


CCACTTTTCA 


TCCTGAAAAG 


"l"l"A f' w l w I ' A & TP 
X X nL X 1AA1L 


A RTPTP A T A A 
AAlblUAI AA 


AATATCATCA 


9780 




TACAAAGTTG 


CTATGTTTCT 


TGTTCCAACG 


ATYIATTA h AT 
AXVXAX XAAAX 


LAI 1 A 1 IGl 1 


AGTTTACAAC 


9840 


40 


AATGAACATA 


CAATCCAATC 


ATTTTTTAGC 


AW X StOAVjA X A 


AGL XAjvJAl X L. 


TTCTATTTTT 


9900 




AAAAAGATAA AAAATCAAGC 


AAATGACATA 


AAX X XuAX XVj 


AA 11111 I\jG 


TACATCGGAA 


9960 




ACCAGTTTTA 


TCAGCTATAA 


CTTGAATCAG 


LAnuLnL L-Ao 


1 loAA 1 CAGT 


AGGTGTGCTA 


10020 


TTTCCAAATG TGGAATTGAA AACAACGAAT CACGATCACA ATGGTATAGG AACTATTTGT 10080

ATAAAAAGTA ATATGATGTT TAGTGGCTAT GTAAGTGAAC AATGTATAAA TAATGATGAA 10140

TGGTTTGTTA CTAATGATAA TGGCTATGTA AAAGAGCAGT ATTTATATTT AACGGGACGT 10200

CAACAGGATA TGTTAATTAT TGGTGGTCAA AATATATATC CAGCACATGT TGAACGCCTT 10260

TTAACGCAAT CTTCGAGCAT TGATGAAGCA ATTATCATCG GTATTCCAAA TGAGCGTTTT 10320
CAATTTTTAA AAAAGAAAGT GAAaCgnTaT GAAATTCCAT CGATGATTCA TCATGTAGAA 10440

AAGATGTATT ACACTGCAAG tGGTaAAATT GCTAGAGAAA AAATGATGTC GATGTATTTG 10500

AGAGGTGAAT TATAATATGA ATCAAGCAGT CATAGTTGCA GCTAAACGAA CTGCATTTGG 10560

GAAATATGGT GGCACTTTAA AACATTTAGA GCCaGAACAA TTGCTTAAAC CTTTATTCCA 10620

ACATTTTAAA GAGAAGTATC CAGAGGTAAT ATCTAAAATA GATGATGTAG TTTTAGGTAA 10680

TGTTGTTGGG AATGGTGGCA ATATTGCAAG AAAAGCATTG CTTGAAGCGG GGCTTAAAGA 10740

TTCAATACCT GGCGTCACAA TCGATCGGCA ATGTGGGTCT GGACTTGAAA GTGTTCAATA 10800

TGCATGTCGC ATGATCCAAG CCGGAGCTGG CAAGGTATAT ATTGCAGGTG GTGTTGAAAG 10860

TACAAGTCGA GCACCTTGGA AAATCAAACG ACCGCATTCT GTGTACGAAA CAGCATTACC 10920

TGAGTTTTAT GAGCGTGCAT CATTTGCACC TGAAATGAGC GACCCATCAA TGATTCAAGG 10980

TGCTGAAAAT GTGGCCAAGA TGTATGATGT TTCAAGAGAA TTACAAGATG AATTTGCTTA 11040

TCGAAGTCAT CAATTGACAG CGGAAAATGT AAAGAATGGA AATATTTCTC AGGAAATATT 11100

ACCTATAACC GTTAAAGGAG AAATATTCAA CACTGATGAA AGTCTAAAAT CACATATTCC 11160

GAAAGATAAC TTTGGCCGAT TTAAGCCCGT GATCAAAGGT GGGACCGTTA CCGCTGCGAA 11220

TAGTTGTATG AAAAATGATG GTGCAGTTTT ATTGCTTATT ATGGAAAAAG ATATGGCATA 11280

CGAATTAGGT TTCGAGCATG GTTTATTATT TAAAGATGGT GTTACGGTAG GTGTTGATTC 11340

TAATTTTCCT GGCATTGGTC CAGTACCAGC CATTTCCAAC TTACTAAAAA GAAATCAATT 11400

AACGATAGAA AATATTGAAG TCATTGAAAT TAACGAAGCG TTCAGTGCAC AGGTAGTTGC 11460

CTGCCAACAA GCTTTAAATA TTTCAAATAC GCAATTAAAT ATATGGGGTG GTGCATTAGC 11520

ATCAGGTCAT CCATACGGTG CAAGCGGTGC CCAATTAGTG ACTCGATTAT TTTATATGTT 11580

TGACAAAGAG ACTATGATTG CATCTATGGG GATAGGGGGA GGTCTAGGAA ATGCAGCATT 11640

ATTTACTCGA TTCTAACCAG CGATTAAATG TGTCATTTTC TAAGGATAGT GTGGCTGCAT 11700

ATTATCAGTG TTTTAACCAA CCTTATAGAA AAGAAGTACC ACCATTAATG TGTGCGTCAT 11760

TATGGCCAAA ATTTGATTTA TTTAAAAAAT ATGCAAATAG CGAACTGATT TTAACAAAAT 11820

CAGCAATTAA TCAAACTCAA AAGATAGAAG TAGACACAAT ATATGTAGGG CATTTAGAAG 11880

ATATTGAATG CCGACAGACT CGCAATATCA CACGTTATAC AATGGCTTTA ACATTAACTA 11940



AAAATGATCA ACATGTCATA ACGGTtACAC AAACTTTTAT TAAGGCGATG AAGTAGAGAT 12000 
GGAGTTTAAT GAGATATGGA TAAATGAATA TTTGGCGCTC GTAAATGATG ATAATCCAAT 12060
ACATAATGAG ATTGTGCCAG GACAATTAGT GAGTCAAATG ATGCTGATGG CTATGTCATT 12120 
ATTCATTGAA CAACACGAAC ACGAAATTAT AGCAATTAAT GACGATGGAG AGATTAAAAT 12240

AAAAATTTCT TTGAGCACAA AAAAATAACC GATATTAGCT GCATGAACGC ATATTAATTA 12300

GGAGATGAAA GGACAGCTAA TATCAGTTAT GTATTGTTAT TATTATTGGG AACAGAGATG 12360

AATATAGGTT ACGTTTCTTT CTTTGCACGG GGATGCATTA ATCTAAAATA ATAATAACAA 12420

CTATATCAAT GTTTAATAAA TTCTGGATTA TTGGAACGAT TAGTCAATTT AACTAACTTT 12480

CATATGATCT ATATCGTCTT GTAATAAAGA GAGCAATTTG AATATTTCAG TATCACTAAA 12540

TGAATCGTCA CATTTAATTG AAACATGCTG AAACGTTTTG GTTATAATTT CATAAACTGG 12600

TGCGCCTTCA TGGTGATACT GTCGATAAAT AATCATAACC TATATTACCT CCTTTGCTAC 12660

TCTATGGTTA TATTATAAAT AACATTTTTA TGTGTGACAT CAACCTTAAG TATCAACTTT 12720

TTATCAGACA TAGAACGTAT GATTTACTAA GACTATTTAT GTATAAAAGT TCTAAATAAA 12780

TATATATTTA TAGAGTCGCC TGGCAGTCAT TTGGGaAATA TAACATATAT GATTAGAGAG 12840

GCATCTATCG CAAAAGAATG ATAATGATAG AGGTATTGAG CATATAGATG AGTTTAAGTT 12900

CATCTTGAAA ATAAAGGGTT ATTTAGTCAT AGATGTAGAT GTATAGGAAA TATTTGTATG 12960

TATTGTTCGA TATGTATGAA ATTTTCAATA AAAGCTAATA ACGCTTATAT GTAACTTTCA 13020

AATTTAAATT ATATACAGAG CATGATGATT ATAAAAAAAT AACCACATCA CATAAATTGA 13080

GTTCATACCC AATTTAAGTG GTGTGGCTAA TAATGTTGAT TTATAGATGA ACCGCCTAAT 13140

CGTTAAACCT CTGTTACTTC AACATCGATA TGTTCAATAC GGTTGTATGC ACCGTGATCC 13200

ACAGGACCAA CAAAATCATT CATTTTCCAA CCGTTTTTAA TAGCAGAAGC GACGAAAGCT 13260

TTCGCGCTAA TCACAGCTTC TTTCGGTGAC TTACCGTTAG CTAAATATGC AGTTGTTGCC 13320

GCAGCAAATG TACAACCAGC ACCATGGTTA TAACTTTGTT GGAACATGTC TGTTGTTAGT 13380

TGATAAAATG TTTGACCATC ATAGTATAAG TCATACGATT TATCTTGATC TAAAGCTTTG 13440

CCACCTTTAA TGATGACATG CTGTGCGCCT TTATCAAAGA TAATTGTTGC AGCCTTTTTC 13500

ATATCTTCAA TTGAATTTAA TTTACCTAAT CCTGATAATT GACCCGCTTC AAATAAGTTT 13560

GGTGTCACTA CCGTTGCTTT AGGTAGTAAA TATTTAATCA TCGCCTCAGT ATTTCCAGGA 13620

TTAAGCACTT CATCTTCGCC TTTACAAACC ATGACAGGAT CTACTACAAA ATATTGTGCA 13680

TTAGATGCCT CATATACTTC TCCAGCACGT TTGATTATCT CCTCAGTACC TAACATACCT 13740

GTTTTAATAG CATCAGGTCC GATTGATAAA GCCGTTTCAA GTTGTTTTTC AAATACATCC 13800

ATTGGTAATG GTGTAACATC GTGTGACCAT GTATCTTTAT CCATAGTAAC GATGGCAGTT 13860

AAAGCGACCA TGCCATACGT ATCTAATTCT TGGAACGTTT TCAAATCTGC TTGCATACcT 13920
CACTCCTACA TAATAATATT GTATTCATCA TATCATTTTT AACCTAATTG AAAAATATTA 14040

AGCATTCAAT ATTTGATGAT TGTTGAAATG AATCATTCAT ACTATTGTAA CTTTTGAAAA 14100

TGTCATTCAC TTTAGATAAG TGTGATATGT TAAAATATGT CCTGAGGTGA GATTGAATGG 14160

AATGGTCGCA AATTTTTCAT GACATAACAA CGAAACATGA CTTTAAAGCT ATGCATGATT 14220

TTTTAGAAAA AGAATATTCG ACTGCAATCG TATACCCTGA TAGGGAAAAT ATATATCAAG 14280

CGTTTGATTT AACACCGTTT GAAAATATCA AAGTTGTTAT ATTAGGACAA GACCCGTATC 14340

ATGGTCCAAA CCAAGCACAT GGATTAGCAT TTTCAGTGCA ACCTAACGCA AAATTCCCTC 14400

CATCTTTACG TAATATGTAT AAAGAATTAG CAGATGATAT TGGATGCGTT AGACAAACAC 14460

CGCATTTACA AGATTGGGCA AGAGAAGGCG TCTTGTTATT GAATACAGTT TTAACCGTAA 14520

GACAGGGTGA AGCAAATTCT CATCGTGATA TTGGTTGGGA AACATTTACT GATGAAATTA 14580

TTAAAGCAGT GTCTGATTAT AAAGAACATG TTGTCTTTAT TTTGTGGGGG AAACCTGCAC 14640

AGCAAAAAAT AAAGCTTATC GATACATCTA AACATTGTAT TATAAAATCA GTGCATCCTA 14700

GTCCACTGTC TGCATATAGA GGATTCTTTG GATCAAAACC GTATTCCAAA GCGAATGCCT 14760

ATTTAGAGTC AGTAGGAAAA TCACCAATTA ATTGGTGTGA AAGTGAGGCG TAGATGTTGA 14820

ATAGAGAAAC TTTAATAGCA CGAATTGAGC AAGAATTAGT ACAAGCAGAG CAGGCACAGC 14880

ATGACCATGA CTTTGAAAAA CATATGTATG CCATACATAT ATTAACATCT TTATATGCTT 14940

CAACATCAAA TACACCACAT ATTGGTGAAC AACAAATGAA TCGTCGTATT GCTAACCATA 15000

ATCAAATGCC ACAATCACAA ATAACGCAGC CAACTCATCA AGTGACAGTT GCTGAAATTG 15060

AAGCGATGGG TGGTAAAGTA AATACGCATT CAGCACATCA TCATAATAAG TCATATTCAC 15120

AACCTTCAAA CCAACAACAA AGATTAGCGA CAGATGATGA CATTGGCAAT GGTGAATCCA 15180

^ATTSrGATTT TTAAAAAGCA ACAATGAAAC ATAATTACTT AATAGCTTGT TAAGTATGTA 15240

GGTTAATAAT CAAGACGCAT ATACTTTTAT TCGAGTGTTC GGATTTAAAC ATTTATTAAT 15300

ACTGAATTAT ATAAGGAGAG GTAGCAATGA AATTATTTAT TATTTTAGGT GCATTAAACG 15360

CGATGATGGC TGTCGGTACA GGTGCATTTG GTGCGCATGG TTTACAAGGA AAAATAAGTG 15420

ATCACTATTT ATCAGTATGG GAAAAAGCAA CGACGTATCA AATGTACCAT GGCTTAGCAT 15480

TATTAATTAT AGGTGTAATT AGTGGTACAA CTTCAATCAA TGTTAACTGG GCTGGCTGGT 15540

TAATATTTGC TGGTATTATT TTCTTTAGTG GATCATTATA TATTTTAGTA TTAACTCAAA 15600

TTAAAGTTTT AGGTGCGATT ACGCCAATTG GTGGCGTATT GTTCATCATT GGATGGATAA 15660

TGTTAATCAT TGCGACATTC AAATTTGCTG GTTAAATTTT AAAACTTTAG ATTACCTATG 15720
TGGGTATAGA ATACCTTCGA GGTGAGTTTT TATTTATGGA AAAAAAGAAT AAGCAAATAG 15840

ATAGAGGCGA TTTAAAACAA AACCTATCTG AAAAGTTTGT ATGGGCGATT GCATATGGTT 15900

CATGTATCGG ATGGGGCGCA TTCATCTTAC CAGGAGACTG GATTAAGCAG TCAGGTCCGA 15960

TTGCAGCATC AATTGGTATA GTTATTGGTG CATTATTAAT GATATTAATT GCGGTTAGTT 16020

ATGGCGCATT AGTAGAGAGA TTTCCAGTAT CAGGGGGCGC GTTTGCCTTT AGTTTCTTAA 16080

GTTTCGGCAG ATATGTGAGT TTCTTCTCAT CATGGTTTTT AACTTTTGGT TATGTCTGTG 16140

TCGTTGCTTT AAAtGCGACC GCATTCAGTT TACTAGTTAA ATTCTTATTG CCAGATGTCT 16200

TAAATAATGG GAAACTATAC ACCATTGCGG GCTGGGACGT TTATATTACG GAAATCATTA 16260

TTGCGACCGT ATTACTACTT GTATTCATGC TAGTAACGAT TCGTGGCGCA AGTGTATCTG 16320

GATCATTACA ATATTATTTC TGTGTGGCGA TGGTAATCGT CGTATTATTG ATGTTCTTTG 16380

GTTCATTCTT TGGTAATAAT TTTGCACTTG AAAATTTACA ACCGTTAGCT GAACCTAGCA 16440

AAGGATGGTT AGTGTCTATT GTGGTTATTG TATCCGTGGC ACCATGGGCA TATGTTGGAT 16500

TTGATAATAT TCCACAAACA GCAGAAGAGT TTAACTTTGC ACCAAACAAG ACATTTAAGC 16560

TTATCGTGTA CAGTTTATTA GCAGCATCAT TAACTTATGT TGTCATGATT TTATACACTG 16620

GTTGGTTATC AACAAGTCAT CAAAGTTTAA ATGGGCAGTT GTGGTTAACA GGTGCTGtTA 16680

CACAAACAGC ATTTGGTTAT ATTGGATTAG GTGTATTAGC AATTGCAATT ATGATGGGTA 16740

TATTTACTGG TTTAAATGGA TTCTTGATGA GTTCAAGTCG CTTGTTATTT TCTATGGGAC 16800

GTTCAGGTAT TATGCCAACA ATGTTTAGTA AATTACATAG TAAATACAAA ACACCATATG 16860

TCGCAATCAT ATTCCTAGTA GGAGTGTCGT TAATTGCACC TTGGCTAGGA AGAACTGCAT 16920

TGACTTGGAT TGTAGATATG TCATCTACTG GTGTATCCAT TGCCTACTTT ATTACATGTT 16980

TGTCTGCAGC GAAATTATTC AGTTATAACA AACAAAGTAA TACGTATGCA CCGGTTTACA 17040

AAACGTTTGC TATTATCGGC TCATTTGTAT CATTCATTTT CTTAGCGTTG TTATTAGTGC 17100

CAGGTTCTCC TGCAGCACTG ACTGCACCGT CTTATATTGC ATTACTTGGA TGGTTAATCA 17160

TCGGTTTAAT ATTCTTTGTG ATTCGATATC CTAAATTGAA AAATATGGAT AATGATGAAT 17220

TAAGTCGCTT GATTTTAAAT AGAAGTGAAA ATGAAGTTGA TGATATGATT GAAGAACCTG 17280

AAAAAGAAAA AACTAAATAA TAAAAGAATC GCACAATAAA CCTTCTTCAT TCGGAGGCGT 17340

ATCGTGCGAT TTTTTGTATT ATAAATTGAC ATTTAAGACG AGGCAGCTGA ACCTTATATA 17400

TAATTGCTAA GAGTTAGGGC TGAGCCATTT CTAACAAATA TTTATAATCG TTTAAAAGAT 17460

TTCACGAACC CAGAAACAAT TAATTTGGAA ATTTGGTCGG CGAATAATAA ACCTAATGCG 17520
AAGACTAAAT TTTTTGTAGC ATCGTATGCT AAGCCACCAG GTACTAATGG AATGATACCC 17640

GTTACCATAA AAATGATGGC AGGTTCTTTT TGTTTACGAG CCATATAATG ACTTAACAAG 17700

CCTAATGCTA AACTACCAAA GAAACTAGAG TATATAGTGT GCACATTAAA GCCGTTGAAG 17760

AATAAGGTGT AAACCATCCA TCCACACGTA CCAACGAAAC CACATGATAG ATATAATTTT 17820

CTAGGTGCAT CAAAAATGAC GCAGAA 17846


(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5544 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:








ATTGACACTT GGTGAAAGTA ATATCGCCGC GCTATTTTGG CAAAATGGAC ACTTAGAACC 60

TGAGTTACAA GATGAACAGC CAATTAATAT ATTAGGATCT GkTCAAATCA ACGAATGGAA 120

TGGTAATCAA TCACCGCAAA TAATTATTCA AGATATTGCG ATGAATGAAC AGCAAATATT 180

AGATTATAGA AGTAAGCGAA AAAGTTTACC TTTTACAGAA AATGATGAAA ATATTGTCGT 240

GCTTATTCAT CCTAAAAGTG ATAAAGTAAA TGCGAATGAA TATTATTATG GTGAAGAAAT 300

TAAACAACAA ACTGATAAAG TAGTATTAAG AGATTTACCA ACGTCAATGG AAGACTTGTC 360

TAATTCCTTG CAACAACTGC AATTTTCTCA ACTTTATATA GTTTTGCAAC ATAATCATTC 420

GATTTACTTC GATGGTATAC CTAATATGGA TATTTTTAAA AAGTGTTATA AAGCATTAAT 480

AACTAAACAA GAAACAAATA TCCAGAAAGA GGGTATGTTA TTGTGTCAAC ATTTAAGTGT 540

GAAACCAGAT ACACTTAAAT TCATGTTGAA AGTTTTCTTA GACTTAAAAT TTGTAACACA 600

AGAAGATGGT TTAATTCGAA TCAATCAACA ACCTGATAAA AGATCGATTG ATTCCAGCAA 660

AGTATATCAA TTAAGACAAC AACGTATGGA TGTTGAAAAG CAATTATTAT ATCAAGATTT 720

TTCAGAAATA AAAAATTGGA TAAAGTCACA ATTGTCGTGA GCAATTTAGG AGGAAATATT 780

AATGGATTTA AAGCAATACG TATCAGAAGT TCAAGATTGG CCGAAACCAG GTGTTAGTTT 840

CAAGGATATT ACTACAATTA TGGATAATGG TGAAGCATAT GGCTATGCAA CAGATAAAAT 900

TGTAGAATAC GCAAAAGACA GAGATGTTGA TATCGTTGTA GGACCTGAAG CGCGTGGCTT 960

TATCATTGGC TGTCCTGTAG CTTATTCAAT GGGGATTGGC TTTGCACCTG TTAGAAAAGA 1020

AGGGAAATTA CCTCGTGmAG TCATTCGTTA TGAGTATGAC CTAGAATATG GTACAAATGT 1080
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w 



ATTAGCTACT GGTGGTACGA TTGAAGCAGC AATAAAATTA GTTGAAAAAT TAGGCGGTAT 1200

CGTAGTAGGT ATTGCATTTA TAATTGAATT GAAATATTTA AATGGTATTG AAAAAATTAA 1260

AGATTACGAT GTTATGAGTT TAATCTCATA CGACGAATAA TAAATAATAT AATTTTATCA 1320

AATGAAATCC TTCATCAAAT GTATAAGAAC CAATGACTTA ATTAAAAAAG TTGTTTAAGT 1380

TTTCTTAACA TGAGATGTTA GGATTTTTTA TTTACTGAAA ATGTTAGATG ATTGAGCATT 1440

ATACCTTAAT AACATCGTTT ATTTATTTCA TAAATTGTAG TATCATAGAA CTAATATTTA 1500

AAAAATGAAA CAGTAGATTT AGGTCGAATT TTTGTAAAAG TTTTAAAAGT AGGAATAGTA 1560

TACAAATTAA ACTCGCTCAA GTAAAATTAA TATTACGATT AATGACGACA GGATAAATAT 1620

TTATCGTCGA CGGACGTATG ATTGGTGTGG GACAAATACT ATTCAACAAG AGTACCTAAA 1680

TCATTGTTTA AGGCGAAGTA ATAAATATGA ATGGGGTGTA TCATATAATG AACAACGAAT 1740

ATCCATATAG TGCAGACGAA tTCTTCACAA AGCAAAATCA TATTTGTCAG CAGATGAATA 1800

TGAGTATGTT TTAAAAAGCT ATCATATTGC TTATGAAGCA CATAAAGGTC AGTTCCGAAA 1860

AAACGGATTA CCATACATTA TGCATCCTAT ACAAGTTGCA GGTATTTTAA CAGAAATGCG 1920

ATTAGACGGA CCGACGATTG TCGCAGGTTT TTTGCATGAT GTAATTGAAG ATACACCGTA 1980

TACATTTGAA GATGTAAAAG AAATGTTCAA TGAAGAAGTT GCTCGAATTG TTGATGGTGT 2040

GACGAAGCTT AAAAAAGTAA AATACCGCTC AAAAGAAGAA CAACAAGCTG AAAATCATCG 2100

CAAGTTATTT ATTGCGATTG CCAAAGATGT ACGCGTAATT TTGGTGAAAT TAGCAGACAG 2160

ATTACATAAT ATGCGTACCT TGAAAGCCAT GCCGCGCGAA AAACAAATTA GAATTTCTCG 2220

AGAAACATTA GAAATTTATG CACCATTAGC ACATCGTCTT GGTATTAATA CAATCAAATG 2280

GGAACTAGAA GATACGGCTC TTCGTTATAT TGATAATGTG CAATATTTTA GAATAGTCAA 2340

TTTAATGAAG AAGAAACGTA GTGaACGTGA AGCGTATATC GAAACGGCTA TTGATAGAAT 2400

ACGTACTGAA ATGGACCGAA TGAATATCGA AGGCGATATA AATGGTAGAC CTAAACATAT 2460

TTACAGTATT TATCGGAAAA TGATGAAGCA GAAAAAACAA TTTGATCAAA TTTTTGATTT 2520

GTTGGCGATA CGTGTTATTG TCAATTCTAT TAATGATTGT TATGCGATAC TTGGGTTGGT 2580

GCATACGTTA TGGAAACCGA TGCCAGGACG TTTTAAAGAT TATATTGCAA TGCCTAAACA 2640

AAATTTGTAT CAGTCATTGC ATACTACAGT AGTAGGCCCA AATGGAGACC CGCTCGAAAT 2700

CCAAATACGA ACGTTTGATA TGCACGAAAT TGCTGAGCAT GGTGTTGCAG CACACTGGGC 2760

TTACAAAGAA GGTAAAAAAG TAAGTGAAAA AGATCAAACT TATCAAAATA AGTTAAATTG 2820

GTTAAAAGAA TTAGCTGAAG CGGATCATAC ATCGTCTGAC GCTCAAGAAT TTATGGAAAC 2880
TGAGTTGCCA TATGGTGCTG TGCCGATTGA TTTTGCTTAT GCGATTCACA GTGAAGTAGG 3000

TAATAAGATG ATTGGTGCCA AGGTGAATGG CAAAATTGTA CCAATTGACT ATATTTTACA 3060

AACAGGCGAT ATTGTTGAAA TACGTACTAG TAAACATTCA TATGGACCAA GTCGTGATTG 3120

GTTGAAAATT GTTAAATCGT CTAGTGCCAA AGGTAAAATT AAAAGTTTCT TCAAAAAACA 3180

AGATCGTTCA TCTAATATTG AAAAAGGCCG AATGATGGTT GAAGCTGAAA TAAAAGAGCA 3240

AGGATTTAGA GTCGAAGATA TTTTGACAGA GAAAAATATT CAGGTTGTTA ATGAAAAATA 3300

TAACTTTGCA AATGAAGATG ATTTATTCGC AGCTGTAGGA TTTGGCGGCG TGACATCCTT 3360

ACAGATTGTT AATAAATTAA CTGAAAGACA ACGTATTTTA GATAAACAAC GTGCTTTAAA 3420

TGAAGCACAA GAAGTTACGA AATCATTGCC TATTAAAGAC AACATCATTA CTGATAGTGG 3480

TGTCTATGTA GAAGGTTTAG AAAATGTACT TATCAAGTTG TCAAAATGTT GTAATCCTAT 3540

ACCaGGTGAT GATATTGTAG GTTATATCAC CAAAGGTCAC GGTATTAAAG TACATCGCAC 3600

TGATTGCCCA AATATTAAGA ACGAAACTGA ACGACTAATT AATGTTGAAT GGGTAAAATC 3660

AAAAGACGCA ACTCAAAAAT ATCAGGTTGA TTTAGAGGTA AtGCGTATGA CCGAAATGGC 3720

TTGTTGAATG AAGTACTACA AGCTGTTAGC TCGACAGCCG GCAATTTAAT TAAAGTTTCA 3780

GGACGTTCAG ATATTGATAA AAATGCAATA ATAAATATTA GTGTCATGGT GAAAAACGTG 3840

AATGATGTTT ATCGTGTGGT AGAAAAGATC AAACAACTTG GTGATGTTTA TACAGTAACA 3900

AGAGTTTGGA ACTAGAGGTG CAAAATATGA AAGTAGTTGT ACAAAGAGTT AAAGAAGCAT 3960

CGGTGACGAA TGATACATTA AATAATCAAA TCAAAAAAGG ATATTGTTTA TTAGTCGGTA 4020

TCGGTCAGAA CTCTACAGAG CAAGATGCAG ATGTAATTGC AAAGAAAATT GCTAATGCAA 4080

GATTATTTGA AGATGACAAT AATAAATTAA ACTTTAATAT CCAACAAATG AATGGTGAAA 4140

TAC^TCAGT TTCACAATTT ^CTCTCTATCf CAGMGTAAA AAAAGGTAAC CGTCCAGGTT 4200

TCTCAAATTC TAAAAATCCT GATCaAGCGG TAAAAATTTA TGAGTATTTT AATGcaTGCG 4260

CTACGAGCGT ATGGTCTTAC TGTGAAAACA GGTGAATTTG GAACACACAT GAATGTTAGC 4320

ATAAATAATG ATGGTCCAGT CACTATTATT TATGAAAGTC AGGACGGCAA AATTCAATGA 4380

AAAAAATAGA GGCATGGTTA TCTAAAAAGG GTCTTAAAAA TAAACGTACT CTAATAGTAG 4440

TGATTGCCTT TGTCTTATTT ATCATCTTTT TATTTTTATT GCTGAATAGC AATAGTGAAG 4500

ATAGTGGGAA CATCACGATA ACTGAAAATG CTGAATTACG TACAGGTCCA AACGCTGCGT 4560

ATCCAGTCAT ATATAAAGTT GAAAAAGGTG ACCATTTTAA AAAGATTGGT AAAGTAGGTA 4620

AATGGATTGA AGTTGAAGAT AGATCCAGTA ATGAAAAAGG TTGGATAGCT GGATGGCACA 4680
TAGTGCTTGA TCCTGGTCAT GGAGGTAGTG ACCAGGGTGC TTCAAGCAAT ACTAAATATA 4800

AAAGTTTAGA AAAAGATTAT ACGTTGAAAA CAGCAAAAGA ATTGCAGCGT ACTTTAGAAA 4860

AAGAAGGCGC AACTGTTAAG ATGACAAGAA CAGACGATAC ATATGTTTCA CTAGAAAATC 4920

GTGATATCAA AGGCGATGCC TATTTGAGTA TACATAATGA TGCGTTAGAA TCATCTAATG 4980

CAAATGGAAT GACaGTTTAT TGGTATCATG ATAATCAAAG AGCTTTAGCA GATACGTTAG 5040

ACGCTACGAT TCAGAAGAAA GGTCTACTTT CTAATCGCGG TTCAAGACAA GAAAATTATC 5100

AAGTGTTAAG ACAAACAAAA GTTCCTGCTG TTTTATTAGA ATTAGGTTAT ATTAGTAACC 5160

CAACTGATGA AACGATGATT AAAGATCAAT TACATAGACA AATTTTAGAA CAAGCAATTG 5220

TTGATGGCCT TAAAATTTAT TTTTCTGCGT AGGGCTTGCA AAAATATGTG AAAGTAGTTA 5280

TCATTGATAT TGAATTTTAT AACTAAAACC GTTAGTATTC TTGAAATGGT AAATGAAATA 5340

GGTAGCAATC TAACTAAGAT TGTGTAGGAA TATAATCCAT AGACTGAAAG ATTATGCTGA 5400

GTAGTTTATA TACATTGAAC ACAAGAAGAG GTGCTTTATG AAAAGTAAAG CCGTTAAACG 5460

TACGTTaAAC GTTTTGAGTG GGTTTATTAA ATGCACGCTT ATAAAAAGTA ATGATGATTA 5520

CAATTAGGCA TGTTTTTTAA ACCA 5544
(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1067 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

AAAAGATTGC AAATATAAAT GGCATGTTTA ATATGTTAGA ACAACAAATC ATTCATAGCC 60 

AAGATATGGC TCATTTTAGA AGTGAATTTT TTTACGTCAA TCATGaGCAT CGAGAAAACT 120

ATGAAgCACT CCTAATTTAT TACAAAAATA GTATCGACAA TCCTATTGTA GATGGTGCAT 180

GTTATATTTT AGCCCTACCT GAAATTTTCA ATAGTGTTGA TGTTTTCGAA TCAGAGTTAC 240

CATTTTCATG GGTATATGAT GAAAATGGCA TTACCGAAAC AATGAAATCA CTTAGCATTC 300

CATTACAATA TTTAGTTGCA GCAGCTTTAG AAGTAACTGA TGTGAATATA TTTAAGCCTT 360

CAGGATTTAC AATGGGAATG AATAATTGGA ATATTGCTCA AATGCGAATC TTTTGGCAAT 420

ATACAGCAAT TATTAGAAAA GAAGCACTAT AACATTAATA ATTAATTAGC TATAAAGATG 480

ATTCACAACA ATCATCTTTA TAGCTTTTTT ATGTCTAATT ATTTTTGAGG AAAATmACAA 540 
AATTTTATGT TTTCAAAAGT AAACAATCAA AAGATGTTAG AAGATTGCTT CTATATAAGA 660

AAGAAAGTGT TTGTAGAAGA ACAAGGCGTC CCTGAGGAAA GTGAAATTGA TGAATATGAA 720

TCTGAATCTA TTCACCTCAT TGGATATGAT AATGGACAGC CAGTTGCCAC TGCTCGAATA 780

CGCCCTATTA ATGAAACAAC TGTCAAAATA GAACGAGTAG CTGTGATGAA ATCACATCGT 840

GGACAAGGAA TGGGTAGAAT GCTTATGCAA GCTGTAGAAT CATTAGCTAA AGATGAAGGT 900

TTTTACGTAG CTACTATGAA TGCCCAATGT CATGCTATCC CATTTTATGA AAGTTTAAAC 960

TTTAAAATGA GAGGTAATAT ATTTCTTGAG GAAGGCATCG AGCATATTGA AATGACAAAA 1020

AAGTTAACCT CGCTTAATTA AAAAAAGTTG TATCTATTTT AGAAACA 1067

(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18613 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:








AAGACGtAtG ATAACAACAA TACgTGTAGT GAAAGATTTT AATCTACATA TTACTGACAA 60

AGAATTCATT GTATTTGTTG GACCATCGGG ATGTGGTAAA TCAACAACAT TACGAATGGT 120

TGCTGGACTA GAGTCTATCA CATCTGGAGA TTTTTATATT GATGGGGAAC GCATGAACGA 180

TGTTGAACCA AAGAATAGAG ATATTGCGAT GGTATTTCAA AACTATGCAT TATATCCACA 240

TATGACTGTT TTTGAAAATA TGGCATTTGG GCTAAAGCTA CGTAAAGTAA ATAAAAAAGA 300

GATTGAACAA AAAGTTAATG AAGCAGCTGA AATATTAGGA TTAACTGAGT ATCTTGGTCG 360

TAAACCAAAA GCGTTATCTG GCGGACAGCG TCAACGTGTT GCTTTGGGCA GAGCTATTGT 420

TAGGGATGCG AAAGTCTTTT TAATGGATGA ACCATTATCG AATCTTGATG CGAAyTtCGA 480

GTACAAATGC GCACAGAAAT ATTGAAATTA CATAAGCGAC TTAATACTAC GACAATTTAT 540

GTTACACATG ATCAAACTGA AGCATTGACG ATGGCTAGTC GAATTGTTGT TTTGAAAGAT 600

GGCGACATTA TGCAAGTCGG CACACCTAGA GAAATATATG ATGCCCCTAA TTGCATATTT 660

GTGGCGCAAT TTATCGGCTC ACCAGCAATG AATATGTTGA ATGCTACAGT TGAAATGGAC 720

GGATTGAAGG TAGGAACACA CCATTTTAAA TTACATAATA AAAAATTTGA AAAGTTAAAA 780

GCTGCTGGCT ACTTAGACAA GGAAATTATT TTAGGTATTC GAGCTGAAGA CATTCATGAA 840

GAACCAATAT TTATTCAAAC TTCTCCAGAG ACACAATTTG AATCTGAAGT AGTTGTATCC 900
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AAATTAGATT CAAGAACTCA AGTGATGGCG AACGACAAGA TTACACTAGC ATTTGATATG 1020 

AATAAGTGTC ACTTTTTTGA TGAAAAAACA GGAAATCGTA TCGTCTAAGG GGGAGTATTC 1080 

ATGTCTAAAA TTTTAAAATG TATCACGTTA GCCGTGGTAA TGTTATTAAT CGTAACTGCA 1140

TGTGGCCCTA ATCGTTCGAA AGAAGATATT GATAAAGCAT TGAATAAAGA TAATTCTAAA 1200

GACAAGCCTA ACCAACTTAC GATGTGGGTG GATGGCGACA AGCAAATGGC GTTTTATAAA 1260

AAAATTACGG ATCAATATAC TAAAAAAACT GGCATCAAAG TAAAGCTTGT AAATATTGGT 1320

CAAAATGATC AACTAGAAAA TATTTCGCTA GACGCTCCTG CAGGAAAAGG TCCAGATATC 1380

TTTTTCTTAG CACATGATAA TACTGGAAGT GCCTATCTAC AAGGCTTAGC TGCTGAAATC 1440

AAATTATCAA AAGATGAGTT GAAAGGTTTC AATArGCAAG CACTTAAAGC GATGAATTAT 1500

GACAATAAGC AACTAGCATT GCCAGCTATC GTTGAAACAA CCGCACTTTT TTATAATAAA 1560

AAATTAGTGA AAAATGCACC GCAAACGTTA GAAGAAGTTG AAGCTAATGC TGCCAAACTA 1620

ACTGATAGTA AAAAGAAACA ATACGGTATG TTATTTGATG CTAAAAATTT CTATTTTAAT 1680

TATCCGTTTT TATTCGGCAA TGATGATTAT ATTTTCAAGA AAAATGGCAG TGAATATGAT 1740

ATTCATCAGC TAGGACTAAA TTCAAAACAT GTCGTCAAGA ATGCTGAACG ATTACAAAAA 1800

TGGTACGACA AAGGGTATCT TCCTAAGGCA GCAACACATG ATGTCATGAT TGGTCTTTTT 1860

AAAGAAGGAA AAGTAGGACA ATTTGTCACT GGACCGTGGA ACATTAATGA ATATCAAGAA 1920

ACGTTTGGTA AAGATTTAGG AGTAACAACA TTACCTACAG ATGGTGGCAA ACCTATGAAA 1980

CCATTTCTAG GTGTACGTGG TTGGTATTTA TCTGAATATA GTAAACATAA GTATTGGGCT 2040

AAAGATTTAA TGCTGTATAT CACTAGTAAA GATACATTAC AAAAATATAC AGATGAAATG 2100

AGCGAAATTA CTGGACGTGT TGACGTGAAA TCATCTAATC CAAATTTAAA AGTGTTTGAA 2160

AAGCAAGCAC GTCATGCTGA ACCGATGCCT AATATTCCTG AAATGCGACA AGTTTGGGAA 2220

CCGATGGGCA ATGCAAGCAT ATTTATTTCA AATGGTAAGA ATCCTAAACA AGCGTTAGAT 2280

GAGGCGACGA ATGATATAAC GCAAAATATT AAGATTCTTC ATCCATCACA AAATGATAAG 2340

AAAGGAGATT AGTTATGACG AAACGTAACC CTAAATTAGC GGCATTATTA TCTGTTATAC 2400

CTGGTTTGGG ACAGTTTTAT AATAAAAGAC CCATTAAAGG GACGATATTT TTTATCTTTT 2460

TCATCAGTTT TATTTCTGTT TTTTATAGCT TTTTAAATAT TGGTTTTTGG GGATTGTTCA 2520

CATTAGGGAC AGTACCTAAG TTAGACGATT CTCGTGTCTT ACTTGCACAA GGTATTATTT 2580

CTATCTTACT CGTTGCTTTC GCAATCATGC TATATATCAT TAATATTTTA GATGCATATC 2640

GTAATGCTGA ACGATTTAAT CGCAATGAGG AAATAAAGGA TCCGAAGcGC GTATGGTGGC 2700



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TGTAGTTGTA TTTCCATTAA TAyyTATGTT 


TGGAGTAGCA 


TTTACAAATT 


ACAATTTATA 


2820 




CAACGCGCCT CCGAGACACA CATTAGAATG GGTTGGTTTA GATAACTTTA AAACGTTATT 2880

CACAATTGGC GTTTGGCGTA aaacattttt CAGTGTTATT ACTTGGACAT TAGTATGGAC 2940

GCTTGTTGCA ACGACACTTC AAATTGCATT AGGGCTGTTT TTGGCAATTA TTGTAAATCA 3000

CCCTGTCGTC AAAGGTAAGA AATTTATCCG TACTGTGTTA ATCCTACCTT GGGCTGTACC 3060

ATCATTTGTG ACAATTTTAA TATTTGTAGC GTTATTTAAT GATGAATTTG GTGCGATAAA 3120

TAATGATATT TTGCAACCTT TATTAGGTGT AGCACCAGCA TGGTTAAGTG ATCCGTTTTG 3180

GGCAAAAGTG GCATTAATCG GCATTCAAGT ATGGCTTGGA TTCCCATTTG TCTTTGCACT 3240

GTTCACTGGA GTACTGCAAA GTATTTCATC AGATTGGTAC GAAGCAGCAG ATATGGATGG 3300

TGCGTCTAGT TGGCAAAAGT TTAGAAACAT CACATTCCCG CATGTCATTT ACGCCACAGC 3360

GCCATTGTTA ATTATGCAAT ATGCAGGTAA TTTCAATAAT TTTAATCTTA TTTATCTATT 3420

TAATAAAGGC GGTCCACCAG TGTCAGGGCA GAATGCTGGT AGTACAGATA TCTTGATATC 3480

TTGGGTGTAT AATCTGACAT TTGAGTTTAA CAACTTCAAC ATGGGTGCAG TTGTGTCATT 3540

AATTATTGGA TTTATTGTTG CTATTGTCGC ATTTATTCAA TTCAGACGTA CAAGTACGTT 3600

TAAAGATGAG GGAGGTTTAT AAGATGACAA AGAAGAAAAA CATATTAAAA GCAATCGGTA 3660

TTTACAGTTT TATAGCGATG ATGTTTGTCA TCATTTTATA TCCACTACTG TGGACATTTG 3720

GCATTTCCCT TAATCCAGGT ACGAACTTGT ATGGTGCCAA AATGATACCA GACAATGCAA 3780

CATTTAAAAA TTATGCATTC TTACTATTCG ATGACAGTAG TCAATACCTG ACTTGGTATA 3840

AAAATACGCT TATCGTAGCA TCTGCAAATG CACTGTTTAG TGTGATATTT GTCACGTTAA 3900

CAGCATATGC TTTTTCTAGA TATCGCTTTG TTGGTCGTAA ATACGGGCTG ATTACATTTT 3960

TGATTTTACA AATGTTCCCT GTATTAATGG CAATGGTCGC AATCTATATT TTC5CTAAATA 4020

CAATTGGATT ATTAGATTCT TTATTTGGAC TAACACTGGT ATATATTGGT GGATCAATAC 4080

CGATGAATGC CTTTTTAGTG AAAGGTTACT TCGATACGAT TCCAAAAGAA CTTGATGAAT 4140

CTGCCAAAAT TGATGGTGCA GGGCATATGC GTAT1TTCTT ACAAATTATG CTTCCATTAG 4200

CTAAGCCGAT TTTAGCAGTT GTTGCTTTGT TCAATTTTAT GGGGCCATTT ATGGACTTTA 4260

TATTACCTAA AATACTATTA AGAAGTCCTG AAAAATTCAC ATTAGCAGTT GGATTGTTCA 4320

ACTTTATTAA TGATAAGTAT GCAAATAATT TCACAGTGTT TGCAGCAGGG GCAATTATGA 4380

TTGCAGTACC TATAGCAATC GTATTCTTGT TCTTGCAACG CTATTTAGTA TCAGGTTTAA 4440

CAACAGGTGC GACAAAAGGT TAGTTTGAAA TTAGGAGTGG GGCAGAATTG ATAAAGAACC 4500
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GGGTGTGGTG GTATTGCGAA TGGCAAGCAC ATGCCAAGTT TACAAAAAGT TGAAAATGTT 4 620 

GAAATGATCG CATTTTGTGA CGTAGACATT TCGAAAGCAG CGAGTG CGGC AGAAGCATAC 4 68 0 

GGAACTGACA ATGCAAAGGT TTATGATGAT TACAAAGCAT TGTTAAAAGA TGACACGATT 4 74 0 

GATGTTATCC ATGTTTGTAC GCCAAATGAC TCGCATTGTG AAATTACTGT AG CAGGGTTG 4 800 

CATGCTGGTA AACATGTGAT GTGTGAAAAA CCAATGG CT A AAACGACAGC AGAAGCTCAA 4 860 

AAAATGATAG ATACAGCTAA ATCAACAGGT AAAAAATTAA CAATAGGTTA TCAAAATCGT 4 92 0 

TTCCGAGCAG ATAGTCAATT TTTACATCAA GCAGCGCAAC GTGG CGACTT AGGAGACATT 4 980 

TACTTCGGAA AGGCACATGC CATTCGTCGT CGAGCAGTAC CAACATGGGG TGTCTTTCTA 504 0 

GACGAAGAAG CTCAAGGTGG AGGACCATTA ATCGATATCG GTACACACGC TTTAGATTTA 5100 

ACGTTATGGA TGATGGATAA TTATGAACCA GAATCAGTGA TGGGTTCAAC ATTCCATAAA 5160 

20 TTAAATAAAC AGCATCATGC GGCAAACGCT TGGGGTTCAT GGAATCCAGA TGAATTTACA 5220 

GTTGAAGATT CTG CGTTTGG ATTTATTAAA ATGAAGAATG GAGCGACGAT CATTTTAGAA 52 80 

TCCGCTTGGG CGATTAATTC TTTAGAAGTG GATGAGGCAA AATGTTCATT ATCAGGAACT 534 0 

AAAGCAGGTG CTGATATGAA AGATGGTCTA CGTATTCATG GTGAAGACAT GGGTACACTT 5400 

TAT AC CAAAC ACGTTGAATT GGAAAACAAA GGCGTCGACT TTTATGAAGG TAATGAAGTG 54 6 0 

GATGAAGCTG AAGAAGAAGC AAAAG CTTGG ATTGATGCAG TTGTAAATGA TACTGAACCA 5520 

GTTGTGAAAC CGGAACAAGC AATGGTAGTT ACAAAAATTC TTGAAGCGAT TTATCAGTCT 5580 

GCAAAATCAG GCAAAGCAAT TTACTTTGAA TAACATCATA CGGTAAGGAG GCACATCATG 5640 

ACAAAATTAA AAGTTGGTGT GATAGGTGTT GGTGGTATTG CACAAGACCG TCATATTCCA 5700 

GCATTGCTGA AACTCAAAGA CACAGTCTCA TTAGTTGCAG TACAAGATAT TAATACAGTG 5760 

CAGATGATTG ATGTTGCGAA gCGCTTTAAT ATACCTCATG CAGTTGAGAC ACCTAGCGAG 5820 

40 CTGTTTAAAC TTGTTGATGC GGTGGTCATT TGTACACCTA ATAAATTCCA TGCTGATCTT 5880 

TCTATAGAAG CATTGAACCA TGGTGTC CAT GTATTGTGTG AAAAGCCAAT GGCGATGACG 5940 

ACGGAAGAGT GTGATCG CAT GATTGAAGCG GCTAATAAAA ATCACAAATT ATTAACTGTC 6000 

45 GCATATCATT ATCGTCACAC AGATGTGGCA ATTACTGCTA AAAAAGCAAT TGAATCAGGT 6060 

GTGGTTGGTA AACCTTTAGT AGCACGTGTA CAAGCGATGC GTAGGCGTAA AGTGCCTGGC 6120 

TGGGGTGTTT TTACCAATAA AGCGTTGCAA GGTGGCGGTA GTTTAATCGA TTATGGTTGC 6180 

so 

CACTTGTTAG ACTTATCTTT GTGGCTACTA GGTAAAGATA TGGTGCCGCA TGAAGTGCTA 6240 

GGAAAAACAT ATAATCAATT GAGCAAACAA CCGAATCAAA TTAATGATTG GGGAACATTT 6300 
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40 



GCAAGCATGC AGTTTGAATG TTCGTGGTCT GCAAATATCA AAGAAGATAA GGTTCACGTT 6420

AGTTTATCAG GAGAAGATGG CGGTATCAAT TTATTTCCAT TTGAAATATA TGAGCCCCGC 6480

TTTGGAACTA TTTTTGAAAG CAAAGCTAAT GTTGAGCATA ACGAAGACAT TGCTGGTGAG 6540

AGACAGGCGC GTAACTTTGT CAATGCGTGT TTAGGGATAG AAGAGATTGT GGTGAAACCG 6600

GAAGAAGCAC GCAATGTAAA TGCCCTTATA GAAGCGATTT ATCGTAGCGA TCTTGATAAC 6660

AAGAGCATAC AACTTTAATG ATTATCATAT ATGATACAAA ATTCTCAATA TAAAAAGAAG 6720

GAGTGCTTTT CAATGAAAAT AGGTGTATTT TCAGTATTAT TTTACGATAA AAATTTTGAA 6780

GATATGTTAG ATTATGTCTC AGAATCTGGA TTGGATATGA TTGAAGTTGG AACAGGTGGT 6840

AACCCAGGAG ATAAATTTTG TAAGTTAGAT GAGTTGTTAG AAAATGAAGA CAAGCGCCAA 6900

GCATTTATGA AGTCAATCAC AGACAGAGGC TTACAAATAA GTGGTTTCAG TTGTCATAAC 6960

AATCCAATTT CTCCAGATCC GATAGAAGCG AAAGAAGCCG ATGAAACGTT ACGTAAAACA 7020

ATCCGTTTAG CAAATCTATT AGACGTGCCA GTTGTTAATA CATTTTCTGG CATTGCAGGA 7080

TCAGATGATA CCGCTAAAAA GCCTAATTGG CCTGTTACAC CTTGGCCAAC AGCCTACTCT 7140

GAAATTTATG ATTATCAGTG GAATGAAAAG TTGATACCAT ATTGGCAAGA TTTAGCTGAG 7200

TTTGCAAAAG AGCAAGATGT AAAAATTGCC ATAGAGTTGC ATGCAGGATT TTTAGTGCAT 7260

ACACCATATA CAATGTTGAA GTTACGTGAG GCTACAAATG AATATATCGG TGCTAACTTA 7320

GATCCTAGTC ATCTATGGTG GCAAGGTATT GACCCAATTG CTGCGATTCG CATATTAGGC 7380

CAAGCAAATG CAATTCATCA CTTCCATGCT AAAGATACGT ATATTAATCA AGAAAATGTA 7440

AATATGTATG GTCTAACTGA TATGCAACCA TATGGTAACG TTGCGACAAG AGCATGGACA 7500

TTCCGTACAG TTGGTTATGG ACATAGTCCA TATGTATGGG CAGATATCAT AAGTCAACTT 7560

ATTATTAATG GATATGATTA TGTATTAAGT ATTGAACATG AAGATCCTAT TATGTCAGTA 7620

GAAGAAGGTT TCCAAAAAGC TTGTCAAACT TTGAAATCTG TTAATATTTA CGACAAGCCA 7680

GCAGACATGT GGTGGGCATA ATACGAACTC GAGGTTAGTC TGAAGTTTGT CTGAAGTAAG 7740

ACTGGTGGCA GTGTTGAATA AATGCATATG TCGCCAAGCC ATTGCCAAAA ATTTCACACC 7800

TTAAATCAAG TCATTGTTTG TAAAGAAGGT GTACTTTATA TAAGTATATA GCGATGGTCA 7860

TACCCATTCA CAGTAACAAT CCTCACCATT GAAAAGAGTA TATAACCTTT TCAATAGTGA 7920

GGTATATGAT AATAAAAAAA GCCTGTTGTC ACAATGGTCA TAGACACGAC ATACTTTAAA 7980

GGTTTCTGAA TATAATATTT CAGAATGCAC TTTAAAGATG GACGTCGATG TAGACTAAAG 8040

TGATGACAGG CTTTCATCTT TTTAAATATT CATTAATTTC TCTTCTTGTT TAATACGTAC 8100
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TAATACACCG ATTAATTCAG GAATGATGTT TAAGAAGTAA TTTGGGTGTT TTGTAATTTT 8220 

ATATAATCCA GATTTAATAA TAGGATGGTT AGGTAAAATG AATAATTTTA ATGTCCAAAT 8280 

ACCACCTAAA GTTTTAATAA CCATAAATAA CATGATATAA G CAAAG ATT A ATATAACTAA 8340 

GCCAATACCA TTTGCAAAGC TAAATGTATC TTTATTAATA AATGCCTCTA CACCAGCCAA 84 00 

TACATAAATT AAAACGTGTG TTATTGCTAA AAACTTCGAA TTTTTAACGC CATATTCAAC 84 60 

TGCACCGTCT GCTTTTAATT GTTTTGAGTG ATTAATAGAT ATCTTTAAGC TGACAAGTCT 8 520 

GATACAGAAA AAGATAAGTA ATATAGATAG AATCATGATG TCCTCCGTCA TTATGTCATA 8580 

TGTATAAGCG TTGATTTTGA CAACATAAAG TATTTTATAG ATAAAGCTTG TCAAATACTA 8640 

TTAACTATTT ATTAATTTTA GTACATAAAT ATGTTTCTAA GTATGTGTTT ATGTTCAGTA 8700 

TTTTGGATAA TTTAATAATT TTAAGGATAT TAAGCGCTTA CACCGACGTG ATATATTTGG 8760 

20 CTTAACGAAA ATGATTGAGG TGACAGAGAT GAACTTTTTT GATATCCATA AGATTCCGAA 882 0 

CAAAGGCATT CCATTATCGG TACAACGTAA ATTATGGCTT AGAAACTTCA TGCAAGCTTT 8880 

CTTCGTAGTG TTCTTTGTTT ATATGGCTAT GTATTTAATT CGAAACAACT TTAAGGCGGC 8 94 0 

ACAACCGTTT TTAAAAGAGG AAATTGGATT ATCTACATTA GAACTTGGTT ATATCGGATT 9000 

AG CATTTAGT ATCACGTACG GTTTAGGAAA AACATTACTT GGATATTTTG TCGATGGACG 9060 

TAACACAAAA CGTATTATCT CGTTCTTACT TATCTTATCT GCGATTACAG TTTTAATTAT 9120 

GGGATTTGTT TTAAGTTACT TTGGTTCTGT AATGGGATTA TTAATTGTAC TTTGGGGACT 9180 

TAACGGGGTG TTC CAATCAG TTGGTGGACC TGCAAGTTAT TCAACGATTT CAAGATGGGC 9240 

GCCAAGAACG AAACGTGGCC GATACTTAGG ATTCTGGAAT ACATCACATA ATATCGGTGG 9300 

TG C CAT AG CA GGTGGTGTTG CACTTTGGGG TGCTAATGTA TTCTTCCATG GAAATGTTAT 9360 

AGGGATGTTC ATTTTCCCAT CGGTGATTGC ATTACTTATT GGTATCGCAA CATTATTTAT 9420 

40 CGGAAAAGAT GATCCGGAAG AATTAGGATG GAATCGTGCT GAAGAAATTT GGGAAGAGCC 9480 

GGTCGATAAA GAAAATATTG ATTCTCAAGG TATGACG AAA TGGGAGATCT TTAAAAAATA 9540 

TATCCTGGGA AATCCTGTTA TATGGATTCT ATGTGTTTCA AACGTCTTTG TATACATTGT 9600 

45 ACGAATCGGT ATTGATAACT GGGCACCGTT ATATGTGTCA GAGCATTTAC ACTTTAGTAA 9660 

AGGCGATGCA GTTAATACGA TATTCTACTT TGAAATTGGT GCATTAGTTG CAAGTTTATT 9720 

ATGGGGCTAC GTATCAGACT TATTAAAAGG TCGTCGTGCA ATTGTAGCTA TTGGCTGTAT 9780 

SO 

GTTTATGATT ACATTTGTTG TCTTATTCTA CACAAATGCT ACAAGTGTCA TGATGGTTAA 984 0 

CATTTCATTG TTTGCATTAG GTGCGTTAAT CTTTGGTCCG CAATTATTAA TTGGTGTATC 9900 
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X X Cvj^j X Vj**A_ X 




AGTTGGTTTG 


GCGGCTATTG 


CTGATCCAAC 








a p^Ta app/*it 


X Innnwil V— X 


T— T<^> O TA T TV T TV P 

1 XVjVjAXAIAL, 


ATTAAGTGGA 


TGGACAGATG 


TTTTCATCGT 


1008 0 




5 




PPA'!"!* A'T'T/"*/'*' 


X AV» VjL- A ion X 


TCTATTAGGA ATCGTTGCTT 


TCTATGAAGA 


1014 0 






& A R CI A A A ATT 


AARTTT A A 


A A ATTTA ATA 


TAAATCGGAT 


TAAAAGTATC 


GCCAATCTAT 


10200 




10 


X Uvnn X n X 




Tf2r*pr*pf^ a pp. 


GCATGTGCGT 


GAAGAGATGA 


AAGATACTGC 


1026 0 




X X L* X AwUv- X X 


GPAAATATAT 


P A TPTPT A Tfi 


TCTCGGGGCA 


GAT CAT AATT 


CCCTGTTATG 


10320 






AAljrXnX ^.L X X 


attwpppp. a 


PTTAPnP.TP.A 


CTCAATGAAT 


TTACTCCTTA 


CAATAAAGAC 


103 8 0 




15 


ATATAfSfYSflT 


fSTPAATATm 
VilL/Vil/ii XVJ 


T Aftfifi A fIT AT 

X rlVTVJVl/lU X *\ X 


TGTTTTATAT 


TTAAACTCTC 


TAAAAAGCGG 


1044 0 








7A AP.TY2 A A A 7X P 
AAVj luAAAnt 


X ILiLlAlLn 


GTCCGCTTTT 


TCATAGAACA 


AAATGGAGGC 


10500 






p a t ii a tp a 

Vst-L-A X AA 1 V-A 


X XAxaX lAlul 


VjV_ X AAX L1AX 


TTTGCTTGCT 


TACAATAATC 


ACTTGGCGAC 


10560 




20 


XTTWTR A TVT 

ATI IAj X AAA 1 


AX 1 X 1 X XAAA 


TA TP TA T TA P PT 11 
A X LxA X M\jV_ X M. 


A^CZATTTTAT 


ACTCTGAAAA 


GPPTAPTTTR 


10620 






1 V- iuuvil X X 


LAlnOlvl X X 


rtT A TATPiT PP. A 
Vj X AA X V7 X CLxM. 


TCTAACAATT 


GCAGAGATTG 


TAAAATACGA 


10680 






#T1 ft ft ft 

TACjCOAX 1 1 a 


TV TATA TATPP" 7A C 
AA1AA1 V_Lj>VV_ 


A ATTPT7A ATA 
AA X x\j X AA X A 


CCAACATGAT 


CTTTAAATGT 


TPGPAT PfiPA 


1074 0 




25 


X ALAxA 1 X V__H.V_ 


TA APATPPAT 
lAALn X V- vxr\ X 


TA ^PP TTY"1 TA TA J i — i ' 


AAATCTGAAA 


CAGtCACTTT 


CfSTTTGATAA 

^ w X X Xw^^XXV^ 


10800 






GATTGCTTAA 


TTTGATC CAC 


AA1L XLtVj X X X 


ACATAATAAT 


CATCGTATTC 




10860 








IwA X V_A X L7 


TA P TA TA P TA TP PT 
AL AAVjA 1 L»L, 1 


AAGCTACGGC 


CGTTCTGTGA 


TTnTTCPT PT 

X X V7 X X VJV_ 1V.1 


10920 




30 


T\ * rpi\ TV /"VP TA f* 




TPPT1 TA TA ATA 
I LLIAAAA1A 


ACTTCTAATT 


GTGCATGGTC 


T A PTGGTTTT 

X *^>^-. X VJw X X X X 


10980 






A TATTA A ATB. AT 
AA1AAA1AA1 


PTATAP'AAP'TA TP 


ITPTTT TA ATP. 


CCGGCTTTCA 


TATATTCAAA 


GTCATCGTAA 


11040 




35 


I- X V~L*H. X MA X A 


t<z a tp. a p att 

X Vxn X Vjx*V_jH X X 


TA P & 21 TPT AP. A 


TGCGCAATAT 


CATTGAGTAA 


ATCGACGCCA 


11100 




1111 IAUjXu 


f2P ATA/"V2 A TAT 


RTPAfiT TA ATT 
AlLAulAAi X 


ACT AATT CTG 


GCTGATGTTG 


TTGAATTAGT 


11160 






— - f*^ TA T'TA-fc TV"» PTT 

IjAlMnivjU 1 X 


A AO AP P TA TP - 


- TTTAP.PHP.T<T 
X X XAL3l~AVjr XVj 


TATATTGTAT- 


TGAAATG AT A 


GTGTCGPGAA — 

X V— <* X W^rVrWVt 


- 11220 — 




40 


ry A ATPATIT 
VjVjAAX VxAX 1 X 


VjL. X X X AM, X 


Jl X L- X LVjAM X r\ 


ATTCGTTCAT 


CATCACAAAT 


AACTACCTTA 


11280 






X ALA 


X lwUV.LV, 1 1V> 


AAAV9 X\9V7 X A X 


TTTAT A A PA A 
X X 1/VlArVV^nn 


ATTAAPGTAP 


CTTGATTACG 


11340 






CTTTGAaAAA 


Ax AX oViAVj 1 L, 


n«pp A TV2TY2 A 


AC CAT ATTGA 


ATCATTGCTT 


TATTGTGTAA 


11400 




45 


HtV^TtlTI'IV TAT 

ATuAiTXAA 1 


PPPA TA ATPPT 
LLLAAAloL X 


T APT ATP A A A 
lAVjlAlLAnA 


TACATCATTA 


TTAAGAGATT 


GGPGT APATA 


11460 






TTGCAGGCGA GATGACGACA TCCCGATACC ATTGTCGCAA ACTAAAACAT GTAAATTCTG 11520

ACGTOCCAAT GTCAGGCGTA TAGTAATGTC CAATGACTCA GTATCTCTAC CATGTTTAAT 11580

AGCATTTTCT ATGAGTGGCT GAAGCATCAT TTTACCAATT GTCTGGTGAC GCGCTTCTTC 11640

AGAACTTTCA ATATGGAGCT TAATCATGTC ATCAAAACGG aTGTTTTGTA TTGCAACATA 11700
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GTAACGTAAC 


ATTTG CGATA 


ATTGTTGGAC 


CACAGTTtGT 


GCTAATTTCG 


GAGATAACGT 


11820 




AATTAAATAT TGTATTGTTT GCATCGTATT GAATAGGAAA TGAGGCTGGA ATTGGCGTTC 11880

TATTTCCTTT AACTGAATAT CACGCAAGCG ACGTTCTGTA TGCTCGATAG AATGGATCAG 11940

TTGCTCATTT GATTCAAATA AATCGTAAAT ATAATTATTA ATTTCTTCTA GTTCACTGTT 12000

GTTTTTTAAA GGCGTATATG TACCTAGATG GCATAGTAAA ACGATTTTTG ttttttgaat 12060

AATCGTTTCG ATATCITITG TTTGTCGTTT AGCCATATTA TCTGCGCTAA TGAAACCAAA 12120

TATTACTAGT AAAACAAGAA CTACGGCCAT AACAATTAAC AACGTGATAC CATCTTCAAT 12180

GTTTTCATGT ATATCTTTAT AAATAATGAG ACGATGGTCA GCATGGTTTA ATTTTACAGA 12240

TTCATTCATA AATCCGAATT GTTGTGGTcT ATACTTTTCA CCTATAGTAA AACGGTCATC 12300

GTTGGCGTAT AAAATATTGT CATATTGATC AmCGATAAGT GCGAATTGTC GGTTATCTTT 12360

CtTAATTTCA CTTAAACGTG GGGTGTtAGC CATATAAATt TTaAGCATAT ATGTACTATT 12420

TTTGAATTTA AGCTGATGCG TTGAAAATAA ATACATATTT TTAGTGTTTA AATGTTCATA 12480

ATTATTGGTT ATAAACTGAT TTGGTCCAGA TAATTCATAA TAAAGTGTTG CGGGCTGTTG 12540

GkGTATTAAT TTTAATAATT CACGTTTTGT AGCGGTCACA TCATGATGAT TTGyTAAATC 12600

GAGCTCTTGA AACGAATTAT TATGCTGTGT AATAAATGTC TGAATCTGCT TTTCAGTATG 12660

ATGTAAAGAT GACTGACTTT CATCAACATG TTGATGAATC GTACGATGCT CAATCCAAAT 12720

ATAGATGGCA TAGAAGCTTA CTAGTCCAAT AATAATGACT AAAAATACTG GAAAAATAGT 12780

AGACnCAAAT AACGATCGTC TTAATTGATG TCTATAAGGT TTGTATGCCn TCATTGAATC 12840

ATCTCCAAAA ATTTATGATG TGGAATATCC GGTAATTTAG ATTTCGGTAT TAAAGGTATG 12900

TTCTTAAGAT TTTCGATAGA CTGATCGCTT TGTTCACTAA CATCCTTTCG AATTGACTTG 12960

GCATCGAACT CTGCAACTAA TCGTtGTTGT ACTGAGCGGC TTGTTAAATA TTGCACTAAC 13020

TTTTTACGCT TAGGATGAGG GTGTGCATTT TTAACTAAAG CAATrCCATC AACATTTAAC 13080

ATTGTTCCTT CAATTGGATA AACGATTGAT ACAGGATAAC CTTTGTTTTT CCATGTGCGT 13140

GCATCTTGTT CGTAGCTTAG ACCTGCGTAA TATTTACCTT TTGCAACATC TTCAATGACT 13200

TTAGACGTCT TTGACAGTTG CATCGCATGG TTTTGGAATT GATGCACATC ACTTACTCGA 13260

TGATGCATGC TATAAATAGC ACGCATATGT TGATAGCCTG TCGTTGTTGT ATTTGGATTT 13320

GAGTACGCAA TTTTACCTTT AAGTATAGGT TGTAATAAAT CTTGATAACC TCGAATCTTA 13380

ATATCTCCTT GTAAATCTGA ATTCACTACT ATAACTGTTG GCATTAATAG AAAACTAGTA 13440

ACATATTTAT TGTTCGAGCG ATAATCCTCT AATTGCTGTG TTACAGATGT ATCTTGATAG 13500
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CCACGCTCCG AAAAATCTTC GTTATGCAAG TTTGAAAGCA GTACTTGAGT AGATCCGTGT 13620

TTAATTTCAA TTTTGACATG CTCTTGTTTT TCAAATTCAT TTAAAATTGG ACGAATCAAG 13680

TTTGATTGAT ACGGAGAATA AACTGTTAAT ACATTTTTAT CGGATTCAGA GTGACGCGTA 13740

TTAGCGCATG CTGaTAAAAA AATGAGAAAT AATAGCAAGA TATAAATTTT TGATTTCATG 13800

ATATCCCATC AATTCTATGT ATATTTTAAT ACAATAATTT TAGCAATAAA TGACGCATAA 13860

GTAATGTTAA ATATTTAGAA ATGTTTATAG ATGACTTGTT AAGACGTTGC AAATGTTGTG 13920

ATAGCACAAA ATTTTTGTTT GTCAAGACGA TTTACCGAGG CTGTAAAATC AAACTGTTAT 13980

ATTTTATTTG TAGCTGTTAT ATAAAAATCG GCAAGATATT GAACGGTTCA AAAGTGAATT 14040

TTTACGTCAA TAAAAGTATT TAATCCAGTC TCTTCATATA TAAAAGTAAA TCTTTCTAAG 14100

TGTTGATTTA ACGCTTATCA ACAATCATTT TTTATAAACA AATATATACT CCTAAATTAA 14160

CTTTTAAAGC AATGAAAATA GTGAACATTA TAACTGTTGT GTAACAGAAT GCAATTAGCA 14220

TATTACTGTT ACACAAATTA GTACAGTTTC TATGTTTTGA CATACATTTG ATGAAAATTG 14280

TACATAATTT ATGTGAAAAA AATCACAACA AACATGCTAC AATGACTATG AAAACGTTAA 14340

CATAGCATTT CAAATTCACA ACATTATACA GATGGAGGCG TTTAGTATGT TAGAAACAAA 14400

TaAAAATCAT GCAACAGCTT GGCAAGGATT TAAAAATGGA AGATGGAACA GACACGTAGA 14460

TGTAAGAGAG TTTATCCAAT TAAACTACAC TCTTTATGAA GGTAATGATT CATTTTTAGC 14520

AGGACCAACA GAAGCAACTT CTAAACTTTG GGAACAAGTA ATGCAGTTAT CGAAAGAAGA 14580

ACGTGAACGT GGCGGCATGT GGGATATGGA CACGAAAGTA GCTTCAACAA TCACATCTCA 14640

TGATGCTGGT TATTTAGACA AAGATTTAGA AACAATTGTA GGTGTACAAA CTGAAAAGCC 14700

ATTCAAACGT TCAATGCAAC CATTCGGTGG TATTCGTATG GCGAAAgcAG CTTGTGAAGC 14760



TTACGGTTAC GAATTAGACG AAGAAACTGA AAAAATCTTT ACAGATTATC GTAAAACACA 14820

TAACCAAGGT GTATTCGATG CATATTCTAG AGAAATGTTG AACTGCCGTA AAGCAGGTGT 14880

AATCACTGGT TTACCTGATG CATACGGACG TGGACGTATT ATCGGTGACT ATCGTCGTGT 14940

AGCTTTATAT GGTGTAGATT TCTTAATGGA AGAAAAAATG CACGACTTCA ACACGATGTC 15000

TACAGAAATG TCAGAAGATG TAATTCGTTT ACGTGaAGAA TTATCAGAAC AATATCGTGC 15060

ATTAAAAGAA TTAAAAGAAC TTGGACAAAA ATATGGTTTC GATTTAAGCC GTCCAGCAGA 15120

AAACTTCAAA GAAGCAGTTC AATGGTTATA CTTAGCATAC CTTGCTGCAA TTAAAGAACA 15180

AAACGGTGCA GCAATGAGTT TAGGTCGTAC ATCAACATTC TTAGATATCT ATGCTGAACG 15240 

TGACCTTAAA GCAGGCGTTA TTACTGAAAG CGAAGTTCAA GAAATTATTG ACCACTTCAT 15300 
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AG ACCCAA r*T 


TV2/^2T*a a /"tv* 
X VjVjVj 1 AAtTTG 


AAT CT AT CGG 


TGGTGTAGGT 


ATTGACGGAC 


GTCCACTTGT 


15420 




TACGAAAAAC* 


TP* A TTP PY2 r P r P 


TCTTACACTC 


ATT AGATAA C 


TTAGGTCCAG 


CTCCAGAACC 


15480 


5 


AAArTTAAHA 


v» XM.X lAlVjul 


CAGTACGTTT 


ACCTGACAAC 


TTCAAAACAT 


ACTGTGCAAA 


15540 




AATGAGTATT 
.rtrt X Vj/"VVj 


A A A A P A A fiTT 


LIA xCCAATA 


TGAAAATGAT 


/■^ m i> Hull *K mnA 

G A CATT ATG C 


GTGAAAGCTA 


15600 


70 


TGGCGATGAP 

X wVJVwVJ^V X VJ^AVw 


TATGGTATCG 


CATGTTGTGT 


ATCAGCGATG 


A CAATTGGTA 


AACAAATGCA 


15660 


/IX 1V.X X V»VJ\J X 


GCACGTGCGA 


ACTTAGCTAA 


AACATTACTT 


TACGCTATCA 


ATGGTGGTAA 


15720 




an a tv: a a a a a 

nunlonnnnn 


TCTGGTGCAC 


AAGTTGGTCC 


AAACTTCGAA 


GGTATTAACA 


GCGAAGTATT 


15780 


75 


AGAATATGAr* 


GAAgTATTCA AGAAATTTGA TCAAATGATG 


GATTGGCTAG 


CAGGTGTTTA 


15840 




PATTAAfTr'A 
\— rl X X e\J-\\^ X V_-r\ 


TTAAATGTTA 


TTCACTACAT 


GCACGATAAA 


TACAGCTATG 


AACGTATTGA 


15900 




a iiwraTTa 

/iA.XVjv7V_A,X 1A 


CATGATACAG 


AAATTGTACG 


TACAATGGCA 


ACAGGTATCG 


CTGGTTTATC 


15960 


20 




GACTCATTAT 


CTGCAATTAA 


ATATGCACAA 


GTTAAACCAA 


TTCGTAACGA 


16020 




AVjAftljVj 1 L. 1 1 


GTAGTAGACT 


TTGAAATCGA 


AGGCGACTTC 


^*.^^#**ft. fti ft mm **m^% 

CCTAAATACG 


G T AACAATG A 


16080 




r"Y*!J a r'pOTPT TV 
vJvj I Vj x A 


GATGATATTG 


CAGTTGATTT 


AGTAGAACGC 


TTCATGACTA 


AATTACGTAG 


16140 


25 




TATCGTGATT 


CAGAACATAC 


AATGAGTGTA 


TTAACAATTA 


CTTCAAACGT 


16200 




XVj IAIAHju I 


AAGAAAACTG 


GTAACACACC 


AGACGGACGT 


AAAGCTGGCG 


AACCATTTGC 


16260 






AACCCAATGC 


ATGGCCGTGA 


CCAAAAAGGT 


G CATT ATCTT 


CATTAAGTTC 


16320 


30 




ATCCCTTACG 


ATTGCTGTAA 


AGATGGTATT 


TCAAATACAT 


TCAGTATCGT 


16380 






TTAGGTAAAG 


AACCAGAAGA 


TCAAAACCGT 


AACTTAACTA 


GTATGTTAGA 


16440 


35 




ATGCAATGTG 


GTCACCACTT 


AAATATTAAC 


GTATTTAACC 


GTGAAACATT 


16500 


A AT A n ATY2 r* A 


ATGGAACATC 


CAGAAGAATA 


TCCACAGTTA 


ACAATCCGTG 


TATCTGGTTA 


16560 




rnPT^PTTa a r* 


TTCATTAAAT 


TAACACGTGA 


ACAACAATTA 


GATGTAATTT 


CTCGTACATT 


16620 


40 


f^r* A*TG A RRPT 
LWiawinAljl 


ATGTAACAAA 


ATTTAAGGTG 


GGAGCACTAT 


G CTTAAGGG A 


CACTTACATT 


16680 




v_ X Vj x WwiAAv 


TTTAGGTACT 


GTCGATGGAC 


CGGGATTAAG 


ATATATATTA 


TTTACACAAG 


16740 




vxrt. 1 VjV_ 1 1 AL, X 


TAGATGCTTG 


TATTGCCACA 


ATCCAGATAC 


TTGGAAAATT 


AGTGAGCCAT 


16800 


45 




CACAGTTGAT 


GAAATGGTGA 


ATGAAATATT 


% ^^^^ m wry ^% «k it n 

ACCATACAAA 


CCATACTTTG 


16860 




ATGCATCGGG TGGCGGTGTA ACAGTCAGTG GTGGCGAACC ATTGTTACAA ATGCCATTCT 16920

TAGAAAAATT ATTTGCAGAA TTAAAAGAAA ATGGTGTGCA CACTTGCTTA GACACATCGG 16980

CTGGATGTGC TAATGATACA AAAGCATTTC AAAGGCATTT TGAAGAATTA CAAAAACATA 17040

CAGACTTGAT ATTATTAGAT ATAAAACATA TTGATAATGA CAAACATATT AGATTGACAG 17100
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TATGGATTCG 


ACATGTCCTT 


GTGCCTGGTT 


ATTCTGATGA 


TAAAGACGAT 


TTAATTAAAC 


17220 




TAGGGGAATT TATTAATTCT CTTGATAACG TCGAAAAGTT TGAAATTCTG CCATATCATC 17280

AGTTAGGTGT TCATAAGTGG AAAACATTGG GCATTGCATA TGAATTAGAA GATGTCGAAG 17340

CGCCCGATGA TGAAGCTGTT AAAGCAGCCT ACCGTTATGT TAACTTCAAA GGGAAAATTC 17400

CCGTTGAATT ATAAATACAA TTCAGACCGA AAAGAAAGCA TATGCAACTT CAAGAGTGAA 17460

GGGGCATATG CTTCTTTTTC AATTGAGTAT TGAGTATTAG CAAGACGTAG TAAGTATATG 17520

AGACAACTTC TACAATGGTT GAAGGAAGAC GTTTTTGTAA GTAGCTATGC TGATAAAGAA 17580

TGTGATGTCT TGTTAAAGGT GGGGTTCCAA TATCATCATT TAGCTGATGT TGAATGGGTT 17640

ATTATTTGCT ACTTGCATAT GAATATGAGT CTTTTCAAAT TTTTATTGAC CCTGAGTAAT 17700

GAAAAATATT AAGATGAAAC TTAATATTAA AgCAATGCGG AGCGTGATTA TGAAGAGAAT 17760

TAGTAAAGAT ATATGGGCAG TATTTAAATT ACTGTATCaA AATAAAGGGC GTTTTAGCAT 17820

TAATGCCTTA CTATTGCAGT TAATCATGAT TTTTATTAGT AGTACATACT TAATTTTACT 17880

ATTTAATATG ATGTTAAAAG TAGCTGGCAA AGCCAACTTA CGATTAACAA TTGGACGGAA 17940

ATCGTTAGTC ATCCCGCCAG TGTGATACTT CTTATTATAT TCATATTAAG TGTTGCCTTT 18000

CTGATTTATG TAGAGTTTTC ATTGTTAGTT TATATGGTTT ATGCCGGCTT TGATCGACAG 18060

ATTATTACAT TTAAATCCAT TTTTAAAAAT GCCTTTGTAA ATGTGCGTAA ACTCATAGGT 18120

GTACCAGTTA TTTTCTTTGT CATTTATTTA ATGTTAATGA TACCCATTGC CAACCTAGGA 18180

CTAAGTTCAG TATTAACAAA AAATATTTAC ATACCTAAAT TTTTAACGGA AGAACTTATG 18240

AAAACGACGA AAGGTATAAT CATTTACGGT ACCTTTATGA TTGCTGTATT TATATTAAAT 18300

TTTAAATTAA TATTTACTCT ACCGTTAACG ATTTTAAACC GCCAGTCGTT ATTTAAAAAT 18360

ATGAGACTAA GTTGGCAAAT TACGAAGCGA AATAAGTTTC GGCTTGTTAT AGAAATAGTT 18420

ATATTAGAAC TCATCATTGG TGCGATTTTA ACATTAATTA TTTCAGGAGC AACATATCTT 18480

GCTATTTGTG TAGATGAAGA AGGAGATAAG tttttagtct CATCAATTTT ATTTGTTGTA 18540

TTGAAAAGCG CATTGTTCTT CTATTATkTA TTtACGAAAT TATCATTAAT CAGTGTGTTA 18600

GTACTGCACT TAA 18613




(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1214 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

AAAGTTTTAA AAGGGGTGAG ATACTTGGCG AATAATCCAT TCCAGCTTTG CGTTTAAAAG 60

GAATTATACT TGCCATTGTC GGTGCTTGTT TATGGGGATT AGGTGGTACT GTTTCTGATT 120

TCTTGTTCAA ATATAAGAAT ATTAATGTCG ATTGGTACGT CACTGCTCGA CTTGTAGTCA 180

GTGGTGTTTT CTTACTTATT ATGTACAAAA TGATGCAACC CAAACGTTCA ATATTTAGCG 240

TATTCCAAGA TCGACGTATG TTAGGCAAAT TACTTATCTT CAGTATACTG GGCATGTTAG 300

TAGTACAATA TGCTTATATG GCATCTATTA ATACAGGTAA TGCTGCGATT GCAACATTAC 360

TACAATACAT TGCGCCAGTT TATATTATTA TTTGGTTTGT CATAAGAGGC GTTGCAAAAC 420

TAACATTATT TGATGTGCTT GCTATTATCA TGACACTATT AGGAACATTT TTATTATTAA 480

CAAATGGTTC ATTTTCTAAT TTAGTCGTCA ATCCTGCAAG TTTATTCTGG GGTATTTTAG 540

CTGGTGTAGC ACTCGCTTTT TACACAATTT ATCCTTCAGA CCTACTTAAC CGCTTCGGTT 600

CGATTCTAAT TGTCGGGTGG GCAATGCTTA TTTCTGGTGT TGCGATGAAT TTACGCCATC 660

CAATTTGGCA CATTGATATC ACTAAATGGG ACATATCAAT TATATTATTT TTAATCTTTG 720

GTATTATCGG TGGTACCGCA CTCGCATTTT ATTTCTTTAT CGACAGTTTA CAATACATAT 780

CAGCGAAAGA AACAACATTA TTCGGAACTG TTGAACCTGT CGTAGCCGTT ATCGCAAGCA 840

GTCTATGGTT ACATGTGGCA TTCAAACCAT TTCAAATCGT AGGCATCATT CTTATTATGA 900

TTTTAATTTT ATTACTATCA CTTAAAAGAC AACCTGAAAC ATTAGATGAA TAAGAAAACT 960

CTGATAATCA CTTTAGCAAG TAACTATTAT TTAACAACGT AGTTACCTTA TAGGTGATAT 1020

CAGAGTTTTT TATTTTAGTT AATAATATTT TTCACTTGGT ATAAAAAaGC GTCGTCGCTC 1080

TGGTAATCGG AAATACTGGA ATAAAATATG GAATTGGGTA ATAATCCCAG GTAnTAAAAG 1140

TCCAJGTTCC GATAnCCTnT CCGCAnCTCC AACCAAATTT GCCGATAAGG TTCCAAAAGG 1200

CATCCTGGGG GTAC 1214
(2) INFORMATION FOR SEQ ID NO: 114: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9458 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:
ATTTTGGTTT CATTCACGAT GGGGTnATAC AGCAAACACA nCTAAAATAA CTATCAATAG 60 
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CTTAGACAAT AAAAAATATG CCACTACAAT CGCTAATATT ACGATTAAAA AAGAAGCGTT 18 0 

AACGATTACT TTCATCGTTG TTCTATCTCT GAACATCATA TTAAAGACAA CTAGACTAAT 24 0 

TGATAATGAA ACAGCAAAAA AAGTAATAGC TAACACTAAT TTCATCATAA ATAGACAGAC 300 

TAAACCTATG ACTAATAATG TATTAGAAAT TACAGCTGAC GTTTTTAACA TTCTCGaATT 360 

AATATGCACT CACCCTTTTT ATTTAAATAA CTTACATAAT CATAATAATA CATGATGTTT 42 0 

CATAGGCCTG TCGATGATTG ATTCACAATA GCACGTGATT TTTTTGTTTT TCAATATTAT 4 80 

TCATTTATTC CATCAAAAAC ACCCTTTTTA ATTTTTACAA AAATTAAAAA AAGTGCTCCT 54 0 

ACACTGCTTG CATGTAGAAA CACTTTTTCA TTGTAATGTT ATTCTTCTCG AG ACATAC CT 600 

TTTAGCATAT TAAGCATGTA TGTTAAACTA CGGTTCATGT CGTCATCTTT CAATACGCCC 660 

AATAGACTTC TTATAGTTGT CTTAGCATTT GGACTCGCTT GATTGGCAAC GTGTAATCCT 72 0 

TTATTAACTT TATTTAGGAA GTCGCTTAAA TCTGATACAT TGAGTTCACC TAATAAAAAT 780 

ACCATTGAAG CCATATTAGA TAATAGCCCT GTATAAATAT CTTTATTAAG TTCAACTGCA 84 0 

AATTTATTTA TGATGACTTG ACGTCCTCGA ATTGCACCAT TTAAAGCATC TAATAGTTTT 900 

25 GCATCATCTA ATGTTTTAAT AAGCTTGATT GCTTTTAATA TACTATCTTT ATTCGCTGCA 960 

ATTGCCTCTG TAACTTCATT TAAACTTTCT AACTTAATTT GTTCTTCTGA TTTTTCTAAG 102 0 

CGTCTAATTT TAGAAGATAT TCTCTCAGCC ATTATTTATC CACCTGATTT CCCGGGAAAA 1080 

CATAATCTGA ACGTTCCCAT TTTTTCTGTA CTTGAACACT GTACTGCGGT TGACGTTTTT 114 0 

TATTGACACG GAAATTATTA GGGTTCAACG GTGACTTACC ACGTTTCGTA ATTACCTCCA 12 0 0 

AACGACAGCT AGTACGTTTA TAAGATGGTG TATCCGTGTA TTGATCAACA TCACT a TT AG 1260 

TTAATAAGTT AATTGCACCT AGATCTCCAT TTTCCAT CGC aTCaTTATTT AATGGAATAT 1320 

AGAT3TCTTT ACCTTTAACAT CGATCTGTCA CGTGAACTTG TAATACCGCT TCTCCTGTyT 138 0~ 

CAGAAATCAG CTTAACTTCT GCACCTTCAT GAATGCCTCT ATCTTCAGCA AGCTCTGGAG 144 0 

AAATTTCAAC AAATGCACGT GGCACTTTGT ATTTAATCAT TGGTGTTTGA TAAGTCATAT 1500 

TACCTTCATG GAAGTGCTCT AACAATCGAC CATTGTTTAC ATGAATATCA TAAATTTCAT 1560 

45 CTTGCTTAAA GTAATTATCA AATGATAATG GGAATAATTT TGCTTTACCA TTATCAAAAT 1620 

TGAATCCTTC TAAGTATAGA ATAGGCTCAT CAGTACCATC AGGTTGTACT GGCCATTGTA 1680 

AACTATTGAA TCCTTCTAAA CGATCATAAC TTACCCCAGC ATATAGAGGT GTTAAGCGTG 1740 

60 CTACTTCATC CATAATTTCA CTAGGATGCT TGTAATTCCA ATCAAATCCT AATCTATTAG 1800 

CAATTGCTTG GAAAATTTTC CAGTCAGGTT TTkAATCACC AAGAGGTTCT AATGCTTGGT 1860 
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TTGCTGGCAA TACAACATCT GCGTATGTTG CTGTGAATGT TAAAAATTCA TCTTGGACTA 1980 

CCATGAAATC TAATTTTTCA AACGCAGCTT GTACAAAATT AATATTTGAA TCCACAATAC 2040 

CCGTATCTTC ACCATATAAG TACAATGAGT GTACTTCTCC GTCATGTATA CCTTCTACCA 2100 

TTTCATGATT ATCTTTACCA GCTTTTGGAT TCAATTTAAC GCCATATTCT TTTTCAAATT 2160 

TAGCGCGAAT ATCATCCGCT TCAATACTTT GATAACCAGT AATCTTATCA GGCATACTTC 2220 

C CATAT CACT ACATCCTTGA ACATTATTAT GTCCACGTAA TGGATACGCA CCAGTACCAG 22 80 

GACGACGATA ATTACCTGTT ACTAATAATA AGTTTGAAAT CGCTGTACTT GAGTCACTAC 234 0 

CAATGTCTTG TTGTGTAATA CCCATTGCCC AACAAATTAC AACAGATTCA GCTTTAGCAC 24 00 

ATTCTTCAGC AAATTTAATC AATTCTGATT CAGGAATACC TGTTGCTTCT TCAGCAAAAG 24 60 

CCATTGTAAA TGTTTCTAAT GATTTGTAAT ATTCATCAAA ATCATCTACC CACTCATCAA 2520 

20 TAAATGCTTT ATCGTGTAAA TCATGATCAA TAATATACTT AGTCACTGCA CTTAACCACG 2580 

CTAAAT CCGT ACCTGGTTTA GGTTGATAAA AACGATCCGC ACGTTCTGCC ATTTCATGTT 2640 

TTCTAATATC AAATACATGT ATTTTTTGAC CAAATAATTT TTGTGCACGT TTCATGCGTG 2700 

ATG CGATAAC TGGATGAGCT TCGG CTGTAT TAGTACCTAT CAATACAGAC ATTG CCGCTT 2760 

TTTCTAAATC TTCAATACTA CCTGAGTCAC CGCCGTGTCC AAC CGTTCTA AATAAG CCTT 2820 

TTGTTGCAGG TGCTTGGCAA TATCTTGAAC AGTTATCAAC GTTATTTGTG CCAATAACTT 28 80 

GTCTTGCTAA TTTTTGCATT AAATACGATT CTTCATTCGT CGCTTTAGAA GAAGAAATGA 294 0 

ATGATAGTGC ATCTGGGCCA TGCTTTTCTT TAATAGCTGT AAAATTATCT GCAATGACGT 30 00 

TTAAAGCTTC ATCCCATTCT ACTTCATGGA ACTCACCATT TTT CCTT ACT AG TGGTTT AG 3060 

TTAATCGTTG ATCTGAATTA ATATGTCCCC ATGAAAACTT ACCTTTAACA CAAGTCGCAA 3120 

TTTTATTTGC TGGAGAATCA TGTGATGGTT GTACTTTTAA AATTTCTCTA TCTTTAGTCC 3180 

40 AAACTTCAAA TGAACAACCC ACAC CACAAT AAGTACACAC TGTTTTAGTT TTCTTAATAC 324 0 

GCTCTTTACG CATTTCTGCT TCTGAATCTG AGATTGCAAA TAGTGGACCA TAACCAGGTT 33 00 

CTGCTTTTTT AGTTAAATCA ATCATTGCTG CTAATGAACC AGGTTCCGTA TCAGTCATAT 3360 

45 AACCCGCATT ACCTTCCATA TTCACTTCCA TCATGGCATT ACATGGACAT ACCGTCGCAC 3420 

ATTGACCACA AGATACACAT G AAG ACT CAT TAATCGGTAC ATCATTATCC CAAATAACAC 34 80 

GTGGATGTTC ACGATCCCAA TCAATTCTAA TAGTTTCATT CACTTCGATA TCTTGACATG 354 0 

SO 

CTTCTACACA ACGCCCACAT AAGATACATT GATTTGGATC ATAACGATAA AATGGGCCGT 3600 

AATCTTTTTC GTATGGCTTC TCTTTATATT CATACGTTTG ATGCTGAAGC CCCCATGCAT 3660 
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TATGCTTTTC TAAAATTCGA TCAAGCGCTT 
CTTTTTGAGC ATCTTTCACA TCATTGTTCA 3780

CAGTATTTAC AGTCATTGGA CGATCAATCA CCGTACTACA TGAACGTTCA ATTTTACCGT 3840

CAATCTCAAC AGTACATGTA TCACATGTTT GAATTGGTCC CATCGACTCG TTATAACAAA 3900

TTGAAGGTAC AAAAGTATCT TGTGATTTAA TAAATTCAAG TAAATTCGTA CCTGGTTCTA 3960

CAAGATAATC TTTTCCATCA AGTGTAACCA CCAAATGTTC TTGCATATTA CTCACCCCGT 4020

CTATATATAT TTTCCGTAAA TGACTTTTAA TAAATTGCTC ATATCCACCT AAAATAACGA 4080

TGCCCCACAC ATCTTTCAGA TAGAATTAAT TTAATTGTAT TACTTTATGT ACTAGTTGTT 4140

AAGTAAAATT TTGTATTTTG CCTTTTTACA ATCATTTTTA TTTGAAATAT TTTGCGCGAA 4200

ATTAAATCAT CTTTTTGTTT AATTGAAAAT AATTATCATT ATTAGTTTTC CAATTATCTG 4260

TTTCACGCTT TTTGCCATAT CTTTCACAAC CTTATTAATG ACAATATTTA ATAATCACCT 4320

CACCTAAAAA TCGTTATACT ATTTATAAAT ACCCTTTTTC TGAAAATTAA TAACCCAAGT 4380

TTGATAAATA TCTACTATCA TTTAGAAGGT AATATTTATC TTTAAATTAA ATTTGTAATG 4440

GATTAATTTA TAAAAATCAA ATCAGGCATT AAATAAAATA GCCCATAAAT ACAAAGTGTT 4500

ATCACCTTCT ATTTACGGGC TATTAGTTCT ATTCGTTATT CTATTTACAG ATCATTCTAT 4560

CTAATTAATT TGTGTACAAT TTTGATAACT TATTTTCCCT TAGTTTACTA CTCTAGATTA 4620

TCTTTTAATA ACTTAGTACT TTCAGCTTTT GACTGCTCAC TAGGAATGAA GTAGTACAAT 4680

CCGTCACTTT GAATGCCGCC TTGACCACTC AATTGATGTT TATTAATCGT GTCATTAGCA 4740

TCTTTATAAT TGCTTCTAAT CGTATTCAAA TCACCTAATG TTAAATCTGT TTTAACATTA 4800

TTTTGAATTT CATTCATTAG ACTATTAAAA TGTGTAATCG ATGATGGGCT TGCAATCTTA 4860

TTGGCCATCG CTTCAAGCAC AATTTGCTGA CGTTGTTGTC GACCAAAGTC ACCACCAGCA 4920

CCTTCTTCTT TACGACTTCT AATAAACTTC AATGCTTGAT ^lACCATTTAC ATGTGTCTGC 4980

TGTCCTTTTG TAAAACGAAC ACCATCAACA GTGAATGTAT CATTACTTAC TACATCAACA 5040

CCGCCGATGC TATCTATCAT ATTATGCAAA CCATCCATAT CGATTGTCGC ATAATGATCA 5100

ATTGGCACAT TCATTAATTT TTCAAGTGAT TTAACAGCCA TATTTGGTCC ACCATATGCA 5160

TAGGCATGTG CAATTTTTTC AGTAGTACCA CGGCCAACAA TTTCCGCTCT TGTATCACGC 5220

GGTATACTTA CTATTTCAGT TTTCTTCGTT TTAGGGTTGA TAGATAAAAT CATAATACTA 5280

tCACTACGCT CTCCGCCACC CTTTTTCTTA CGATCAGCAT CTGAATCGAC ACCAAATAAA 5340

GCGATTGTGA ATGGATCACC ATCGTTTAAA CTCACTTTTT TATCTCTTAA TTCTGAATGA 5400

TTGCGATCTA ACGGATTGTG TATCTTATTA CCAGTAATAA AAATTTTAGC AGCTACATAC 5460
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GGTAGGCTCA TTTTACTTTT AGACGAACGT TTCAATCCCA CCACTCCTTT ACTATTCCTT 5580 

ACATACTTTG TCTGTTTTCT CTATTTATTA TATAGTAAAA TAATTTTTTT ACTATACTTC 564 0 

TGTAGACGTA TAACTATTTT TTATCATTTT TTATCTCTAG AGAATATCTA TCTGTATTTT 5700 

TGATAACCAC CATTTGCATT TAAAATTTTA AGTACCGTTT CATGACATGC TTTATTACTT 5760 

ATAATAAAAG GTG CACCCTT TAAATGATCA ATTG CCTT AC CATCTAAAGT CGTCATTTTT 5 820 

AGATTCAATA GTTCTGCAAA TAAAAACTGT GCAGCAATGT CCCAAGGTTT AGGATTTGTA 5880

TTAATATGTG CCCCAAATTG ACCTTTTGCC ACTCGCATAG AATCTAATCC GCAAGCACCA 594 0 

ACTAAACGAT AACTAAATGA GGCGTCAAAT AAATCTTGCA CCGTATCTAG ATTCATCACT 6000 

TGTGCATTAA ACGATATAAT AGCGTCTTCC AATTTTAACG ATGGTGGTTC TTCCATCTTA 606 0 

ATTCCATTAC AAAAAGCACC TTCTCCTCGT ATTGCTTTAT AAAGCTTTTT ATGCGGATAA 612 0 

TCATATACGT ACGATAACAT TGGTTTACCT TCATAAAAAT ACGCCAATAT AATACAATAA 6180

TCTTCTTGCT GTTTTACTAA ATTGGCAGTT CCATCAATGG GATCCATAAT CCATAAATGA 624 0 

TTAATTTCAT TCGTAATCAT TTCATTACTT TTTTCTTCCG CTAATAGTTG GTGTTCCGGA 6300 

AAATGTGTTG CTAAAAATTG TTGGAATTGT TGTTGAATCT GTTTATCTAC ATTTGTAACT 6360

AAATCAAATC GATGACGCTT AGTTTCTGTA GTCATTTCCA TAATTAATTG CGGAATAACA 6420 

TTGTCTATTT GTTTCAACCA CGAACATATT AACTTATCTA TTTGCTGTAA TGTTTTATCT 64 80 

GTCATTTCGT CCACCACTTC T CAT AT CATT ATCATTTTAT TATTACCCTA TATTAAAAGA 654 0 

ATCAACAATA CAACTGAAGA CTTCTTCATT TTATGCATAA AAAAATCGGC TAGTCACGTG 6600 

CTAGCCGACA AATAGAAAGG AAAGTAAGTA ATAAATATTG AAGATGTTGT GATGTAACTT 6 660 

GAACGATTAA AAGCTATCTG TTATATAGCT CTACCCCTTT GTTTAATCGC TCCCCCTGTT 6720

ACAAGTAATA TCATAGCACA ATCTTTTTTA AAATGTAAGC GTTTTCCACA AAATTTTTAC 6780 

GATTTTTTTA AAAAGATATT GAAAATGTCC TCATTGTCAC TCTTATGTTA TACTTTGTGT 6840 

AATATATCAT CTTTTAGGAG GTGGCTGTCA TGAATAAAGC TGAAAGGCAA AATTTAATAA 6900

TTACTGCAAT TCAACAAAAT AAAAAAATGA CCGCTTTAGA ATTAGCTAAA TATTGCAACG 6960 

45 TATCCAAACG CACAATTTTA AGAGATATTG ATGATTTAGA AAATCAAGGT GTTAAAATTT 7020 

ATGCGCATTA TGGGAAAAAT GGTGGTTACC AAATACAACA AGCACAATCT AAAATTGCAT 7080 

TAAACTTATC TGAAACACAA TTATCAGCCT TATTTTTAGT GCTTAATGAA AGTCAGTCGT 7140 

50 ACTCGACATT AC CATATAAA AGCGAAATCA ACGCAATTAT AAAACAATGT TTAAGTCTTC 7200 

CACAAACACG CTTAAGAAAA TTGCTTAAAC GCATGGACTT TTATATTAAA TTTGATGACA 7260 
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ATGTGATGTT AGTAGATCAT AGGGTTGATG ATAATATTAA AGCTGAAAAC GTTATATTTA 73 80 

TTGGCCTTTT GTGTAAACAT GGACATTGGC ATGCAGTCAT TTATGACATT GCTCAAGACA 744 0 

AAACTGCCGA ACTCGAAATT GAAAATATTA TAGATATTTC GTATTCATTC GGTAAGACGA 7500 

TTCAAACCAG AGACATATCC ATTGATAACT ATCATCAATT TTTAAACCCC ATCGATTCCT 7560 

AAAAAACAGC AGTAAGATGA TTTTCAATTA GAAAATATCT TGCTGCTGTT CTCTATTTAT 7620 

ACAATACTTC GTATTGAATG GnTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG 7680

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC 774 0 

TTTGAAATAC TTAAAAAAAT AAAACACTTT GCCCAACTTA CACTACCAAT AGAAACTGCT 7800 

GTTAGAATTC CTCAAAATGA TATTTCGCGA TATGTTAATG AAATTGTTAA AAAGATAGCT 7860 

GATAGCGAAT TCGATGAATT CAGACATCAT CGTGGCGCAA CATCCTATCA TCTAAAAATG 7920 

ATGTTAAAAA TCACCTCATA TTCATATACT CAATCTGAAT TTTCTGGCCG TAGAATAGAA 798 0 

AAATTACTTC ATAACAGTAT TCGAATGATG TGGTTAGCTC AAGATCAAAC ACCTTCTTAT 804 0 

AAAACTATTA ATCTTTTTAG AGTGAATCCT AATACTGATG CGCTAATTGA ATCTTTATTT 8100 

25 ATTCAGTTTC ATAATAAAAT GCATATCAAA AAAGCTGATT TCTATCAAAT AATTAATAGA 8160 

AATCAGCTTT TTTCaTTGCC TAAAAACTTA ATGTCC CGAC CTCTTTATCT ACGCATAAAT 822 0 

ACTTATTACT GATATAACGA AAGAAACAAA ATTATTTGCT ATATGTAATG CAATTGTTGA 8280 

ACCTAGGTTT CTTC C AG ATT TTAAATAAGT GAAAACTAAT ATGATGGATA GTATGAGATA 834 0 

TGGACCAAAC TCAAACGGCG ACTTTGCATC AGTCACATGA ATAAATGCAA ATAAGAACAC 84 00 

CGAAACAATA CTCATAGCTA TAAAATTAAA CTTCTTACCT AATTCTCCAA TTAAAATATG 8460 

TCTAAATACG ATTTCTTCAA CTATTGGACC TACAATCACA ATTAATAAGA ATGCTACAGG 8520 

TAAAAATGCA GGCACTTCAA ACATTTTATT TAGCTCAAGT TCATTGGCTG TTtCACTATA 8580

TTGCAAATGT TTAGGTAGAA ACTGTGTCAT ATATTCATAT GTATAAATTA AG ATGAG AG C 864 0 

AATAATATAC GTTATTGACA ATCTAAGCCA ATATTTTTTG ATATACGCAA AACCAGCTCG 8700 

AAGCCTTGAT GGCATCACTT TTAAATGAAA TAAATAAAAT GCGCCAATCC CAATCGTATA 8760 

45 TGCTAAAGCT TGTGTGATAG TCGCTACAAA TATCAGATTA CTATCGATTT CAT AATAAC C 8820 

AAACAAAATT GGTCCTATGT AAGCTGCAAT TGTGAGTGCA TAAAATATAA CACCTATAAT 8880 

TGGAATTATA AGCAAATCTC TCCATGCTAT ATCTTTAAAC GTGTATTTCT TTTTTTCATT 894 0 

60 TTCCaCTGTT ATATCCt TTC CTGTTTAATA ATTGATTTTT GGAGGTACTT CTACATGATA 9000 

AACGAAACTA AGTATATGAG ACAACAAATT ACTAATTTGA TTCAAATCAT TGATACGATT 9060 
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ATAGTTACTA ATGAATTGAA TAAGTTCAAA GGCTTTGAAA CATCATATAT AATAAACGAA 91B0 

AATCAAGTTT CCTATTATGA AATTATAACA CTACTTAATA AACGTCCCCT CgACAAGTCG 924 0 

ACTATGGTAA CAAAATTCAA TATCTTAATT TTTATCATAC AGAACTATCT AACGCATTAT 93 00 

TTGCAATTAA ATTTGCCCAT TAACCTATTT TTCATAAAAT GTCATTTAAA CAAGTTATTT 9360 

ATTAAAATTC ACTTTATTAC ATAAATTATA CAATTArAAA GTTTCTTCAA ATTGTAAAGA 9420

TGCATTAATC GAGTTATAAT CATAATGATT AAGATGGT 9458
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

AnGCGTATCA TGTCACGCAT TTTAACTACT TCTTTACCAC AAGATTATAC AGTCACATTA 60

GTTGATCGTA TGCCATTTCA TGGATTGAAA CCAGAATTTT ATGCTTTAGC TGCGGGCACG 120

AAATCAGATA AAGATGTTCG TATGAAATTC CCTAATCATC CACAAGTGAA TACAGTTTAT 180

GGTGAAATTA ACGACATAGA TTTAGATGCT CAAATTGTCT CAGTCGGTAA TTCTAAAATT 240

GATTATGATG AGCTAATCAT TGGTTTAGGA TGTGAAGATA AATATCATAA CGTTCCAGGA 300

GCCGAAGAAT ATACACATAG TATTCAAACA CTCTCAAAGG CTCGGGATAC TTTCCATAGT 360

ATTAGTGAAC TACCAGAAGG TGCTAAAGTC GGTATCGTTG GTGCTGGATT AAGCGGCATA 420

GAACTTGCCA GCGAATTAAG AGAAAGTAGA TCAGACTTGG AAATATATCT TTATGACCGT 480

GGGCCGCGAA TTTTAAGAAA TTTTCCAGAA AAATTAAGTA AGTATGTTGC GAAATGGTTC 540

GCCAAAAATA ATGTTACCGT TGTTCCAAAT TCAAATATTA ATAAAGTTGA ACCTGGTAAA 600

ATATATAACT GTGATGAACC TAAAGATATT GATTTAGTTG TATGGACAGC AGGAATTCAA 660

CCTGTTGAAG TTGTTCGTAA CTTGCCGATT GATATAAATA GTAATGGACG CGTGATAGTT 720

AACCAGTATC ATCAAGTACC AACATATCGT AACGTCTATG TAGTTGGTGA TTGTGCTGAT 780

TTACCACATG CGCCAAGTGC TCAGTTAGCC GAAGTTCAAG GTGATCAAAT TGCCGATGTG 840

CTTAAAAAGC AATGGCTAAA TGAACCATTA CCTGACAAAA TGCCGGAACT AAAGGTACAA 900

GGTATCGTTG 910
(2) INFORMATION FOR SEQ ID NO: 116: 
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(A) LENGTH: 10182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

TTTTTGATTC AAAGTGGTGA TTTAACAAGC ATTTTAAATA GCAATGATTT GAAAGTCACA 60

CATGATCCTA CCACTGATTA TTATAATTTA TCTGGTAAGT TGTCGAACGA TAATCCAAAC 12 0 

GTTAAACAAT TAAAACGTAG ATATAATATT CCTAAAAACG CATCAACAAA GGTGGAATTA 180 

AAGGGAATGA GTGATTTAAA AGGCAATAAT CATCAAGATC AGAAACTTTA TTTTTATTTT 24 0 

TCAAGTCCTG GAAAAGACCA AATCATTTAT AAAGAAAGCC TTACTTATAA TAAAATAAGT 300 

GAACATTAAT ACTTATGCTG TAATTATAGA AACATCCAAA TCATCTATTA nAATCCTATA 3 60 

TTATAAAAnC ACCTCACATA ACTCGTTCAA CTGTACCAAA CCACATTACA TTAGATTTTA 420 

GGCTAACTAT TGTGATGTAC ATCAAAAACG AATTTGTGAG GCGTTGTATA TTTTACAAAG 4 80 

GTGACTAGCG TTTCGTATAG CATTTCCAAC ATTACTACAC TCAAGCGTCA CGCTAAAGTT 540

CGAAATCGAA TCCTTTCATT CAACAAAAGC TCATATCCAC TACAAACTTC ATATCAAGCG 600 

TATAAACTAT CTTGTGATAC TATCTCGATC ATATCTATAG TATGCATTTG TGTTCCGTTT 660 

30 CACTGAAGTA TATGTATCAT CAGTTAAGTA TAAACCGTCA TCCTTCAATG TTACTTGATA 720 

AGCATATTTC CGTGCTAACC AGG CAATATC TATATAATTT TCTCCTG CGT TTTCATAACT 780 

TCTTAAATCT TCAATATGTG CACTAACTTC AGGGaAAATG ATTCTAACAA CACTTTCATC 84 0 

AACCCAATAT TTGTCATGCA TCCATCGCAC TTGATCTGCC AATAAAGGTA ACTGCACATC 900 

ATTGAAATAT AGACGAAAGC CGTCACTATC ATACATTTGC CGATATGGTA ATGGCTGTTT 960

TCTAATCACT AACACCTCGC CACCCATTAC GGTGCCTTCT CTAGTATCAT CACTTCCACC 1020 

CGAAGCTTCA TACGTTGTTG GGTCAACCTG TAGTCCATGT ACATCTCCAA TATAAGCATC 1080 

TGG TTTATGT TCCATTGCAT GTCCATGTGC AATCAATGCT AATATTGTAG ATTGTGAAAA 114 0 

TTGAGGCTCC CATTCAATGC GATTAGGATG GCTACTATAA ATTCTAGGTT CATCTATAGC 1200

CTGCTGAATA TCCATGCCAA ACACTAATAC ATTGATTAAT GTTTGCGCAA CACTAGCAAT 1260 

GATACTTATG GCACCAGGTG CACCTACTGT TAATATTGGC TTCCCGTGAT ACATCACAAT 1320 

SO CGTTGGAGCC ATGTTACTTA GTGGTCGTTT ATATGGTGCA ATTTCGTTAA TACCACCATC 138 0 

TACTACATCA AAGCCATCCA TTGTCGTATT CAATAACACA CCGTAGCCTG GAATCGTGAT 1440 

ACCTGAACCA TAAATCATAC CAATTGATGT CGTAAATGAA GCAATATTAC CTTCCTTATC 1500 
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ATCAGACACA ACACCATGCT CTATATCAAT ATTTGCTTTA TTGCTATCAA TGAGCGTACT 1620 

GCGTGCTTTT AAATAATCAT CATCAATTAA TGACTGTACA GGCACCTCAT GAAAATTATC 1680 

ATCCGCCAAG TATTGCGCAC GATCACTATA TGCTAAATGC ATCGCTTGTA TCAAATGATG 1740 

CAAGTAATCA ACAGATCTTG GACCCATAGA TGGTAAATCG ACATGTTCTA ATAACTTCAA 1800 

TATTTGAATT ACCGTGATAC CGCCAGAACT AGATGGTCCC ATTGaATAAA TGTCATAGTC 1860

TTTAAATGTT GCACTGATTG GCGCTTTAAT CTGAATGTCA TATTTGGCTA GATCCTCTAA 192 0 

AGTGATTGTC CCACCACATG CTTTGACAAC ATTGACTAAT TGTTTCGCAA TGTCACCTTT 1980 

ATAAAATGCA TTAAACCCTT GTTCTCTTAA TATTTGAAAT GTCTTACCTA ATTCGGGTTG 2 04 0 

TACAATCCAA TCACCTTCAC GCCAATATTG ATTTTCATGC GTAAATACTT GTGCCGTTTC 2100 

ATGATACTTT GTCAATCGTG CGTGTTGCTG GCGCGAATAT TTTTGAGTAG CCCAATTGGC 216 0 

20 TGCATGACCT TCAATGGCTA GTTCAATTGC AGGATTAATT AAATCTTCCA ATGACAATTT 2220 

AGCATAACGC TTGTGAATAT AATCAAACAG CTTTGGAATT GCTGGCACAG CGACAGTTTT 2280

ACCATGTGTA GTCATATCAA AAAATGATTT ATATTCGCCT GAATCATCTA GATAAAATTG 2340

TTTGTCTACA TGTTCAGGTG CTGTCTCACG TGCATCAAAC GCAGTTATAC TGCCAGTACT 2400

TTGCTCATAA TATAGCAAAT ACCCGCCACC ACCAATACCT GATGCAAATG GTTCTACCAC 24 60 

ATTCAATGCC AGTTGAATTG CAATCACTGC ATCCATGGCG TTGCCACCTT GATCTAATAC 2 52 0 

ATC CTTACCA ATTTTAGCCG CAAGAGGATG TGATACGGAA ATTAACCCTT CTTTAGATGT 2 580 

TTTTGTCTGT TTGTCATTTA AGTTAATGAC CATACTATAT CCTCCTACTT TCTGTTAAAT 264 0 

ATTTAAAACA TTATTGATTA ATGGCTTTTT CTACTTTTTC TAAATCTTGA CGTTGCTCGT 270 0 

TACCAGTATC GACAAGTGGT GTAATCGGTG ATGCAATTTT AAATTTATCG CCACGATAAA 2760 

ACTTAATAAA TTGATCCTGA TCTATCGCAT TAACTACTGC TTGTCTCAAG TTTGGATGCG 282 0 

40 TCTTAAATAT ACCTTTTTTA ATATTTAGCA TTAAAAAGAC TG ACTTGCG T CCATTTTTGC 28 80 

GAATAATGCT TAAATTTTTA TCCGACTTAA TTAAATCAAA ATGTTTTTGA TTCACATCTG 2940 

CCAACATATC AATTGAATGA TTTCTAAGTT CTGACAATGC ATTATTCGGG TCACCATTAA 3000 

ACTTCAATGT AATATTTTTA ATTTTAGCTG GTCCATAACT ACCTTTTTCT GTTTCGTTGA 3060 

ATCCTGGATT ACGTTGAAAC GTTGCTTGAT ATGCATTTTT CTGTGTCATA ATGTATGCGC 3120 

CACTTGCATA CAGCGCATTT TTCCCATCTG AATTTGCAGG AATTGTACTG CTATCCCCAT 3180 

ATCCTTTTGG ATATTCTTGA TTTACTTGAT TAACAAATTT TTTAGATAAA ATGCCTGCCG 324 0 

AAGAGTGTGT TAAGTAATTT ACCTCTCGAG GCATCGATTG ATCTGTCGTA ATTTTAACAA 3300 
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TATAAGCTTT AATCAACTTA TCATAGATTG ATTTATCGTC CTTGTCTTTC TCTTTACGCA 3420 

ACTGATCGAT GTCCTCATCT TTTAATATCT TGATGTCATT TATATGTTTG TGCATATTGT 34 8 0 

AAGTATTATT GTTAGGCACA GACTTTTTAT CACGTGCTCT ATCTAAAGAA AACTTAACAT 3540 

CTTCAGCCGA TACACGCTCT CCAGTATTAC GTGCTTGTCC ATTGACCACT TTCGCAAAAT 3 600 

AATCATCATC TCTTAACAAG AAATAAAATG CTTTATTGTC CTTATTCACA GCATAATCAT 3660 

GACTTAACGA ACCTTTCGTT GTTAAATGAT CATTTTCATC TAATAATAAT AACCTTGTGT 3720 

ACATATTCAT ATTAATTGAA TATACTGACG GCGCAATTGA ACGTATTGGA TCCAATGTAG 3780 

GAATTTCACC ATCTTGTTGT GTCATCACAA GTGGCCGCGT ATCTCGTTCT CTACTATTGT 3840 

TGTAATCAAA TTGTTGCCAT ATTAATG CAC GTGAATTTGG CAATCCAACA CTATTTTTAT 3900 

CTAACACTTT ATTGTCATAT ACTAAATTCT TTTTTGATCC ATATAAAGGC GCCATATACC 3 960 

20 CTTTATCAAA TACAACTTCA TCTTCAATTT GCTTATATGT TTGTTTAACA TCTGCTTCAT 4 020 

TTTGAGTAGA AGCTTTATTT AACAACTGGT CTACATGTTT ATCTTTCAAT AAACTATTTG 4 080 

ATCCTGTAGA ACTAAATAAT GCCGTCATAG CATAGTTCGG GTCACCAAAC ACTGTCATCC 414 0 

25 AGTCATCAAT TTGGATATCA TAATTGCCGG CTTGACGTTG TGTACGATAG CTACCATAAT 42 00 

CTGGTTGGAT ATTCATCTTC ACGTTAAATC CTGCATTTTC CAATTGATCT TTAACGATAT 4 260 

TCATATCATT TT CAT AACTT GCTTGTCCTA GGAAATGTAT TGTTGGTCGC TCGCCTTTCA 4 3 20 

CTTCAACTTT CGATGACTTT TGAGCCACTT CTGATTTCGT AGGGACACCA CAACCACTTA 4 3 80 

ATACCAACGC TAAAACTATA ATTG CGATAC TAATGATTTT CTTCACATCT ATCCCTACCT 444 0 

TTTTAATGAA TTCTTGGATC TAGTGCATCA CGCACTGCAT CACCTATAAA ATTAAATGCT 4 500 

AAAACGACGA ACATAATACA AACACCAGGT ACAATAGCTA AATTACTGTG CGTTTCCAAG 4 560 

TAGCTACTAC CGGTACGTAA AATGTTGCCC CATTCAGCTA CATCAGGTGC AACACCAAGT 4620

40 CCTAGGAAAC TTAAACTACT TGTTGTTAAT ACAACCACAC CTATATTTAA TGAAAAACGT 4 680 

ACAATCATAG GCGCAATCGC ATTCGGTAAA ATATAACGCC ATATGATATT CCAAGTGTTT 474 0 

TCACCAGTGA TACGTGCTGC ATCTACATAT TCCATGCGTT TAATTTCTAA AACACTGGCA 4 800 

CGCATTGTCC GTGCAAATGA TGG TAT ATT A CCGATACTTA AAGCAATAAT TAAATTTGGA 4 860 

ATACTTGCTC CAAATGATGC AATAATTGCC ACCGCTAACA ATAATGATGG AATTGCAAAC 4920 

ACTACATCTA AAATTCGCAT TATTAAATTA TCAATATGAT TAAAATAACC TGCGATAGTG 4 980 

CCTAGTAACA CACCAAAAAT AACTGCAATA ACTACTGAAA TAATTGAAAT TGAAAATGTC 5040 

AGCTTCGTTC CTACAACTAC GCGTGTAAAT AAGTCTCTAC CGAAATCATC AGTAC CAAAC 5100 
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GTATCAAATG 


TAAATTGTGA 


CACAATTGAT 


AATGTCAGCA 


TGTAGACTAA 


AATAAGTAAC 


5220 




CCGATAATCG CAATACGATG TCTAGTAGTT TTTCGTATAA ACGATTCCCA CCCGTTATAA 5280

CTATGTATTT GCGATGTACG TTGGTAACGT CTAATACTTA CAAACATTAA TAATGTAAAT 5340

ACGTTGCCTG TTAATGTCAT CAACAATAAC AACACTTCGA CGATACGTCG CCATAGGTCA 5400

TGATGCTTCC ATGTTTGTTC CGTTGTTAAA ATAATAATTA AAATGATGGT TAAAACGATT 5460

AGCAATGTTT CAGCAATATA GAACGTATCG GCCACATAAC CTTTAAAAAG ATTTAATGCA 5520

CTCGTTAATA TAACTAAAAT ATAAGTTGCT ATGGCGTAAC TTGCGAATAA TTTTAAGGAA 5580

GCTATCTTTG AATTAAGTTG TGCCATATGC CTCACTTCCT TTCGTTGATT TCACTACGTA 5640

ATTTTGGATC GATTAAAGCA TAAAATATAT CAATAATTAA GTTTGCTAAA GATATTACAA 5700

TTGATATATA TACGACCCCA CCCATGACTG CTGGAATATC AGGTATTAGT TGTTTTTGGA 5760

CGATATAACG CCCGATACCA TTAATGTTAA ATACTTGTTC CGTCACTGCT GAACCGCCTA 5820

GTAACTCTGC CACTAGAAGA CCAACTAACG TTACAATTGG AATAATGGCA TTTTTCAAAA 5880

TATGTTTAAT AACAACTTGT GTCGTCGATA ATCCTTTTGC ATAAGCAGTT AAAACATAAT 5940

CGctGCGCAT TACTTCAAGT ACAGAAGACC TTGTCATACG CGTGATAGAA GCAGCAATAC 6000

TTGTTCCAAT GACAAGTACA GGTAAAATCA ACGATATTGG ATGTTCTGGC ATATAAGATG 6060

GTGGGAAAAT ATCCAATTTC AATGAGAACG CTAAAATGAA TAATAGCCCT TGCCAGAAAC 6120

TTGGAATAGA TAAACCAATT AATGCAATTA TCATTAACGT GATATCAAGC CAACTATTTC 6180

GCTTCATCGC ACTGATAATA CCAATTGGTA TTGCAATAAT TAATGCCACC ATTAGCGCTA 6240

ATACTGCGAC AATTATTGTA ATTGGAATTC TTTCGCCAAC TGCTTTAGTC ACAACCTCAT 6300

TCCCTTTGTA AGTCGTACCT AAGTCAAAGG TAAAAACACC CTTGATGGTA TCCCACAATT 6360

GAATSAAATA AGGTTCGTTA AGATGATGTA ATACATTGAA TTGATGTATC TGTGCCTTTG 6420

TTGCATTTTG TCCCAGTATG CTATAAGCCG CATCAAGCGG TGAAAAATAC AGAATGGTAA 6480

ACACACTGAC AATAACACCA ATGATGACAA TCACAGCCAT GACAATTCGT TCAAAAATAT 6540

ATCTAACTAA TGGCTGTAAA TAAAAAGTCA ATAAGATGAA CATCGGCAAG GCCAATATCA 6600

CTTTGATCAT GATGAACTTA TGAAATAATA CATTTTCAAA GTATGTTGAA AAATGTGCTT 6660

GTTCAATATT CTTTGAACTC GTATTAGAAC TTTGTGCCTT GAATATTTTT AATGCTTCTT 6720

TATGTATTTG TGTGGATGAC TTTTGCTGCG ATAAATATTT ATATTTTTGA TGTAACGCCT 6780

GTTCAATTTC TGAAATTTCA GAATTATTAG CGTAAAAATT TTTCCTCTTA GCAGAAAAGA 6840

AAAACTTTAT CACTGCATAT AAAAATATTG GCAAGCTTAA TACCGATAAT ACAAACTTGT 6900
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CTTGTAAAAT AATCTTGAGT AGATTACTAT GATATACAAA AGTATAGAAT AAATTTACAC 7020 

ATTTGTGaAT AGGGAGGCAC AACATCATGT CAAATTTATT AGAAGTCAAC AGTCTGAATG 7080 

TACAATTCAA TTATGATGAA ACTACAGTTC AAGCGGTAAA AAACGTCTCT TTCGAATTAC 714 0 

GAAAAAAACA TAT CCTAGGT ATTGTTGGTG AATCAGGATC AGGAAAAAGT ATTACCGCTA 7200 

AATCTATTTT AGGGCTACTA C CAG ATT AT C CAGATCACAC ATTAACAGGA GAAATTATTT 726 0 

TTAATGGGCA ATCGTTAAAT AATTTATCAA CTTCAG CGTT ACAACAAATT CGAGGTAAGG 73 2 0 

ATATTTCAAT GATTTTTCAA GATCCACTCT CTTCGTTGAA TCCAAGATTA ACGATTGGCA 73 80 

AACAAATTAC AGAAGTAATA TTTCAACATA AACGTGTATC TAAATCTGAA GCAAAGTCGA 744 0 

TGACAATAGA CATTTTAGAA AAAGTAGGTA TAAAACATGC AACTCGACAA TTTGATGCTT 7500 

ATCCACATGA ACTTTCTGGT GGTATGCGTC AACGTGTCAT GATAGCAATG GCATTGATTT 756 0 

20 TAAAGCCACA AATTTTAATC GCAGATGAaC CAACAACGGC ATTAGATGCC AGTACACAAA 7620 

ATCAATTACT GCAGTTAATG AAGTCCCTTT ATGAGTACAC AGAAACATCT ATTATTTTTA 7 680 

TCACTCACGA TTTAGGCGCT GTGTATCAAT TTTG CGACGA TGTGATTGTA ATGAAAGATG 7740 

GAAGTGTCGT TGAAAGTGGC ACGGTTGAAA GTATTTTTAA ATCGCCACAA CATACCTATA 7800 

CAAAACGCTT AATAGATGCG ATTCCTGATA TTCATCAAAC GCGTCCGCCA AGACCGTTAA 7860 

ACAATGATAT TTTATTAAAA TTCGATCGCG TGAGyGgGAT TACACATCAC CGAGTGGCAG 7 92 0 

CCTATACCGA GCAGTTAATG ATATTAACTT GGCTATTAGA AAAGGCGAAA CATTAGGCAT 7980

TGTCGGTGAA TCAGGGTCAG GGAAATCGAC ATT AG CT AAG ACGGTCGTCG GTCTAAAGGA 804 0 

AGTGTCAGAA GGCTTTATTT GGTATAACGA ATTACCATTA AGTTTATTTA AAGATGATGA 8100 

ATTGAAATCT TTACGACAAG AGATACAAAT GATTTTTCAA GATCCATTCG CATCTATTAA 8160 

TCCAAGATTT AAAGTCATTG ATGTGATTAA ACGAC CACTA ATCATTCATG GGAAAGTCAA 8220 

40 AGATAATGAT GACATTATTA AAACTGTCGT ATCGTTGTTA GAAAAGGTTG GCCTAGATCA 8280 

AACTTTCTTA TATCGCTATC CACACGAATT ATCTGGTGGG CAACGTCAGC GTGTAAGTAT 834 0 

CGCGAGAGCA CTTGCTGTTG AACCTAAAGT GATTGTTTGC GACGAGGCAG TGTCCGCTTT 8400 

AGACGTTTCA ATTCAAAAAG ATATCAT CGA GTTATTAAAA CAATTACAGT TAGACTTCGG 8460 

CATCACTTAT TTATTCATCA CACATGACAT GGGTGTTATC AATGAAATAT GTGATCGCGT 8520 

TGCAGTTATG AAAAATGGCG AAATCGTTGA ACTGAATAAC ACAGAAGATA TTATCAAACA 8580 

TCCGCAGTCA GACTATGCAA AGCAACTTAT TTCAGAAGTA GCAGTTATTG CTAAATAAAA 864 0 

GTCATGCGTT GTGCAACTTT AT CACTGT AT GGTCTGAAAT AAATTGCGCG ACTTCTGATG 8700 
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TAT CAAGTTT TAGGTGCTTT GCCATGATTT AAGAGTCACC CCCATACTTT GGGCATTTTA 8820 

ACGCCAGAAT AAATCCCCCG CCACTATGTG AAGTGTGGGG GATTATTTAT ATTTTATTAG 8880 

AATATTCAGA TTTTTGAGTG TGTCAACTTA GCTTAGTCAA TGTATATTTA ACGTCACTTA 894 0 

CTCTTTTTCT TTCATAATTA ACACATTCAA ATAAACTTTG ATCAAAAAAC ACAAAGTTAA 9000 

AAGTACCATC TTGTAATATG CTCTCATACA TTATCCCGTC ATATTTAAGG CTTCGAATAT 9060 

AATCAG CTAA ATATTGAAAT GGCAAATAAT CTATTCCTTG TTCATCGCTT GGATTTGTTA 9120 

TTCCTTTATG AATCTTTTTT AATGTTTGGT AATTTACAAA ATACTTTCTA AATCCATCAT 9180 

CGCCAGCTTT GATTGCATTA CTAGTTAAAT TAGTTAAATT CGCAATTTTC AATTTCTCTT 9240 

TTGTCACGTT TTTTTGTAAC TTAACCTTAC CTATATAAAT AATGTCATTA TGCTTAGGTT 9300 

TAACTTCTTC TATACTGACC TGTTCTTTTG TACTAAGGTA TAATACGCTT ATCCATTTAG 93 6 0 

AATTCAATCT TCCTGCCGTT GCAAATCCCT TTGGTGGTGA CATTAGTTCA CTTTTCTCTG 94 20 

TAATGAACTT AACTATTCTA GAT CT AT AT A ATGGTTCAAA TCTTTCTCTA AATTCCTCAA 94 8 0 

TACTATAGTA ATTAGTAGTG ATATCGAGAA AG AACG CTAA ATTCT CTAAA TTGATCATAT 954 0 

TTTTATGAAA TCTATTTTTA TACTTCAAGC TCTCACAAAA TCCATCCCAG TCATTATTTG 9600 

CTACAATTAG ATTTTTATTT GTATATTTTT TATCGTTTAT GATTTT AG CG CCTACTAAAT 9660 

CTTCCAACAC TCGTCTATCT AAATTTTCAT CATCTTTAAA AAGTTCATTT AAAATACAAC 9720 

TTATTTGAGC TTCCTCAACA TTAAATATAC TCCAGTCGTC TTTTAATGCT ATTTCAATCT 9780 

TTTTACCTTC TTTTGGGCTA AAAGTATCTG GTAAATTTAT ACTAATATCA TATAATTCTA 984 0 

ATGCTGGTCT TAAATAATCT CTAATAAGTT CTAATTTATC TATGTCCTTA GTCGTATCAA 9900 

ATATTTTAAC ACCAAGATGA TTGTTATCAA TATCACAATT GTCAAATTTG CT ATTT AT CA 9960 

TTTGCAATGA TTTCTACGAT TTCAGTATTA TTAAAACATT TTTCACATAT TTTCATTTTG 10020 

AGACTCCAAG TATCTATTCA TAATTTCTAG GTGATGCATG ATAGATAACC TTTTAATTAA 10080

ACCTAATCCT GGATaCTTAT TATTTTCATT TAATTCTTCA AATTGTCCCA AGCGCATAAG 10140

ATCTATTTTT AATATCTAAG TTTTTTGACC ATGTTACTAA TT 10182 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3491 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear 
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AACTCAGGCA ATTGAAACAG CATTAGGTGC TTCATTACAA CATGTCATTG TAGATTCAGA 
AAAAGATGGA CGCCAGGCTA TTCAATTTTT AAAAGAACGT AATTTAGGTC GTGCGACGTT 
TTTACCATTA AATGTTATAC AGAGTAGAGT GGTAGCGACT GATATTAAAT CTATTGCTAA 
AGAGGCAAAC GGATTTATTA GTATCG CTTC GGAAGCAGTT AAAGTAGCAC CAGAATATCA 
AAAT ATT AT C GGGAATTTAT TAGGTAATAC GATTATCGTT GATCATTTAA AGCATGCAAA 
TGAATTGGCA CGTGCGATTA AATATCGAAC TCGTATTGTT ACTTTGGAAG GTGATATTGT 
AAATCCTGGT GG t TCTATGA CTGGTGGTGG CGCTCGTAAG TCAAAAAGTA TTCTGTCTCA 
AAAAGACGAG TTGACAACAA TGAGACACCA ATTAGAAGAT TACTTGCGTC AAACAGAATC 
ATTTGAACAA CAATTTAAAG AGTTGAAGAT AAAAAGTGAT CAATTAAGTG AACTGTATTT 
TGAAAAAAGT CAAAAGCATA ATACACTTAA AGAGCAAGTG CATCATTTTG AAATGGAGCT 
CGATAGATTA ACTACACAAG AAACACAAAT AAAAAATGAT CATGaAGAAT TCGAATTTGA 
AAAAAATGAT GGTTATACGA GTGACAAAAG TCGACAAACT TTGAGTGAAA AAGAAACTTA 
TCTAGAAAGT ATT AAAG CAT CTTTAAAACG ACTAGAAGAT GAAATTGAAC GCTACACAAA 
ACTTTCTAAA GAAGGTAAGG AAAGCGTTAC TAAAACACAA CAAACCTTAC ATCAGAAACA 
ATCTGATCTT GCTGTGGTTA AAGAGCGTAT TAAAACACAA CAACAGACAA TAGATCGATT 
AAATAATCAA AATCAACAAA CTAAACATCA ATTAAAAGAT GTTAAAGAAA AAATTGCATT 
CTTTAATTCG GATGAAGTGA TGGGCGAACA AGCTTTTCAA AATATTAAAG ATCAAATTAA 
TGGTCAACAA GAAACGAGAA CACGCTTATC AGATGAATTA GATAAATTGA AACAACAACG 
TATTGAGTTG AATGAACAAA TCGATGCGCA AG AAG CT AAA CTACAAGTTT GTCACCAAGA 
TATTTTAGCT ATCGAAAATC ACTACCAAGA TATT AAAG CT GAACAATCAA AGCTAGATGT 
ATT/tfVTTCAT CATGCGATAG ATCATTaAAT G ATG r ATATC AATTGACTGT TGAACGTGCG 
ArATCTGAAT ATACGaGTGA TGrATCGATg ACGCATTACG TAAAAAAGTT AAGTTAATG r 
AGaTGyCGAT TGATGrACTA GGTCCTGTAA , ACTTAAATGC AATTGAACAA TTTGAAGAGT 
TAAATGAACG TTATACATTT TTAAGTGAAC AACGTACAGA TCTTCGTAAA GCTAAAGAAA 
CATT AG AG CA AATTATAAGT GAAATGGATC AAGAGGTTAC TGAAAGATTT AAAGAAACTT 
TCCATG CTAT TCAAGGACAT TTTACAG CTG TGTTCAAACA ATTGTTTGGT GGAGGCGATG 
CAGAATTG CA ATTAACTGAA GCCGATTATT TAACAGCTGG TATTGATATT GTGGtACAAC 
CACCGGGTAA AAAGTTGCAA CATTTATCGT TACTGAGTGG TGGTGAGCGT GCATTAACTG 
CTATTGCTTT ACT ATTTG CA ATTTTAAAAG TAAGATCTGC ACCTTTTGTT AT ATT AG rTG 
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TATCAGACGA AACACAATTC ATTGTTATTA CACACCGTAA AGGAACAATG GAATTTGCAG I860 

ATAGGTTATA CGGTGTAACA ATGCAAGAAT CAGGTGTTAC TAAACTTGTG AGTGTGAATT 1920 

TAAATACAAT AGATGATGTG TTGAAGGAGG AGCAATAATG AGCTTTTTTA AACGCTTAAA 1980 

AGATAAGTTT GCAACAAATA AAGAAAATGA AGAAGTTAAA TCCTTAACAG AAGAACAAGG 204 0 

TCAAGACAAA TTAGAAGATA CACATTCTGA AGGTTCAACG CAGGACGCAA ATGATTTAGC 2100 

AGAAAATGCT GAAGTGAAAA AGAAGCCACG CAAGTTGAGT G AAG CGGATT TTGATGACGA 2160 

TGG CTTAAT A TCAATTGAAG ATTTTGAAGA AATTGAAGCT CAAAAAATGG GTGCTAAATT 2220 

TAAAGCAGGA CTCGAAAAAT CTCGTCAAAA TTTCCAAGAA CAATTAAATA ATTTGATAGC 2280

GAGATATCGT AAAGTAGATG AAGACTTTTT TGAAGCTTTA GAAGAAATGT TAATCACTGC 234 0 

AGACGTCGGT TTTAATACAG TGATGACGTT AACTGAAGAA TTACGTATGG AAGCACAACG 24 00 

20 ACGTAATATT CAAGATACTG AAGATTTGCG TGAAGTCATT GTTGAAAAGA TCGTAGAGAT 24 60 

TT AC CAT CAA GAAGATkATA ATTCAGAAGC TATGAACTTA GAAGATGGTC GTTTAAATGT 252 0 

CATTTTAATG GTTGGTGTGA ATGGTGTTGG TAAAACAACA ACAATTGGAA AATTAGCTTA 2580 

CCGATATAAA ATGGAAGGTA AAAAAGTAAT GTTAGCTGCG GGCGATACTT TTAGAGCGGG 2640 

TGCTATTGAT CAATTGAAAG TTTGGGGCGA ACGTGTTGGT GTAGACGTAA TTAGCCAAAG 27 00 

TGAAGGTTCT GATCCAGCTG CTGTTATGTA TGATGCgATT AATGCCGCTA AAAACAAAGG 2760 

TGTTGATATT TTAATCTGTG ATACCGCTGG ACGTTTACAA AATAAmACAA ATCTAATGCm 28 20 

AGAATTAGAA AAAGTT AAG C GTGTAATTAA TCGAGCAGTG CCAGATGCGC CTCATGAAGC 28 80 

ATTACTATGT TTAGATGCTA CAACTGGTCA GAATGCGTTG TCACAAGCTA GAAACTTTAA 2940 

AGAAGTAACA AATGTTACAG GTATTGTATT AACGAAATTA GATGGTACAG CCAAAGGTGG 3000 

TATCGTATTA GCCATTCGTA ATGAATTGCA CATCCCAGTT AAATATGTAG GTTTAGGTGA 3060 

40 GCAATTAGAT GACTTACAAC CATTTAACCC TGAAAGTTAT GTCTACGGCT TATTCGCTGA 3120 

TATGATTGAA CAAAATGAAG AAATAACAAC AGTTGAAAAT GATCAAATTG TAACAGAAGA 3180 

AAAGGACGAT AATCATGGGT CAAAATGATT TAGTtAAAAC GTTACGAATG AATTATTTGT 324 0 

TTGATTTTaT CAATCCTTAT TGACGAATAA ACAACGTaAT TATTTGGAAT TATTTTATCT 3300 

TGAAGATTAT TCTTTAAGTG AAATCGCAGa TACTTTTAAT GTGAGTAGaC AAG CAGTTT A 3360 

TGATAATATA AGAAGAACTG GCGATTTAGT TGAAGATTAT GAAAAGAAAT TGGAATTATA 34 20 

CCAGAAATTT GAGCAACGCC GAGAAATATA TGATGAAATG AAACCACATT TAAGTAATCC 34 80 

AGAACAAATA C 3491 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

AGTACGTTTT ATAATTATAA GTACGTAATT AACATATTAA CATATCG CAA GTATGTATTT 60 

AAATAAgATT GTTATAATTT CAAAGTTCAT CCAAGaTTAT GGCGTTTGCA TTTACCTATT 120

AAAAACGTTA TTATATCAAA GATGCGAAAG ATAATACGGG TTTATTTTAT GAAAGTGAGA 180 

AGGATAAAAT GGATAATGAG CAACGCTTAA AAAGAAGAGA GAATATAAGG AATTTCTCGA 24 0 

TT AT AG CACA TATTGACCAC GGAAAATCTA CATTGGCTGA TAGAATTTTA GAAAATACCA 3 00 

20 AATCAGTTGA AACAAGAGAT ATGCAAGATC AGTTACTAGA TTCAATGGAT TTAGAAAGAG 3 60 

AACGTGGTAT TACAATCAAA TTAAACGCgT ACGTTTAAAG TACGAAGCTA AAGATGGAAA 420 

TACTTATACA TTCCATTTAA TCGATACGCC TGGACACGTC GATTTTACAT ATGAAGTGTC 4 80 

ACGTTcTTTG GCAGCTTGTG AGGGCGCGAT TTTAGTAGTA GATGCGGCTC AAGGTATCGA 54 0 

AGCACAAACA TTAGCAAATG TTTATTTAGC ATTAGATAAT GAGTTAGAGT TATTGCCTGT 600 

TATTAACAAA ATTGATTTAC CTGCTGCAGA ACCTGAACGC GTGAAACAAG AAATTGAAGA 660 

TATGATAGGT TTAG AC CAAG ACGATGTTGT TTT AG CAAGT GCTAAATCTA ACATTGGAAT 720 

TGAAGAGATA CTAGAGAAAA TAGTTGAAGT TGTGCCAGCT CCAGATGGTG ACCCAGAAGC 780 

ACCACTAAAA G CGTTAAT AT TTGATTCTGA GTATGATCCA TATAGAGGGG TAATTTCATC 840 

GATAAGAATT GTGGACGGTG TTGTTAAAGC CGGAGATAAA ATTCGAATGA TGGCCACTGG 900 

TAAADAGTTC GAAGTAACAG AAGTTGGAAT TAATACACCT AAGCAGCTTC CAGTTGATGA 960 

40 ATTAACAGTT GGTGATGTTG GTTATATTAT TGCAAGTATT AAAAATGTTG ATGATTCTAG 1020 

GGTTGGTGAC ACCATCACAT TAGCTAGTAG ACCTGCATCA GAACCATTGC AAGGTTATAA 1080 

GAAAATGAAT CCAATGGTAT ATTGCGGACT GTTCCCAATA GATAACAAAA ATTATAATGA 1140 

TTTAAGAGAA G CATT AG AAA AATTACAATT GAATGATGCA TCATTAGAAT TTGAGCCTGA 1200 

ATCGTCACAA GCATTAGGTT TTGGTTATAG AACTGGTTTC TTAGGTATGT TACACATGGA 1260 

AATAATTCAA GAAAGAATTG AAAGAGAATT TGGTATTGAA TTAATTGCAA CTGCACCATC 1320 

TGTAATTTAT CAATGTGTTT TAAGGGACGG TTCAGAAGTG ACGGTTGATA ACCCAGCACA 13 80 

AATGCCAGAT CGTGATAAAA TTGATAAAAT ATTTG AG C CA TATGTTCGTG CAaCTATGAT 1440 
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TATAAATATG GACTATTTAG ATGATATTCG TGTAAATATT GTTTATGAAT TACCTTTAGC 1560 

TGAAGTTGTA TTTGATTTCT TCGATCAACT TAAATCTAAT ACTAAAGGAT ATGCATCATT 1620 

TGATTATGAA TTCATCGAAA ATAAAGAAAG TAATTTAGTC AAGATGGATA TTTTATTAAA 1680 

TGGTGATAAA GTGGATGCGC TAAGCTTCAT AGTTCATAGA G ATTTTG CAT ATGAACGTGG 174 0 

TAAAGCATTA GTTGAAAAAC TTAAAACGTT AATTCCAAGA CAGCAATTTG AAGTACCTGT 180 0 

ACAGGCTGCA ATAGGACAAA AAATTGTAGC GCGTACAAAT ATTAAATCAA TGGGTAAAAA 1860

CGTTTTAGCT AAATGTTATG GCGGTGACAT AAGCCGTAAA CGTAAATTAC TTGAAAAACA 1920

AAAAGCAGGT AAAGCTAAGA TGAAAGCAGT TGGTAATGTT GAAATTCCAC AAGATGCTTT 1980

CTTGGCTGTA TTGAAAATGG ATGATGAATA ATTTTAAAAA ATCAATTAAC AATTTACAAT 204 0 

GAATAAAGTT TAATAACTAA AAAGAGGGAG CCTAGGATAA ATTAACGTCC TGGGCTTTAC 2100 

20 AATGTTATAT TGGCAGCCAT CGACAGAGTT AAAATGAGCT TATAACAATG GGGCCCCAAC 2160 

ACAGAAG CTG ACGAAAAGTC AGCTTACTAT AATGTGCAAG TTGGGGTGGG GCCCCAACAT 2 22 0 

AGAGAATTTC GAAAAGAAAT TCTACAGGCA ATGCAAGTTG GGGTGGGACG ACGAAATAAA 2280 

TTTTGCGAAA ATATCATTTC TGTCCCACTC CCTTATGCAT GAGTTTTACT CATGTAATTT 234 0 

TATTTTTAAG GACATATTAC ATCTGGCTAA TGTGTAAGAG CCACTACATA ATAAATCATT 24 00 

AGTGGTTCTT TATTATTTCT ATCTCACTCC CTCTAAACAA GAATAAATAT TAAAATGAAT 24 60 

CGATATATTA GACAATCATT GATTAAACGT TAAAGTTAAA AGTAAGAATA ATTG CAGATA 2520 

GTCCAACAGG ATATAG CCG A TTGGATAAAA AGTCTGAGAA GCGGGGCATT AAAATGACGG 2 58 0 

35 TACAAAGTGC ATATATACAT ATTCCATTTT GTGTAAGAAT ATGTACATAT TGTGATTTCA 2640 

ATAAATATTT TATACAGAAT CAACCTGTAG ATGAGTACTT AGATGCACTA ATCACAGAAA 2700 

TGTCTACAGC AAAATATAGG ATCTTAAAGA CCATGTATGT AGGTGGCGGC ACACCAACGG 2760 

40 CCCTTTCTAT TAATCaGTTG GAAAGATTAC TTAAAGCAAT ACGTGATACG TTTACAATCA 2820 

CAGGCGAGTA TACATTTGAA GCAAATCCTG ATGAGTTAAC TAAAGAGAAA GTCCAACTAT 28 80 

TAGAGAAATA TGGAGTAAAA AGGATTTCAA TGGGCGTTCA AACATTCAAG CCGGAGTTAT 2940 

TGTCTGTTTT AGGTAGAACG CACAATACTG AAGATATTTA CACTTCGGTG TTAAATGCTA 30 00 

AAAACGCAGG TATTAAATCA ATCAGTTTAG ATTTAATGTA TCATTTACCG AAACAGACGA 3060 
TTGAAGATTT TGAACAAAGT TTAGATCTAG CTTTAGATAT GGATATTCAA CATATTT CG A 3120 
GTTACGGCTT AATACTTGAA CCTAAAACCC AATTTTATAA TATGTATAGA AAAGGCTTGC 3180 
TCAAACTTGC TAATGAGGAT TTAGGTGCTG ACATGTATCA GTTGCTGATG TCTAAGATAG 3240 
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AACATAATAA GGTTTACTGG TTTAATGAGG AATATTATGG ATTTGGAGCA GGTGCAAGTG 
GTTATGTAGA TGGTGTGCGT TATACGAATA TCAATCCAGT GAATCATTAT ATCAAAGCTA 
TAAATAAAGA AAGTAAAGCA ATTTTAGTAT CAAATAAACC TTCTTTGACT GAGAGAATGG 
AAGAAGAAAT GTTTCTTGGG TTGCGTTTAA ATGAAGGTGT GAGTAGTAGT AGGTTCAAAA 
AGAAGTTTGA CCAATCTATT GAAAGTGTCT TTGGTCAAAC AATAAATAAT TTAAAAGAGA 
AGGAATTAAT TGTAGAAAAG AACGATGTGA TTGCACTTAC AAATAGAGGG AAAGTCATAG 
GTAATGAGGT TTTTGAAGCT TTCCTAATAA ATGATTAAAA AAAATTGAAA TTTCGAGTCT 
TTAACATTGA CTTACTTTGA CCAATTTGAT AAATTATAAT TAGCACTTGA GATAAGTGAG 
TGCTAATGAG GTGAAAACAT GATTACAGAT AGGCAATTGA GTATATTAAA CGCAATTGTT 
GAGGATTATG TTGATTTTGG ACAACCCGTT GGTTCTAAAA CACTAATTGA GCGACATAAC 
TTGAATGTTA GTCCTGCTAC AATTAGAAAT GAGATGAAAC AGCTTGAAGA TTTAAACTAT 
ATCGAGAAGA CACATAGTTC TTCAGGGCGT TCGCCATCAC AATTAGGTTT TAGGTATTAT 
GTCAATCGTT TACTTGAACA AACATCTCAT CAAAAAACAA ATAAATTAAG ACGATTAAAT 
CAATTGTTAG TTGAGAATCA ATATGATGTA TCATCAGCAT TGACATATTT TGCAGATGAA 
TTATCAAATA TATCTCAATA TACAACTTTA GTTGTTCATC CTAATCATAA ACAAGATATT 
ATCAATAATG TACACTTGAT TCGTG CTAAT CCTAATTTAG TTATAATGGT TAT 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3395 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:
TCCCTAATCG AACAAAATTA TGCGCATAAA CAAAGTAGAT TGATATAAAA TTCTTAATTA 
TCAGAATATA TTTACAAATC TGAATTTTAT TAGTATATTG GrTAGTrTTC ATAGAGGCAT 
GACGGTa TTT GAGCAGGATT TTAAATCGGg ATTTTATAAT CGATTTAAGA GAGGCCACtT 
TGCTTGcACA TTAATACTGT cAATGGGAGG GGAATGTATA TGAGTrAAGC ACATCAATTA 
ATTCAAGAGG ATGAACATTA TTTTGCGAAA TCAGGACGTA TTAAATATTA TCCGTTAGTG 
ATTGATCATG GATATGGAGC AACATTGGTT GATATTGAGG GGAAGACATA TATCGATTTG 
TTATCGAGTG CGAGTTCTCA AAACGTAGGT CATGCACCTA GAGAAGTAAC AGAAGCGATA
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GTACGTTTAG CTAAGAAGCT TTGTGAGATT GCACCTGGAG ATTTTGAAAA AAGAGTGACC 540 

TTCGGATTAA CCGGATCAGA CGCAAATGAT GGCATCATTA AATTTGCCAG AGCATATACA 600 

GGGCGTCCTT ATATCATTAG TTTCACTAAT GCATATCATG GTTCAACTTT TGGCTCATTG 660

TCTATGTCAG CTATTAGTTT AAATATGCGC AAACATTATG GTCCGTTATT GAATGGTTTT 720 

TATCATATTC CGTTTCCAGA TAAATATCGT GGTATGTACG AGCAGCCACA AGCTAATTCA 780

GTAGAAGAAT ATTTAGCACC CTTAAAAGAA ATGTTTGCGA AGTATGTACC TGCTGACGAA 840

GTAGCATGTA TTGTTATTGA AACGATACAA GGCGATGGTG GACTTTTAGA ACCAGTTCCA 900

GGGTATTTTG AAGCGTTAGA AAAGATTTGT CGTGAACATG GTATTTTAAT CGCTGTCGAT 960

GATATTCAAC AAGGTTTTGG GAGAACAGGT ACATGGAGTT CAGTCTCGCA TTTTAATTTT 1020

ACGCCTGATT TAATCACTTT CGGAAAATCC TTAGCAGGTG GTATGCCTAT GTCAGCAATT 1080

GTTGGACGCA AAGAGATTAT GAATTGTTTA GAAGCACCAG CACATTTATT TACAACAGGT 1140

GCTAATCCAG TTAGTTGTGA AGCTGCATTA GCCACAATTC AAATGATTGA AGATCAGTCG 1200

CTTCTTCAGG CTAGTGCGGA AAAAGGGGAA TATGTTAGGA AACGAATGGA TCAATGGGTA 1260

TCTAAATACA ATAGTGTAGG CGATGTTAGA GGTAAAGGTC TGAGCATTGG TATTGATATT 1320

GTTTCCGACA AAAAACTCAA AACACGTGAT GCCAGTGCGG CACTTAAAAT TTGTAATTAC 1380

TGCTTTGAGC ATGGCGTAGT TATTATAGCT GTAGCAGGAA ATGTGTTGCG ATTCCAACCG 1440

CCATTGGTAA TAACATATGA GCAATTAGAC ACGGCGTTAA ACACTATAGA AGATGCACTG 1500

ACTGCTTTGG AAGCAGGTAA CTTAGATCAA TATGACATAT CTGGACAAGG TTGGTAATAG 1560

CGATTATCTT AATATAAAAT AAAAAATCAT TTCCACATCT GGATGTTAAT CAGATGGGAA 1620 

ATGATTTTTT TTATTTTTTA TTTTGGTGGG TGGTATTCAG CTACGTCATT TTTCTTAGAA 1680 

TGTCTAAGTC CATAACTTAA ATATAGGATG ATACCAACAA TAAACCAAAT TAAAGTGTAT 1740

AATTTCGCTT CGAATCCTAA TCCCCAGAAT ACTAGCAATA CTAAAACAAA TGTAATTGCT 1800

GGTAACACAG GATATAAAGG TAATTTAAAT GCAGGAATTG GTAGATCTTT ACCTTcACGC 1860 

TTTCTCAAAC GATACATTGC TAATGAAACG AACATAAATG CAACAAGTGT ACCTGCTGAA 1920 

ATTAATTGTG CTAAAAATGC GAATGGGAAC ATAGAACCAA TTAAAACACC AATAATAGTA 1980 

AGTATAACTA GTGCGCGATT AGGTAAATGT TTGTCGTTTA AGTGGCTTAA CCATGAAGGT 2040 

AATAAGCCGT CACGTCCAAA TGAATAAAGT AAACGTGAGC CTGCTAACAT CATACCAATT 2100

AATGCTGTAA ACATACCGAT AACAGAGATA GCTTGAACAA TAGCTGCTAC AACACCATGA 2160 

CCACTTTGAC GTAAAGCCCA ACCAACAGGT TCAGCATTGT TTGCGTATTG TGAGTAATGG 2220 
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CCAAGAATAC CTCTAGGCAT TGTCTTTTGA GGATCAAGTG CTTCTGCTGA GTTTGCTGCG 234 0 

ATAGAATCGA AACCGATATA CGCTAAGAAA ATCATTGAAA CACCAGCATA TATGCCTTGC 2400

CATCCACCAA AGTCACCTGT AGCAGTTACT TTGTGTTCTG GAATAAATGG CACATAGTTA 2460

CTAACATTTA TTGCTGTTAA ACCTACGATG ACAAATAAAA TAATAGCTAA TACTTTTAAA 2520

ATAACTAAAA TATTTTCCAT ACGAGCTGCT TCCGACATAC CACGTGATAG TAATAATGCA 2580

GTTAATAAAA TAACGATAGC AGCAATAATA TCGATAAAAC CGCCATTTGT ACCAAATGGA 2640

TTTGATAATG CTGCAGGTAA TTCGATGCCA ATTGGTTTCA CAAGTCCGCG TAAATTCGCT 2700 

GAGAATCCTG ATGCAACAAA GGCTACGGCG ATAAAATATT CAGCTAATAG AGCCCAACCG 2760

GCAACCCATC CAAAAAATTC ACCAAATAAT ACATTGACCC AAGAATAGGC TGAACCTGCA 2820 

AATGGCATAG CGGCAGCCAT TTCTGCATAA GTAAATGCAA CTAAACCAGC AACAATAGCA 2880 

GCGAGTAAGA ATGATAACGC AACGGCCGGT CCTGCATGTT CTGCAGCAAC AATGCCAGGT 2940

AGCGTAAAGA TAGATGTCGA TACAATTGTT CCTACACCTA AAGCTAAGAA ATCACGCACC 3000

CGAAGTGTAC GCTTTAAATG ACCATCTTTA TTTTGATAGA TAGCCGGATC CTCTTTTCGT 3060

GCTATTTTAT TGAAAAAACT TCCCATAAAC TTTCCTCCCA AACATTCATA AACAATTCTA 3120 

TACGGTGTTT TTTAATATGT TATATCATAG CACAAATAAT CAATATTTTG TCTAAAAATT 3180 

CTGAAAAATC ACAACTTTAT GTTACGTATT AATGACTTGT CTTGATAACA TCCATAGATT 3240

TTTTAAATGA TAAAACTGAT TATAACAGAT ATTAAATGAA TAAGTACTAT TTTTTGCnAA 3300 

TTTTCTAACA ATTTTGCACA TTATATGTTT AAAATCAATT TCATGTTTAT GGTCTGATTG 3360

GCTAGTGTGT ATGAAATGTA AnTCTTTGAC TnnGA 3395 
(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ATCAGGTAAT GCCATGCGTT TAGCTGAAAA TTTTTTCAGA ACGTTTAAGT GATATCGGAC 60

ATCAAGTTGT TTTGATGTCA ATGGATGAAT ATGATACGAC AAACATCGCG CAGTTAGAAG 120 

ATTTATTTAT TATTACGTCT ACTCATGGTG AAGGAGAACC GCCTGATAAT GCATGGGATT 180 

TCTTTGAATT TTTAGAAGAC GATAACGCAC CTAATTTAAA TCATGTGAGA TATTCAGTAC 240 
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TACTAGAAAA TCTAGGCGCT GAGCGTATAT GTAAGCGTGT AGATTGTGAT ATTGATTATG 360 

AAGAAGACGC AGAAAAGTGG ATGGCAGACA TCATTAATAT TATTGATACC ACATCAGAAG 420

GTATTCAAAG TGAATCGGTG ATAAGTGAAT CAATTAAGTC TGCCAAAGAA AAGAAATATT 480

CTAAATCAAA TCCATACCAA GCAGAAGTAT TAGCGAATAT CAATTTAAAT GGTACCGATT 540

CAAATAAAGA AACACGACAT ATAGAATTTT TACTTGATGA TTTTAGTGAA TCATATGAAC 600

CAGGAGATTG TATAGTAGCA TTACCGCAAA ACGACCCTGA ATTGGTTGAA AAACTAATAT 660

CCATGTTAGG TTGGGATCCG CAATCTCCGG TGCCAATTAA TGATCATGGT GATACAGTTC 720

CTATTGTTGA AGCACTAACA TCACATTTTG AATTTACTAA ATTAACATTG CCATTATTGA 780

AAAATGCAGA TATCTATTTT GACAATGAAG AATTATCTGA ACGTATTCAA GATGAGTCAT 840

GGGCGCGTGA ATATGTTATA AATCGGGACT TTATAGATTT AATAACAGAT TTTCCAACTA 900

TAGAATTACA ACCTGAGAAT ATGTATCAAA TCCTTAGAAA ATTACCACCA AGAGAGTATT 960

CGATTTCTAG TAGTTTTATG GCAACGCcAG ATGAAGTGCA TATTACCGTT GGTACGGTTC 1020 

GTTATCAAGC ACATGGACGT GAGAGAAAAG GTGTATGCTC GGTTCATTTT GCTGAGCGAA 1080 

TTAAACCAGG CGATATAGTA CCAATTTATT TGAAGAAAAA TCCGAACTTC AAATTTCCGA 1140

TGAAGCAAGA TATACCGGTT ATTATGATTG GACCAGGTAC TGrAATTGCT CCTTTTAGAG 1200

CATATTTACA AGAACGTGAA GAACTTGGTA TGACTGGAAA AACATGGTTG TTCTTTGGTG 1260

ATCAACACCG TAGTTCTGAC TTTTTATATG AAGAAGAAAT AGAAGAATGG CTTGAAAATG 1320

GAAACTTAAC ACGCGTAGAT TTAGCATTTT CAAGAGACCA AGAACACAAA GAATATGTAC 1380

AGCATCGTAT AATGGAAGAA AGTAAACGTT TCAATGAATG GATTGAGCAA GGCGCACAAT 1440

CTATATTTGT GGCGATGAAA AATGTATGGC GAAAGATGTC CATCAAGCCA TTAAAGATGT 1500 

ATTGGTAAAA GAACGTCATA TTTCTCAAGA AGAAGCAGAG TTATTATTGC GACAAATGAA 1560 

ACAACAACAA CGCTATCAAC GTGATGTTTA TTAGCGATTG GTGTTAAATA TTTTAAGGTG 1620 

TAATGATGTA AAAAGATATA AAGGATGTTG CTCAACATGA ATATGCCATT AATGATAGAT 1680

TTAACAAATA AAAATGTCGT CATAGTTGGT GGAGGCGTCG TTGCAAGTCG TCGGGCACAA 1740

ACATTAAATC AATACGTTGA ACATATGACG GTCATCAGTC CGACAATCAC TGAAAAACTT 1800

CAAAATATGG TAGATAACGG TGTCGTCATA TGGAAAGAAA AAGAATTTGA ACCAAGCGAT 1860

ATTGTAGACG CGTATCTAGT TATTGCAGCA ACCAATGAGC CACGTGTCAA TGAAGCGGTA 1920

AAAAAAGCCT TACCTGAGCA TGCCCTTTTT AATAATGTTG GAGATGCATC AAATGGCAAT 1980

GTTGTATTTC CAAGTGCACT ACACCGCGAC AAGCTAACTA TCAGTGTATC AACTGATGGT 2040
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TACAGTTCGT ATATCGACTT TTTATATACT TGCCGACAGA AAATAAAAGT ACTTGATATA 2160 

ACATATAACG AAAAGCAACA GTTACTGTCA CAAATTGTGT CACAAGAATA TTTAAATCAT 2220

GACAAACAAG CTCAATTTTT AGCGTGGTTG GATGTAAGAT AATAATAGCG GACCGTCTAA 2280

CCGTCTAAGG TAAGTCTTCT TATTTTAACT TTAACGCTTA ATCATTGAAA TTAAGACATG 2340

GGCGGCTTTG TGAATAGTCT AATAATGAAG GATTTAAGCG ATAATGATAT GCGTTTTAAA 2400

TATGAATATT ACAATAGAGA AAAAGATACG TAGAACAAAC TTAATAAAAT AGGTGGATAA 2460

ATTGAAATCT GGTTGAAGTC GTTACTATCA TAGCGACCTT TAGCCAGATT TTTTGTGCAA 2520

TAGAAAGCAA TAATAAAAAT GATAGATCAA AATGAAATAC AGGACAGGAT ATACAAGGAT 2580 

TAGTCATGCC ATGTTATCAA GTAGGAAAAT CAAACTTCAC TATTGATAGT TACGCAAAAA 2640

AGATTTTTTT GATAAAATGA GATAACTTAA ATATAAAAAA TTATATTAAT TATAATATTT 2700 

AAGTTAAAGA GGGGGATTAT GTAAATTGTA TTAAAAGTGG AGGGAGAAAA TAATATGAAT 2760

AGTGATAATA TGTGGTTAAC AGTAATGGGG CTCATTATTA TTATTTCAAT TGTAGGTTTA 2820 

CTCATTGCCA AAAAGATAAA TCCAGTTGTA GGTATGACAA TCATACCTTG CTTAGGGGCA 2880 

ATGATTTTAG GATATAGTGT GACAGATTTG GTTGGATTTT TTGCTAAAGG GTTAGATCAA 2940

GTCATCAACG TTGTTATTAT GTTTATCTTT GCCATTATTT TCTTTGGCAT CATGAACGAT 3000

AGTGGTTTAT TCAAGCCGCT TGTCAAACGC TTAATATTAA TGACACGAGG CAATGTCGTC 3060

ATTGTCTGTG CAATGACAGC TTTAATTGGC ACAATAGCCC AATTAGATGG GGCCGGTGCG 3120 

GTAACATTTT TGCTTTCTAT TCCTGCATTA TTACCTTTAT ATAAAGCGTT AAATATGAAT 3180 

AAATATTTAT TGATTTTACT ATTAGCATTA AGCGCGGCGA TTATGAACAT GGTACCTTGG 3240

GGAGGTCCAA TGGCTCGTGT AGCTGCAGTG TTAAAAGCCA AAAGTGTCAA TGAATTATGG 3300 

— TATC^ATT/A TACCTATTCA AATAAT7^3GT TTCMTCTTG TTATCTTGTT TGCGGTATAT 3360 

CTTGGATTTA AAGAACAGAA ACGTATCAAA AAAGCAATAG AGAGAAATGA ATTACCGCAA 3420 

ACACAAGATA TAGATGTACA TAAATTAGTT GAAGTATATG AACGAGATCA AGATGTAAGG 3480

TTTCCTGTAA AAGGACGTGC AAGAACAAAA TCATGGATAA AATGGGTGAA TACAGCTTTA 3540

ACTTTAGCTG TTATTCTATC GATGTTAATA AATATTGCGC CACCTGAATT TGCATTCATG 3600 

ATAGGTGTTy CGTTGGCACT TGTTATTAAT TTTAAATCAG TGGATGAACA AATGGAACGA 3660 

TTAAGAGCgC ATGCGCCGAA TGCATTAATG ATGGCTGCAG TGATTATTGC AGCAGGTATG 3720 

TTTTTAGGTG TACTAAATGA AACCGGTATG CTTAAAGCGA TTGCGACCAA TTTAATCAAA 3780 

GTGATTCCTG CAGAAGTAGG ACCATACTTG CATATTATTG TAGGTTTACT TGGCGTACCA 3840

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ACAGCAGGGC AATTTGGTGT ACCGTCTGTA TCAACAGCTT ATTCAATGGT CATAGGGAAT 3960

ATTATAGGTA CATTTGTCAG CCCATTTTCA CCAGCCTTAT GGTTGGCAAT TGGTTTAGCA 4020

GAGGCAAACA TGGGCACGTA TATTAAGTAT GCATTCTTTT GGATTTGGGG ATTCGCTATC 4080

GTTATGTTAG TAATTGCAAT GTTGATGGGC ATTGTGACGA TTTAAGTATG AAAAAATAGA 4140

AACTATGGTC ACGTTGCAAA ATGAAATAAT AGTTGCATAA ACATGTCGAA ATGACGGACG 4200

AATCTTTAAA CAATTTTAAA AATTAATGAA ATAATTGTGT AGAAATATGA ATTTCACTAA 4260

ATGTTAATAA CTTTGTGACG TTTTAGTTAA CAGACTAATA AAAATTTGAA AATACTATAT 4320

ATAGTGGTAT AACGTAATGA GTAGACACAA TATATAGGAA GAAGGGGTAA AATGAATCAA 4380

ATCGAAGAAG CATTAACGGG TTTGATTTCT AAAGATCCTG CTATTGTTAA CGAAAATGCT 4440

AACAAAGATA GTGATACATT TTCAACAATG AGAGATTTAA CAGCAGGTAT CGTTTCTAAA 4500

TCTTACGCAT TAAATCATTT ATTACCAAAG CACGTTGCAG ATGCACATCA AAGAGGGGAC 4560

ATACATTTTC ACGACTTAGA TTATCATCCA TTCCAACCGT TAACTAACTG TTGTTTAATA 4620

GATGCTAAAA ATATGCTACA TAATGGATTT GAAATAGGCA ACGCGAATGT AACTTCACCA 4680

AAATCAATAC AAACTGCATC AGCGCAGCTT GTACAAATTA TAGCCAATGT TTCTAGCAGT 4740

CAATATGGTG GCTGTAcGGT TGACCgCGTT GACGAATTAC tTAGTACATA TGCACGACcA 4800

TAATGAAGAA CAACATAGGA ATATsCGCAA AGCAATTTGT CAAAGAATCT GAAATTGATC 4860

GTTATGTTGA TCAACAAGTC ACTAAAGACA TCAATGATGC GATTGAAAGT TTAGAATATG 4920

AAATTAATAC CTTATATACA TCTAATGGAC AGACACCTTT TGTAACATTA GGATTCGGCT 4980

TAGGTACAGA TCATTTAAGT CGCAAAATTC AACAAGCTAT CTTAAATACT CGTATCAAAG 5040

GCTTAGGAAA AGACCGCACG ACAGCGATTT TCCCGAAACT TGTATTTTCA ATTAAAAAAG 5100

GAACQAACTT TAGTCCGCAA GATCCGAACT ATGACATTAA ACAACTAGCA TTAAAGTGTT 5160

CAACGAAACG TATGTATCCA GATATTTTAA ATTATGACAA ACTCGTAGAA ATATTAGGTG 5220

ATTTCAAAGC GCCAATGGGT TGTCGTTCAT TTTTACCAAG TTGGAAAGAT GCGGAAGGTC 5280

ATTTTGAAAA TAATGGTCGT TGTAATCTTG GTGTTGTTAC ACTTAATTTA CCTAGAATGG 5340

CATTAGAATC TGCCGGTAAT ATGACGAAAT TCTGGGAAAT CTTTTATGAA CGTATCGATG 5400

TGTTACATGA TGCATTACTT TATCGTATAA A TrflTTTfi 71 71 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CCGAATAACG 5460

CACCGATTTT ATATAAAAGT GGCGCATTTA ACTATAAATT AAAAGAAACA GATGATGTTG 5520

CTGAGTTATT TAAAAATAAA CGTGCAACGA TTTCAATGGG CTATATAGGG TTGTATGAAA 5580

CAGCTACTGT TTTCTATGGT CCAGACTGGG AAACATCTCA AGAAGCAAAA GCATTTACGC 5640




EP0 786 519 A2 

GGTTCAGTAT TTmCAGTACG CCGAGTGAAT CGCTAcGGAT CGTTTTTGTC GTTTAGACCA 5760

AGAGAGATTT GGAGATATTA AAGACATTAC AGATAAAGGA TATTATCAAA ACTCTTTCCA 5820 

TTATGATGTA CGTAAAGATG TTACACCTTT TGAAAAGTTA GATTTTGAAA AAGATTATCC 5880 

TTATTATGCG AGTGGTGGTT TCATTCACTA TTGTGAGTAT CCGAAATTGC AACACAATTT 5940

GAAAGCACTA GAAGCGGTAT GGGACTACTC TTATGACAAA GTTGGTTACT TAGGTACAAA 6000

TATTCCGATT GATCATTGTT ATGAATGTGA TTACGATGGA GATTTTGAAG CAACTGAAAA 6060

AGGATTTAAA TGCCCGAACT GTGGCAATGA TAATCCTAAA ACAGTTGATG TCGTTAAACG 6120

AACATGTGGT TACCTAGGCA ATCCAGTTCA ACGTCCAGTA ATTAAAGGCC GTCATAAAGA 6180

AATTTGCGCA CGAGTAAAAC ATATGAAAGC GCCTAAAGAA TGATACTTTT AGACATTAAA 6240

CAAGGACAAG GTTATATTGC TAAAATAGAA TCAAATAGCT TTGTTGACGG TGAAGGAGTA 6300

AGATGCAGTG TTTATGTATC AGGATGTCCA TTTAATTGTG TTGGATGTTA TAACAAAGCC 6360

TCACAAAAGT TCAGATATGG CGAGAAATAC ACTGATGAAA TATTAGCAGA AATATTAGAT 6420

GATTGCGATC ATGATTATAT ATCTGGGCTA AGTCTATTAG GTGGCGAACC ATTTTGTAAT 6480

TTGGATATTA CATTAAATCT TGTCAAAGCA TTTCGAGCAC GTTTTGGAAA TACAAAGACA 6540

ATTTGGGTAT GGACTGGATT TTTATATGAA TATTTAGCAA ATGATTGTAC AGAACGTCGA 6600

GAGTTATTAT CATACATTGA CGTTTTAGTA GATGGTCTAT TTATACAACA CTTATTCAAA 6660

CCTGATTTAC CATATAAAGG TTCTTTAAAT CAACGCATTA TAGATGTACA ACAATCACTC 6720

TCGCATGCGC GTATGATTGA ATATATAGTT AGTTGAATAT GTATTAGAAG TCAAGGTAAC 6780

ATTCGTTGCC TTGGCTTC1T TTTAGGTTAG GTACATAATT GAAAGTTAAT AAAAGCAATT 6840 

CTTTATAAAA ATATATTGAT AGAATATGAC CTAACAATCA TTTTGATACC AATACTAAAA 6900 

GTTGCATATC CGTTTTTTAA AAAAGTTGAA AGAGAAAAGT GGTATTTTAG TGGGAAGGAA 6960

GTCTAACTTT TTGGTAGCGT TTTACAATAA ATAAATATTC GTTAATAACG TATAAATATT 7020

CTTAAATGCC ATTCTAGTAA AATTTGTTAA ATTCGTTAAA TCGTAACTTA ACACTGTTAT 7080 

TTTAGCGCTA TTAAGGTTTT GTTTATTACG GGAAAAATTA TATAAATATT CAATAATTGC 7140 

CAAGTTTCAA ATTGTATGAA ATTTGCATTA TTATTAAATG TTAGTTATTG TCAATTTTGT 7200

GAATCAATAT AATTATTACA TTTTGAGATA AATCGAAACA GGATTCATAA AATTAATAAT 7260 

TAGGGGGAGC ACAATTGAAA AAAGAGAAAG TTATGGACTG GACGACCTTT ATAGGGACAG 7320 

TAGCTGTACT TCTTTTTGCA GTTATACCTA TGATGGCTTT TCCAAAAGCA AGTGAAGATA 7380 

TCATCACTGG TATTAATAGT GCCATTTCTG ATTCAATTGG TTCGATATAT TTATTTATGG 7440 
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TTGGTAAAGC AAGTGATAAA CCAGAATTTA ATACATTTAC ATGGGCGGCA ATGCTGTTTT 7560 

GTGCAGGCAT AGGCTCTGAT ATTTTATACT GGGGCGTTAT TGAATGGCCT TTTTACTATC 7620

AAGTTCCACC AAATGGCGCG AAAAGTATGA GTGATGAAGC ACTCCAATAT GCGACGCAAT 7680

ATGGTATGTT CCACTGGGGG CCAATTGCTT GGGCTATTTA TGTTCTACCA GCATTACCAA 7740

TTGGTTATTT AGTATTTGTT AAAAAACAAC CGGTGTATAA AATTAGTCAA GCTTGTCGTC 7800

CGATTTTAAA AGGTCAAACA GATAAATTTG TAGGTAAAGT TGTAGATATC TTATTTATCT 7860

TTGGATTGCT AGGTGGTGCG GCAACATCAC TAGCGTTAGG TGTGCCATTA ATTTCTGCAG 7920

GCATAGAAAG ATTAACTGGT TTAGATGGTA AAAATATGAT TTTACGTTCG GCCATTTTAT 7980

TAACAATCAC GGTTATATTT GCCATTAGTT CATATACAGG ATTGAAAAAA GGTATTCAAA 8040

AGTTAAGTGA TATCAACGTT TGGCTATCCT TTGTACTTTT AGCCTTTATA TTTATTATTG 8100

GACCGACTGT TTTTATTATG GAAACGACAG TGACAGGGTT CGGAAATATG TTGAGAGATT 8160

TCTTTCATAT GGCAACATGG TTAGAACCAT TCGGTGGTAT TAAAGGTCGA AAAGAAACGA 8220

ATTTCCCACA AGACTGGACA ATATTCTACT GGTCATGGTG GTTAGTATAT GCGCCATTTA 8280

TCGGTTTATT TATCGCTAGA ATTTCAAAAG GTCGACGCCT TAAAGAAGTC GTGCTAGGAA 8340

CAATTATTTA TGGAACGCTT GGATGCGTAT TATTCTTTGG TATTTTTGGT AACTATGCTG 8400

TGTATTTACA AATTTCTGGA CAGTTTAATG TAACACAATA TTTAAATACA CATGGTACAG 8460

AGGCAACCAT TATTGAAGTG GTGCATCATT TACCATTCCC ATCATTGATG ATTGTACTAT 8520

TCTTAGTATC TGCTTTCTTA TTCTTAGCAA CAACATTTGA TTCGGGTTCA TATATTTTAG 8580

CGGCAGCATC TCAGAAAAAA GTGGTAGGCG AACCATTACG TGCCAATCGT TTATTCTGGG 8640

CATTTGCATT GTGCTTATTG CCATTTTCAT TGATGCTAGT TGGTGGTGAA CGTGCATTAG 8700

AAGTATTGAA AACTGCTTCA ATACTGGCAA GTGTGCCATT AATTGTTATT TTTATTTTCA 8760

TGATGATATC ATTTTTAATC ATTTTAGGGC GCGATAGAAT TAAACTTGAA ACGCGTGCTG 8820

AAAAATTAAA AGAAGTTGAA CGTCGTTCAT TGCGAATCGT TCAAGTATCa GAAGAAGAAC 8880

AAGACGATAA TTTATAATTC AAAGCGGGTC TGGGACGACG AAATGaATTT TGTGAAAATA 8940

TCATTTCTGT TCCaTTCCCC TTTTTTTAGT AGCATTGTAG GATGAACTTT TAGGTTTTCA 9000

TTAATGTTGT ACTAAAAGAT TTAATTTTTT AGTGCTCCAA GTACTTATTT ATTGTATGAA 9060

GCATATTCTA AATCGAAGTT TGAAAGACTC TCATTGATTA TTAAATTAAA TAAAGGGTAT 9120

GCGTATGTAC AATTCAAATT AATCGAAGGA TGAAATAAAA TGACTAATCA ATTTAAAAAT 9180

AAACAGTCCA AATTACATGA CAGTTTAGAA TCCATCACAA AAAACTTATA TGCGACACCT 9240
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ACAGAATATT GTTATCTATC ATTCCGGACA CTTAGGTGAC TCCCAACAAG ACATTGCATC 936 0 

ATTAGGTGGT GTTTCAAAAG TATTGATGAA TCATGATCAT GAATCTATAG GAGGTTCTAA 9420 

TCAAGTTGAA GCCCCTTACT TTATACATGA AAATGATGTG GCTGCACTGA AACATAAGAT 9480

TTCTGTTCAA AAACAATTTA GTAATCGTGT AATGTTGGAT AAGGATTTAG AAGTTATTCC 9540

CGCGCCTGGA CATACACCAG GGACGACACT ATTTTTATGG GATGATGGTC ATCACCGTTA 9600 

CTTATTTACT GGAGATTTTA TATGTTTTGA AGGGAAGAGA TGGCGTACAG TTATATTAGG 9660 

TTCAAGTGAT AGAGAAAAAT CTATTCAAAG TTTAGAGATG GTTAAAGAAT TAGATTTTGA 9720

TGTACTTGTA CCTTGGGTTA CTATCAAAGA TGAACCGTTA GTTTATTTTG TAGAAAATGA 9780

ATATGAAAAA CGTGAACAAA TACAAAATAT TATTGATAGA GTACGTGAGG GCGAGAATAG 9840

CTAATTGAAA TATATTGGCG AAgCAATGTA ACGAATCTAA GAAAGCCCTA GAAAATACCT 9900

CCATAATTGA TTGTCATATA AAACAAAAAC GGTAATTTCT ATTTATTGAG ATAGAAATTA 9960

CCGTTTATTT CGTGGACCTA TTGCATTGTT TTTATCATGC ATAATCATCA TTGTCGTTGT 10020

TTGAGTCAAT TTTAATTTTC AGAATCAGAA GGCTGTTCTG GAATTGGGAA ATATTTGAAA 10080 

ATTTCACCGC TTTCAATCGC TTCGGTTAAC TGTTCTAACC ATTCGTAATA AACATGTGTA 10140

TGATCAAGCT GAGCTTTAAT TTTTTGTGCC TCTTGTGTTT CAGCTTCAGT TAAATCACTG 10200 

CTTTCAAGTA ATGGATTGAT AATAGCTTGA GCATCTTTTA CTGCTTCGAC ATTGATGTCA 10260 

ATTTCACGCT GGAATTTTTT AGTGAAAAAG TTTCGGAAAA AGATGAAAAA GTCTTTCTCG 10320

GCGATAAAAT GTTGTTTGCG GCTTCCTCTC GTAAATTGTT GTTTAACAAT ATCAAATTCC 10380

TGCAATTTCT TAACGCCAGC ACTCATACTT GGTTTGCTCA TTTGCAATTG ATGACGCATT 10440 

TCATCAAGCG TCATACTGCC TTCAAACACC ATTGTGCCAT ATAAGTTTCC TACACTTCTA 10500 

TTAGTGCCAT ACAAATCCAT TGTCTGTCCA ATTGAATTAA TTACAATATC TTTTGCTTGT 10560

TCTAATTGTT GCTGTTTGTT CTGAGAACGA GTCATCATTG CACCTCCGTA CATCATTTTG 10620 

GTCACGTTAA AATAAATACT AATACATTAT AAAACCTTTT CTAAAAAAAG ACATTAAAAA 10680 

TATTTAAAGC ATTAAAGTTA AATGTTTCGT TAAATAAAAA TCTAACGAAC TTACAAAACT 10740 

TAATTCTTGA GTTGTTTTGT AAATTGACAC ATTTTTCATT TCTATGCTAA CATAAGTnTG 10800

TAAAATTCGT TAAATAAAAA TTTAACAAAC TTAACGGrGG TTGTTGAAkG GrACTTTTAA 10860 

aACATTTATC TCAGCGTCAA TATATTGATG GTGAGTGGGT TGAAAGCGCG AATAAAAATA 10920

CAAGAGATAT TATCAATCCT TACAATCAAG AAGTGATATT TACGGTTTCT GAAGGGACAA 10980 

AAGAGGATGC AGAACGTGCA ATCTTAGCTG CAAGACGTGC GTTTGAGTCT GGTGAATGGT 11040 
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AACATCgCGA AgCgTTAGCA CGATTAGAAA CATTAGATAC TGGAAAAACG TTAGAAGAAT 11160 

CATATGCAGA TATGGATGAT ATTCATAATG TGTTTATGTA TTTTGCTGGA TTAGCAGATA 11220 

AAGACGGTGG CGAAATGATT GATTCACCAA TTCCAGATAC AGAAAGCAAA ATTGTTAAAG 11280 

AACCAGTAGG TGTAGTTACA CAAATTACAC CTTGGAATTA TCCGTTATTA CAAGCATCAT 11340

GGAAAATTGC GCCAGCGCTT GCTACGGGTT GTTCACTAGT TATGAAACCA AGTGAAATTA 11400

CACCATTAAC AACAATACGT GTTTTTGAAT TAATGGAAGA AGTTGGTTTC CCTAAAGGAA 11460

CAATTAATCT TATTCTAGGT GCAGGTTCTG AAGTTGGTGA CGTAATGTCA GGTCATAAAG 11520

AGGTTGACCT TGTATCATTT ACAGGTGGCA TTGAGACTGG TAAGCATATT ATGAAAAATG 11580

CTGCTAATAA TGTTACGAAT ATTGCCTTGG AACTTGGCGG TAAAAATCCA AACATTATCT 11640

TTGATGATGC TGATTTTGAA TTGGCAGTAG ACCAAGCGTT AAATGGTGGA TATTTCCATG 11700

CAGGTCAAGT TTGTTCAGCA GGATCAAGAA TATTAGTACA AAACAGTATT AAAGACAAAT 11760

TTGAGCAAGC ACTTATTGAT CGCGTGAAAA AAATCAAATT AGGTAATGGT TTTGATGCTG 11820

ATACTGAAAT GGGACCAGTG ATTTCAACAG AACATCGTAA TAAGATCGAA TCTTATATGG 11880

ATGTAGcTAA AGCAGAAGGC GCAACAATTG CTGTTGGTGG TAAACGTCCA GATAGAGATG 11940

ATTTAAAAGA TGGTCTATTC TTCGAGCCAA CAGTCATTAC AAATTGTGAT ACGTCAATGC 12000

GTATTGTACA AGAAGAGGTT TTCGGACCTG TCGTTACTGT AGAAGGCTTT GAAACTGAAC 12060

AAGAAGCGAT TCAATTAGCG AATGATTCTA TATATGGTTT AGCAGGTGCT GTATTTTCTA 12120

AAGATATTGG AAAAGCACAA CGCGTTGCTA ACAAGTTGAA ACTTGGAACG GTGTGGATTA 12180 

ATGATTTCCA TCCATATTTT GCACAAGCGC CATGGGGTGG ATACAAACAA TCAGGTATCG 12240 

GTAGAGAATT AGGCAAAGAA GGCTTAGAAG AGTACCTTGT TTCAAAACAC ATTTTAACAA 12300 

ATACAAATCC ACAATTAGTG AATTGGTTTA GCAAATAAAA ATTAGATAAG GTGAGTGCCA 12360 

TTGTAAGAAC ACAAGACACT CACTTTGTTT TGTATAAGTG GCGAAATGTT GATTGATAAT 12420

TTGGACTAAA CGCAAAATGA ATCATAGATT ATTTCATTAC TGTTAGTAAC AATCGTAAAA 12480

GGAAAAGCGA GTGTTTTGGT TAGCTAAGTT TAGCAATTCA ACGATAACCA ATCAGCCACT 12540

AACAAATATT TCATGCAATA CTCACTTTGA AATACAACAA ACTTTGGAGG TCATAACGAT 12600

GAGTAACAAA AACAAATCAT ATGATTATGT CATCATTGGA GGAGGCAGTG CAGGTTCTGT 12660

ACTAGGTAAT CGTCTGAGTG AAGATAAAGA TAAAGAAGTC TTAGTATTAG AAGCGGGTCG 12720 

CAGTGATTAT TTTTGGGATT TATTTATCCA AATGCCTGCT GCGTTAATGT TCCCTTCAGG 12780 

CAATAAATTT TACGATTGGA TTTATTCAAC AGATGAAGAA CCACATATGG GCGGTCGTAA 12840 


TCAACGTGGT AATCCAATGG ACTATGAAGG CTGGGCAGAA CCAGAAGGTA TGGAAACTTG 12960 

GGATTTTGCG CACTGTTTAC CGTATTTTAA AAAATTAGAA AAAACATACG GTGCAGCGCC 13020

TTATGATAAA TTTAGAGGCC ATGATGGACC AATTAAGTTA AAACGAGGGC CAGCAACGAA 13080

TCCTTTATTC CAGTCATTCT TTGATGCAGG TGTTGAAGCA GGCTATCATA AAACACCTGA 13140

TGTGAATGGA TTTAGACAAG AAGGTTTTGG ACCGTTCGAT AGTCAAGTAC ATCGTGGTCG 13200

CCGAATGTCA GCTTCAAGAG CATATTTACA TCCAGCGATG AAGCGTAAAA ACTTAACCGT 13260

TGAAACACGT GCCTTTGTAA CTGAAATTCA TTATGAAGGT AGAAGAGCAA CTGGTGTTAC 13320

GTATAAGAAA AATGGCAAAC TACATACCAT CGATGCTAAT GAAGTCATTT TGTCTGGTGG 13380

GGCATTCAAT ACGCCACAAT TACTACAATT ATCTGGTATC GGTGATTCAG AGTTCCTAAA 13440

ATCAAAAGGC ATTGAGCCAC GTGTTCATTT ACCTGGTGTG GGTGAAAACT TTGAAGATCA 13500
CTTAGAGG 13508

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7646 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GTAAGTATTG TCTTGATTTC CTAATAAAGT TATATCTTGT AATTCATCTT GTTGACGGCC 60

ATGTGCCATA TAAAGCGCTC CTTTAAATTT ATTTTTTTAT TATTTTGGCG TCTCGGCGTG 120

CTTTTTCAAA CATGTAATAA CTTGCACCGA TAATAACGAC GTAACCTAAT GTTGCATAGA 180
AATC5*GGAGA _ TTCTCCGAAT^ AGAATAAATC~ C^GTTn^C^ TCTGAA^CT ^ATAGATGCAT ~ 24 0 

ACGTAAAAAT AGAAATATCT TTTGCTGCTG CAAAACTATA TGCTAAAGTA ACACCAATTT 300

GACCCACAGC GGCAgCTAAG CCAGCCCCTA ATAGATAAAG TATTTGCATC TGACTCATTG 360 

GTTCATAAGT ATATGCAGTG AAAGGTATTA AAACGATGAC AGAAAATAAG GAGAAGTAAA 420 

ATACTATAGT ATATGGTGCT TyTCTTGTAC TAAGTGCTCG AACACATGTA TATGCTGATG 480

CTGCAAAAAT ACCTGAGAAT AAGCCAGCTA ATGATGGAAT CATAGATGAT GAAAATTCAG 540 

GTTTCACTAT TAAnAGCAaC CTAAAATAGC AATTATCATT GCTGTAATTT GaTACTTCCT 600 

TACCTTTTCA TGtAAGAaaA CAATGCTTaA TAAAATCGTC CAGAAAGGAT TGAGTTTCAT 660 

TAATGAATCG GCATCACTAA GTACCATATG ATCAATGGCA TAAATATTTA ACAATACACC 720 
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TGGCTGATGG TATTTATATA TAAAAAATAA TGGAATAAAC ATTG CTACT A AGTTTCGTGC 840 

TAATGATTTT TGAAAAACAG GAAGGTCACC TGCAAGTCTG AAAAACACTG ACATAAAACT 900 

GAAACCAATA GCCGAAATTA AAATGGCAAT GATACCTTTT ACTTTAGGAT TCAATTTTAT 960

CGCCTCTTTT ATATAAAATT AACGTATTTA TATTAGCATA AAACAACATG TTGTGCATAA 1020

ATAGTTGAAA TTTACTATAA AAAGACTATA ATAGACTGTA GCGAACAAAC GTTCTGTGTT 1080

TATTTGTCGG AATAATAGGG CATTACACTT TTATGAATGT TTGTGTTATT ACATAAAACA 1140

AATATCAATT CAGTATCAAG CTAATAAGCT TTTTCTTGAT TTCTGTTGAT ACAATTGAGA 1200

TTGACACAGA TTTAAAAAAA TCAAGTGATA TCTACTAAAA AATTTTTTTA AATTTGTTCA 1260

AGTTTTTCTA ATTTAGTATT GGTGCCTAGT TGGAACGTTT TACGAACATT CGATTAGAAA 1320

ATGGCACTTT AAATCATAGT GTGTCTTATG TATAATGAAA CACATAATAT AGTGTTGGTG 1380

AAACGAAAAA gACACAATAT CTTGTGTTTT GTATGCAAAT GCTTTATTTA TGAAGAAATT 1440

ACATTTAAAA GTAATTTAAC ACAGAAATTT AATAGTTATT ATCAATTAAT AGTCATATTT 1500 

TTAGAAAATG TACTGAGCAA ATGGAAGATA TCCAATGATG TAAACACTAC ATATAGTGAT 1560 

TTTTATACAT TCAACCCATA TAAGCTACTA TTTTCTCAAA TATAAATCTA TGCAATTGGT 1620

TTACATTTGA GAAAATAAGT AGCTTCATTA TAGTTAATAC AATGCTGAGA TAACCATAGT 1680

AACCATGTTG TTAAAGCATT TTTTAATTGG AATGACTACT TTATTTAAAA GGGTTGAAGA 1740

AAGAAGGTGA TCCAATGAAA ATAATATATT TTTCATTTAC TGGAAATGTC CGTCGTTTTA 1800

TTAAGAGAAC AGAACTTGAA AATACGCTTG AGATTACAGC AGAAAATTGT ATGGAACCAG 1860

TTCATGAACC GTTTATTATC GTTACTGGCA CTATTGGATT TGGAGAAGTA CCAGAACCCG 1920

TTCAATCTTT TTTAGAAGTT AATCATCAAT ACATCAGAGG TGTGGCAGCT AGCGGTAATC 1980

GAAATTGGGG ACTAAATTTC GCAAAAGCGG GTCGCACGAT ATCAGAAGAG TATAATGTCC 2040

CTTTATTAAT GAAGTTTGAG TTACATGGAA AAAACAAAGA CGTTATTGAA TTTAAGAACA 2100

AGGTGGGTAA TTTTAATGAA AACCATGGAA CAGAAAAAGT ACAATCATAT TGAATTAAAT 2160

AATGAGGTCA CTAAACGAaG AGAAGATGGA TTCTTTAGTT TAGAAAAAGA CCAAGAAGCT 2220

TTAGTAGCTT ATTTAGAAGA AGTAAAAGAC AAAACAATCT TCTTCGACAC TGAAATCGAG 2280

CGTTTACGTT ATTTAGTAGA CAACGATTTT TATTTCAATG TGTTTGATAT TTATAGTGAA 2340

GCGGATCTAA TTGAAATCAC TGATTATGCA AAATCAATCC CGTTTAATTT TGCAAGTTAT 2400

ATGTCAGCTA GTAAATTTTT CAAAGATTAC GCTTTGAAAA CAAATGATAA AAGTCAATAC 2460

TTAGAAGACT ATAATCAACA CGTTGCCATT GTTGCTTTAT ACCTAGCAAA TGGTAATAAA 2520 
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ACATTTTTAA ACGCAGGCCG TGCGCGTCGT GGTGAGCTAG TGTCATGTTT CTTATTAGAA 2640 

GTGGATGACA GCTTAAATTC AATTAACTTT ATTGATTCAA CTGCAAAACA ATTAAGTAAA 2700 

ATTGGGGGCG GCGTTGCAAT TAACTTATCT AAATTGCGTG CACGTGGTGA AGCAATTAAA 2760 

GGAATTAAAG GCGTAgCGAA AGGCGTTTTA CCTATTGCTA AGTCACTTGA AGGTGGCTTT 2820

AGCTATGCAG ATCAACTTGG TCAACGCCCT GGTGCTGGTG CTGTGTACTT AAATATCTTC 2880

CATTATGATG TAGAAGAATT TTTAGATACT AAAAAAGTAA ATGCGGATGA AGATTTACGT 2940

TTATCTACAA TATCAACTGG TTTAATTGTT CCATCTAAAT TCTTCGATTT AGCTAAAGAA 3000

GGTAAGGACT TTTATATGTT TGCACCTCAT ACAGTTAAAG AAGAATATGG TGTGACATTA 3060 

GACGATATCG ATTTAGAAAA ATATTATGAT GACATGGTTG CAAACCCAAA TGTTGAGAAA 3120 

AAGAAAAAGA ATGCGCGTGA AATGTTGAAT TTAATTGCGC AAACACAATT ACAATCAGGT 3180 

TATCCATATT TAATGTTTAA AGATAATGCT AACAGAGTGC ATCCGAATTC AAACATTGGA 3240

CAAATTAAAA TGAGTAACTT ATGTACGGAA ATTTTCCAAC TACAAGAAAC TTCAATTATT 3300

AATGACTATG GTATTGAAGA CGAAATTAAA CGTGATATTT CTTGTAACTT GGGCTCATTA 3360

AATATTGTTA ATGTAATGGA AAGCGGAAAA TTCAGAGATT CAGTTCACTC TGGTATGGAC 3420

GCATTAACTG TTGTGAGTGA TGTAGCAAAT ATTCAAAATG CACCAGGAGT TAGAAAAGCT 3480

AACAGTGAAT TACATTCAGT TGGTCTTGGT GTGATGAATT TACACGGTTA CCTAGCAAAA 3540

AATAAAATTG GTTATGAGTC AGAAGAAGCA AAAGATTTTG CAAATATCTT CTTTATGATG 3600

ATGAATTTCT ACTCAATCGA ACGTTCAATG GAAATCGCTA AAGAGCGTGG TATCAAATAT 3660

CAAGACTTTG AAAAGTCTGA TTATGCTAAT GGCAAATATT TCGAGTTCTA TACAACTCAA 3720

GAATTTGAAC CTCAATTCGA AAAAGTACGT GAATTATTCG ATGGTATGGC TATTCCTACT 3780

TCTGAGGATT GGAAGAAACT ACAACAAGAT GTTGAACAAT ATGGTTTATA TCATGCATAT 3840

AGATTAGCAA TTGCTCCAAC ACAAAGTATT TCTTATGTTC AAAATGCAAC AAGTTCTGTA 3900

ATGCCAATCG TTGACCAAAT TGAACGTCGT ACTTATGGTA ATGCGGAAAC ATTTTACCCT 3960

ATGCCATTCT TATCACCACA AACAATGTGG TACTACAAAT CAGCATTCAA TACTGATCAG 4020

ATGAAATTAA TCGATTTAAT TGCGACAATT CAAACGCATA TTGACCAAGG TATCTCAACG 4080

ATCCTTTATG TTAATTCTGA AATTTCTACA CGTGAGTTAG CAAGATTATA TGTATATGCG 4140 

CACTATAAAG GATTAAAATC ACTTTACTAT ACTAGAAATA AATTATTAAG TGTAGAAGAA 4200 

TGTACAAGTT GTTCTATCTA ACAATTAAAT GTTGAAAATG ACAAACAGCT AATCATCTGG 4260 

TCTGAATTAG CAGATGATTA GACTGCTATG TCTGTATTTG TCAATTATTG AGTAACATTA 4320 
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ATGTTTTGGA GACAAAATAT ATCTCAAATG TGGGTTGAAA CAGAATTTAA AGTATCAAAA 4440 

GACATTGCAA GTTGGAAGAC TTTATCTGAA GCTGAACAAG ACACATTTAA AAAAGCATTA 4500

GCTGGTTTAA CAGGCTTAGA TACACATCAA GCAGATGATG GCATGCCTTT AGTTATGCTA 4560

CATACGACTG ACTTAAGGAA AAAAGCAGTT TATTCATTTA TGGCGATGAT GGAGCAAATA 4620

CACGCGAAAA GCTATTCACA TATTTTCACA ACACTATTAC CATCTAGTGA AaCAAACTAC 4680

CTATTAGATG AATGGGTTTT AGAGGAACCC CATTTAAAAT ATAAATCTGA TAAAATTGTT 4740

GCTAATTATC ACAAACTTTG GGGTAAAGAA GCTTCGATAT ACGACCAATA TATGGCCAGA 4800

GTTACGAGTG TATTTTTAGA AACATTCTTA TTCTTCTCAG GTTTCTATTA TCCACTATAT 4860

CTTGCTGGTC AAGGGAAAAT GACGACATCA GGTGAAATCA TTCGTAAAAT TCTTTTAGAT 4920

GAATCTATTC ATGGTGTATT TACCGGTTTA GATGCACAGC ATTTACGAAA TGAACTATCT 4980

GAAAGTGAGA AACAAAAAGC AGATCAAGAA ATGTATAAAT TGCTAAATGA CTTGTATTTA 5040

AATGAAGAGT CATACACAAA AATGTTATAC GATGATCTTG GAATCACTGA AGATGTGCTA 5100

AACTATGTTA AATATAATGG AAACAAAGCA CTTTCAAACT TAGGCTTTGa ACCTTATTTT 5160 

GAGGAACGTG AATTTAACCC AATCATTGAG AATGCCTTAG ATACAACAAC TAAAAACCAT 5220 

GACTTCTTCT CAGTAAAAGG TGATGGTTAT GTATTAGCAT TAAACGTAGA AGCATTACAA 5280

GATGATGACT TTGTATTTGA CAACAAATAA CAATTAAATT AAAAGACCTT CACATGTAAA 5340

GGGAAATAGC GATTCGTTTC GTCTTGTCTC CTACATGTTG AAGGTCTTTT TTTATGTGTA 5400

TCTAACTCAT TATGAGTCTG AGTAAGAAAT CAATGCTCTA AGATGTACAA TGCTATTTAT 5460

ATTGGCAGTA GTTGGCGGGG CCCCAACACA GAAGCAGGCG GAAAGTCAGC TAACAATATT 5520

GTGCAAGTTG GCGGGGCCCC AACATAGAAG CAGGCGGAAA GTCAGCTAAC AATAATGTGC 5580

AAGTTGGCGG GGCCCCAACA TAAAAGCAGG CGGAAAGTCA GCTAACAATA TTGTGCAAGT 5640

TCGGgCGGGG CCCCAACATA AAGAAAAACT TTTTCCTTTA GAAATTATCA CTTCCaCaTG 5700

AGTTTTACTC ATGTATTCCT ATTTTTAAGT ACACATTAGC TGAGGCTAAT GTTAAGAACC 5760

ACTACTTAAT CAATCATTAG TAGTTTTTAT CATTTCCACT ATTCCCaGAC ATCaAAATCT 5820

TAAGTGTTCT ATTTTACTTT AAGTAAACAA AATACACATT CCGAAAAATT AAATTTCAGT 5880

TTAATTGCAA ATATCAATAA AATTGACACT AAATTATTTG AAAGGCTATT GAAATTATGG 5940

TCAAAAAACG CTACTATTAA TGAGAAATAT TATCAATGAT AATGATTATC ATTAATTTAA 6000

AGGGAGAAAA ATTTGTAATG AAGTATTTAT TAAAGGGAAA TATTTTGCTT CTATTACTAA 6060

TATTGTTGAC AATTATTTCG TTGTTCATAG GTGTGAGTGA ACTATCAATT AAAGATTTAC 6120
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GTATTTTAAT TGCTGGAAGT TCGTTGGCTT TAGCAGGCTT GATAATGCAA CAAATGATGC 624 0 

AAAATAAGTT TGTTAGTCCG ACTACAGCTG GAACGATGGA ATGGGCTAAA CTAGGTATTT 6300

TAATTGCTTT ATTGTTCTTT CCAACCGGTC ATATTTTATT AAAACTAGTA TTTGCTGTTA 6360

TTTGCAGTAT TTGCGGTACG TTTTTATTTG TTAAAATCAT TGATTTTATA AAAGTGAAAG 6420

ATGTCATTTT TGTACCGCTT TTAGGAATTA TGATGGGTGG GATTGTTGCA AGTTcACAAC 6480

CTTCATCTCA TTGCGCACGA ATGCTGTTCA AAGCATTGGT AACTGGCTTA ACGGGAACTT 6540

TGCCATTATC ACAAGTGGAC GCTATGAAAT TTTATATTTA AGTATTCCTC TTTTAGCATT 6600

GACATATCTT TTTGCTAATC ATTTCACGAT TGTAGGAATG GGTAAAGACT TTACTAATAA 6660

TTTAGGTTTG AGTTACGAAA AATTAATTAA CATCGCATTG TTTATTACTG CAACTATTAC 6720

AGCATTGGTA GTGGTGACTG TTGGAACATT ACCGTTCTTA GGACTAGTAA TACCAAATAT 6780 

TATTTCAATT TATCGAGGTG ATCATTTGAA AAATGCTATC CCTCATACGA TGATGTTAGG 6840

TGCCATCTTT GTATTATTTT CTGATATAGT TGGCAGAATT GTTGTTTATC CATATGAAAT 6900

AAATATTGGT TTAACAATAG GTGTATTTGG AACAATCATT TTCCTTATCT TGCTTATGAA 6960

AGGTAGGAAA AATTATGCGC aACAATAATA AAAAAATAAT GCTTTTAATT GCAGTAACGT 7020

TATTAATTAG TATGCTGTAC TTATTTGTAG GTATTGATTT TGAAATATTT GAATATCAAT 7080

TTTCAAGTCG TTTAAGAAAG TTCATATTAA TTATTTTAGT AGGTGCTGCC ATTGCAACTT 7140

CAGTGGTGAT TTTTCAAGCG ATTACAAATA ACCGTCTATT GACACCATCA ATAATGGGGT 7200

TAGATGCAGT TTATTTATTT ATCAAAGTAT TGCCAGTCTT TTTATTTGGA ATTCAATCGG 7260

TATGGGTTAC TAATGTATAT TTGAACTTTA TATTAACACT TATAACGATG GTGTTATTCG 7320 

CACTAATCCT ATTCCAAGGT ATCTTTAAAA TCGGACATTT TTCAATTTAT TTTATCTTAC 7380 

TTAJTGGTGT CCTTTTAGGA ACATTTTTTA GAAGCATAAC AGGTTTTATT CAACTGATTA 7440

TGGATCCTGA GTCATTTTTA GCAATACAAA GTAGTATGTT TGCTAATTTT AATGCTTCTA 7500

ATTCGAATTT AGTTACTTTC TCAGCAGTGC TATTAGTAAT CTTATTAGTC ATTACAATTT 7560

TACTATTGCC TTATTTAGAT GTATTGCTTT TAGGTCGTGC TGAAGCAATT AATCTTGGGA 7620

TATCGTATGA AAAATTAACG CGAATT 7646

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1194 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 122: 

ATGAATATAT TTnnAAATAA ATTATTATGG ATTGCACCAA TnGCCACTAT GATTATCTTG 60

GTAATCTTTT CTTTAGCTTT TTATCCTGCA TATAATCCTA AACCAAAAGA TTTACCAATT 120

GGTATATTAA ACGAGGATAA AGGTACAACG ATTCAAGATA AAAATGTTAA CATTGGTAAA 180

AAATTAGAGG ATAAATTATT AGATAGTGAT TCTAATAAAA TTAAATGGGT TAAGGTTGAT 240

AGTGAAAAAG ACCTTGAAAA AGATTTGAAA GATCAAAAAA TCTTTGGAGT AGCTATTATT 300 

GATAAAGACT TTTCAAAAGA TGCTATGAGT AAAACACAAA AAGTAGTTAT GGATAGTAAA 360 

AAAGAAGAAA TGCAACAAAA AGTTGCTTCA GGTGAAATTC CGCCACAAGT GGTTCAACAA 420 

ATGAAACAAA AAATGGGGAA TCAACAAGTA GAGGTTAAGC AGGCTAAATT TAAAACGATT 480

GTAAGTGAAG GATCAAGCTT ACAAGGTTCA CAAATTGCAT CAGCTGTGTT AACTGGTATG 540

GGTGATAATA TTAATGCTCA AATTACGAAG CAAAGTTTGG AAACATTAAC GAGTCAAAAT 600

GTTAAAGTCA ATGCCGCGGA CATCAATGGT TTGACGAATC CAGTAAAAGT GGATAATGAA 660

AAACTTAATA AAGTTAAAGA TCACCAAGCA GGTGGTAATG CACCATTCCT AATGTTTATG 720

CCAATTTGGA TAGGTTCAAT CGTAACGTCT ATCTTATTGT TCTTTGCATT TAGAACTAGT 780

AACAATATCG TCGTGCAACA TCGTATCaTT GCtTCAATTG GACAGATGAT ATTTGCAGTT 840

GTTGCAGCAT TTGCAGGTAG CTTTGTTTAT ATTTATTTCA TGCAAGGCGT TCAAAGATTT 900

GATTTTGACC ATCCAAATCG TATCGCAATT TTTGTAGCAT TTGCGATTCT TGGTTTCGTG 960

GGCCTTATTT TAGGTGTTAT GGTATGGCTA GGTATGAAGT CAGTTCCAAT TTTCTTCATT 1020

TTAATGTTCT TTAGTATGCA ACTTGTAACG TTACCTAAAC AAATGTTGCC TGAAAGTTAT 1080

CAAAAATATG TATATGATTG GAATCCATTC ACACACTATG CAACAAGTGT AAGAGAcTAT 1140

TATAClTGAA TCATCATATT GAATTAAATA GTACAATGTG GATGTTTATA GGGT 1194
(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 558 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
GACCGACCTA TACATCCGTA TAAGTATTTC TTGATATAAG TCTTCTAAAT CATAATGATT 60 
AAATCCAAAT GTTTTGATGC GTCGAATAAT TAATGGTTGT AGATCCATTA CTAACTTTTC 120 
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GTATTTCAAA TATTAAACTA ACCCCTTCTA TCTAAAATTT AAGGTTAGTT TAATATTGTT 24 0 

ACATTCAAAA TTTCAAGATG ACGGAAATGT CATTTCTTAT GATGTCCTCT TCGTATTTTT 300 

TCAAATTCTG CAAGGATTTC AGAAGATAAC GGAATTCGAG TTCTTGGCTT GTTTTCACTT 360 

ATATCATCTA ATGATTTACT CACATCAATT TCATTTTCTT TTAAATCTCT CCACATTTCG 4 20 

CGAGATGATA TTCTATATGC ACCTGATCCA AAGATAGCAT GTTGcTCACT CaTATCACTT 4 80 

GTTACAACTG TAATATGcTT AGtATGCTTG t CaTAAAG t T CaTAAACCAT AACGGTTCTA 540 

ATGGAAACCA ATCAGCTG 558 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



GCTTCAGACA TtlTGATGATA TAATCTCTCA TCATCGATTA ATTCTTTTGC AGCTTGATAC 60

ACATnTTGCT TATTTGTTCC AATGACTTTT AATGTGCCAG CTTCAACACC TTCAGGACGT 120

TCTGTAACAC TTCGCCAAAA CTAAAACTGG CTTATTAAAT GATGGCGCTT CTTCCTGAAT 180

TCCACCTGAA TCTGTCAAAA TAAAATAAGA TTTTnTAGCA AAATTATGGA AATCTATACG 240

TCCAAAGGTT CAATCAATTC AATTCTGTCA TGACTACCTA AAATCTTTTG AGCCACCTCT 300

CGAACTTTCG GGTTTTTATG CATTGGATAT ACCAGTGCTA AATCAGTATA CTCATCTATT 360

AAGCGTCTAA CCGCTTTAAA TATATTTTCC aTGGGTTTCC CGATATTTTC TCGTCGGTGT 420

GCTGTCATrA GAATGAATTT kTtGTCATGG TATTTATCCA TGATGTTAGA TTTATAATTG 480

TCATCAACTG TATATTTCAT AGCATCAATC GCAGTATTAC CAGTGACAAC AACACTTTCT 540

GAATATTTCC CTTCACTTAA CAAATGCGAT GCAGCATTTT TAGTAGGTGC AAAATGTAAG 600

TCAGCTAATA CACCAACTAA TTGTCTATTC ACCTCTTCTG GAAAAGGTGA ATATTTATCA 660

TAACTTCTAA GCCCTGCTTC AACGTGTCCA ATCGGCACTT GGTTATAAAA TGCCGCTAAA 720

CCACCTGCAA ATGTCGTCAT CGTATCACCA TGTACAAGTA CCATGTCTGG TTTTTCTAAT 780

TGAATCACTT GTTCTAATTG AGTGATTGAT TTAGAAGTTA TCTCAGAAAG TGTCTGTCCT 840

GATTTCATAA TATTCAAATC GTATTTTGGT TTGATTTCAA AGGTACTTAA TACTGAATCA 900

AGCATTTCTC TATGCTGTGC TGTAACAACA ACAATTGGCT CGAGCATTTT TTCTTGTTCC 960
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ATCTTTTTCA TCAAACTACT TATCTCCGAT TCTTCTATTT AGTACCAAAC AATCTATCTC 1080

CAGCGTCGCC TAACCCTGGT GTGATATATG CTTTGTCATT aGCTTTTCAT CAAGTGCAGC 1140

AATATAAATA TCTACATCTG GATGTGCTTC ATGCATCTTT TCTACGCCTT CTGGTGCTGC 1200

AATTAAACAC ATGAAGCGAA TATTTTTAGC GCCACGTTTC TTCAATGAAG TAATAGCTTC 1260

AATTGCTGAT GCGCCTGTTG CTAACATAGG ATCAACAACA ATGATTTGTC TTTCAGTAAT 1320

ATCTTGAGGT AACTTAGCAA AATACTCTAC AGCCTTTAAT GTTTCGGGAT CTCGATATAA 1380

ACCGATATGT CCAACTCTGG CTGCAGGTAC TAAACTTAAA ATACCATCAG TCATACCTAA 1440

ACCAGCTCTT AAAATTGGAA CGATAGCTAA TTTTTTACCA GCTAATCGTT TAGCCGTCAT 1500

TTTAGTTACA GGCGTTTCAA TATCAACATC CTGAAGCTCT AAGTCTCTAG TTACTTCATA 1560

TGCCATCAAC ATACCAACTT CGTCTACAAG TTCTCTAAAT TCTTTAGTAC CTGTATTTAC 1620

ATCTCTAATA TAGCTTAGTT TGTGTTGAAT TAATGGATGA TCGAAAACGT GTACTTTACT 1680

CATAAAAATT ACTCCTATCT TTGTGTATGT TTATTGATAT AGAGGATATT CAGCTGTTAA 1740

TTTCGCAACG CGTTCTTTAG CTTGTTGTAA TTTTTCTTCA TCTTTACTAT TTTTCAATGC 1800

TAAACTGATG ATTTTTGCAA CTTCCTCAAA AGCTTTTTCA TCAAATCCAC GCGTTGTTGC 1860

AGCAGGTGTA CCTAAACGTA TACCACTCGT TACAAAAGGT TTTTCTTGAT CGAACGGAAT 1920

GGTATTTTTG TTACATGTGA TACCAACTGA ATCTAAAGTC TCTTCAGCTT CTTTACCAGT 1980

AAGTCCTATA GACCCTTTTA CATCAACAGC TACTAAGTGA TTATCTGTAC CGCCAGAAAC 2040

AATTCTAAAT CCTTCATTAA TTAATGCTTC TGCAAGAACT TTTGCGTTTT TAACCACTTG 2100

TTGTTGATAC GTTTTGAAAT TATTTTCTAA CGCTTCTCCA AAAGCAACTG CTTTtGCTgC 2160

AATAACATGC TCAAGAGGTC CACCTTGAAT ACCAGGGAAA ATTGTTTTAT CTATGTCTTT 2220

TTTATATTCT TCCTTACATA AAATCATACC ACCACGtGGT CCGcGTAATG TTTTGTGTGT 2280

TGTAGTTGTT ACAAAATCAG CATATTCTAC TGGATTTGGA TGTAAACCTG CCGCTACTAA 2340

TCCTGCAATA TGTGCCATGT CTACCATTAA CTTAGCGTTT ACTTCATCTG CGATTTCTTT 2400

AAACTTTTTG AAGTCAATTG TTCTTGAATA TGCTGATGCT CCTGCCACAA TAAGCTTAGG 2460

CTTATGCTCT AACGCTAATT TACGAACTTC ATCATAATTG ATTCGTTCTG TGTCTTTATC 2520

TACTCCATAT TCAACGAAAT TGTAGAATTT ACCACTAAAA TTAACAGGCG CTCCATGTGT 2580

CAAGTGACCA CCATGACTCA AATTCATACC TAAAACTGTG TCGCCCATTT CTAATGCAAC 2640

TAAGTAAACA GCCATGTTCG CTTGTGAACC TGAATGTGGT TGAACATTGA CATGTTCAGC 2700

TCCAAACAAT GCTTTAGCAC GATCAATTGC GATGCTTTCA GTAACATCTA CAAACTCACA 2760
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TTGTGCTTCC ATAACCGCTT CCGATACAAA ATTTTCCGAT GCGATTAACT CTATGTTGCT 2880 

ATTTTGTCTC TGAAATTCTC TCTCGATTGC TTCTGCGATA ACTTTATCTT GCTTGGTGAT 2 940 

ATAAGACATA AAATCTCCCC TTCTTTCAAA AAAACTTATT GGTATTTAGC ACGTTCGCCA 3000 

CCAATCTTTT TCGGCCTAGA TGTGGCAATA GTTACAATTG CCTGTCCTAC TTGCTTTACT 3060 

GAGGTCCTTA CAGGTACACA TACATGTTTA ATATG CATGC CTATTAACGT TTGACCAATA 3120 

TCAATTC CAC AAGGAACAGT AATATGTTCG ACCACGATCG GATCCTTCAT ATGCTGAAAA 3180 

GCGTATGTTG CCAAACTCCC TCCAGCATGT ACATCTGGAA CGACGGAAAC TTCTTCCATT 3240 

GTTAATGGAT TATACTGAGA TTTTTCTATT GTTATCGCTC TGTTGATATG TTCACATCCT 3300 

TGAAAAGCAA AAGTAACGCC TGTCTCTTTA CTCACAACAT CTAATGCATT AAAAATAGTT 3360 

TCTGCAACTT CCaTCGAACC GACAGTCCCT ATTTTTTCGC CAATGACTTC CGATGTTGAA 3420 

CATCCAATTA AACATATATC TCCTTTATTA AAAAAGGACA TATCTTTTAA TTCGTCTAAT 34 80 

AACATTG TC A AATCTTTCAT AAAAGCCCAC CCTTCCTAAA AATAAAAAAG G AAT AT AG CA 354 0 

AAGTGCTACA CTCCTCTATT ATAACTTATT TAACTGTTAA CATATACTAA TTATACAGAA 3600 

2s TTCCTACTAG CAAATAATAT CTTTTAATTT TAAAATTAAA CTTACAAGTT CTTCATAGGT 3660 

ATGTACATAC ATTTCTTTTG TTCCACCGTA TGGATCTATA ACTTCTCCTG CTTCTTTtAC 3 720 

ATATT CATGC AATGTGAAAA CATGATTTTG CAAACCAAAG TGTGCCTCTA TTAATTCTTT 3780 

30 GTGCGAATAC GACATCGTCA AAATAATATC TGCTTTCAAA TCTGCTTCAG TAAATTGTTG 3 84 0 

CGATAAGGTC GTTTCAGCTA AATGATGTTC TTCAACTAAG TCTTCAACAT AATTCGAAAC 3 900 

ACCTTGATTG TTCACAGCGA ATATACCTCT TGATTCAAAT TGATGATTTG GCATAACCTC 3 960 

TTTTGCAATA CTTTCCGCTA ATGGG CTACG ACATGTGTTA CCTGTACAAA CGAATAAAAT 4 020 

_ CTTCATAGTT CACATCCTTT-AATAATGTGA-TTACCTGCAG- CTTTTAACAT- GCGATTCATA 4 O80_ 

ATTGCTTCTG TATTATCATT CAGCTCAAAG CCGTATATAT ACGCCGCTGA AATATTTTCA 414 0 

TTTTCATCAA GTGAATGTAA CACATCATAA AGATTATGAC TTGCTTGTTT AACATCATTG 4200 

TCATCCTGAC ATAATTGAAT GAATTGCGCT TCACTTGGTA TAAACGCCAC CTTATTACTC 4260 

GGCACAATAA AAGCTATAGA AGACCAATCT TTACCGTCAT TTCCAATTTT GCTCTCAATA 4320 

TCTGTAATAA TTGTAAGTGG TGTATTGGGT GAGTAATGCT TATACTTCAT ACCTGGTGCA 43 80 

ATTGGCTGTT CAGTATCATT AT AAT CAG CA TGGGCGATAC TATTCGGAAG TATTTCTGTA 444 0 

50 ATCATTGCTG CTGTTATAGA ACCAGGTCTT GCAATTTTAT AAGGAAAAGA TGTGCAATCT 4 500 

AAAACCGTAC TTTCTAATCC TTCTTCACTT TGTTCAGCTT GAACAAT AC C ATCGATACGG 4560 
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GCACTTGGAG CAGCTAGAGG TTCATTTATG ATTTGTAATA ATTGTCTACC TACAGAATGG 4680

CTTGGCATTC TAACAGCAAC TGATGATAAA CCTCCAGAAA CTTTTCGACA TAGATAGCCT 4740

AGCTTTAACG GCAATATAAA CGAAATAGGG CCCGGCCAGA ATGCCTGCAT TAACTTTTCT 4800

ACGCGTGGAT CCAAAGTATA TGTAAAATCT TTTAATTGAC CTTTACTGTG TATATGAACA 4860

ATAAGCGGAT TGTCAGATGG ACGGCCTTTA GCTTCATATA TTTTAGCTAC AGCTTCTTCA 4920

TCTGTCGCAT TTGCTGCAAG TCCATAAACT GTTTCAGTTG GTAAACCTAT TAAACCACCG 4980

TTTAAAACAA TGTCTTTTAT TTCATTAATT TTAGGATATT GCTGTAAATC TTCATTATAT 5040

TCTCTAACAT CCCAAATTTT AGTATCCAAC TTAATCACGC CTTTCTTATT TATCATAATA 5100

TAAAGCAAAA AGCTATGCAC TTAACTAATC ATAGCAAAGG CATAACTTCT AATTACCATT 5160

TAAATGAGAC GATTCGATCG TGGCCATTTA TATCTTTAAT AATGTCGATT TTTTTGTCAG 5220

GAAATTTATT TAAAATTATT GATTTAAGTG CCTCACCTTG ATTGTAACCA ATTTCAAAAA 5280

CAACTGGGCT gcctttttcc ATAACGTGAG GTAAATCTTC AATGATTGAT TCATAAATAG 5340

CATATCCATG GTTATCTGCA AACAATGCCT GATGTGGTTC GAATCTCGTA ACCGTTGGAG 5400

ACATCGTAAC CATATCTTTT TCATCTATAT ATGGTGGATT AGATATCAAG CCGTTCAACT 5460

TGATACCTTC ATTAATTAAG GGCTTTAATG CATCCCCTGT TAAAAATTGT ATTTGTGATT 5520

GATGCTTCTC AGCATTATTA CGAGCCATAT TCATTGCTTC AAGTGAAATA TCAGTAGCAA 5580

TAACATTTAA ATCCGGCTTT TCACATTTCA AAGTAATTGC AAGTACACCA CTACCCGTTC 5640

CGATATCTAC GATTGTTGCA TCATCTTCTA ACTGTTGTAA GAAATGCAAC ATTACTTCTT 5700

CAGTTTCAGG TCTTGGTATC AAACAATTTG AGTTTACATC AAACGTTCTA CCATAAAATG 5760

AGGCAAAGCC AACTATATAC TGTATAGGCT CTCCTAATAA CATACGTTGT AATGCTAAGT 5820

CGAACTTCAT AATCATCGCT TTCGGCATAT CATCATGCAT GTGGACTACA AAGTCCGTAC 5880

GCGTCCATTG AAATACATCT AACATTAACC ATTCAGCTCG TGTTTGTTCA AACCCTTTTT 5940

GTTGTGTTAA ATGAATTGCT TCATCTAACT TTTCTTTATA ATTCACCATT ATTAAGTTCT 6000

TTCAATTTAT CTGTCTGCTC TGATAAAGTC AGTGCATCTA TAATTTCTTC TAAATGGCCT 6060

TCCATAATTT GCCCTAATTT TTGAAGCGTT AGACCTATAC GATGGTCTGT TACACGGCTT 6120

TGTGGATAAT TATAAGTTCG AATACGTTCT GAACGATCAC CAGTACCGAC TGCTGATTTA 6180

CGTTGTGACG CATACTTTTG TTGTTCTTCT TGAACTTTCA TATCGTATAA ACGTGCTTTT 6240

AACACTTTCA TTGcrrriTc ACGGTTTTGA ATTTGAGACT TCTcAGAAGA TGTTGCAATG 6300

ACACCAGTTG GTAAATGGGT AATACGTACT GCAGAGTCAG TTGTGTTTAC GTGCTGACCA 6360
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ACATCTTCAA CTTCTGGTAA AACTGCCACT GTAGCTGTTG AAGTATGAAT ACGTCCACCT 64 8 0 

GATTCTGTTT CAGGCACACG TTGAACGCGG TGCGCACCAT TTTCAAATTT CAATTTACTA 654 0 

TACGCGCCAT TACCAGAAAC TGAGAAACTA ATTTCTTTGT AACCACCATG GTCACTTTCA 6600 

GACGCTTCTA CTATTTCAGT TTTGAATCCT TGTGATTCAG CATACTTTGA ATACATACGC 666 0 

ATTAAATCAC CAGCAAAAAT CGCAGCCTCA TCACCACCTG CTG CTGCTCT TATTTCTACA 672 0 

ATAACGTCTT TGTCATCATT AGGATCTTTA GGAATCAATA ATATTTTAAG CTCTTCTTCA 67 8 0 

AGATTTGGAA GTTCAGCTTT AATACCATTA CTCTCCTCTT TTAACATTTC TACTTCTTCT 6 84 0 

TTATCATCAG TCTCACTTAA CATTTCTTCA ATATCAGCTA ATTCTTCTTT TTTAGCTTTA 6900 

TAGTTACGAT AAACATCTAC AGTTTTTTGT AAATCAGCTT GCTCTTTAGA ATATTTACGT 6960 

AATTTATCTG AATCATTTAC AACATCTGGG TCACTTAACA GTTCATTTAA CTGTTCGTAT 7020 

CTTTCTTCTA CAATATCTAA TTGATCAAAC ACTTATAATT CCTCCTTATT ATTATCACTA 70 80 

GGTGCTACGA TATGGTGCGC GCGACAACGT GGCTCATAAC TTTCATTGGC ACCTACTAAG 714 0 

ATAATCGGAT CATCGATTTT AGCTGGTTTA CCATTTATTA ATCGTTGCGT TCTACTAGAT 7200 

25 GAAGAACCAC AAACAGCACA AACTGCTTGA AGTTTCGTTA CTTGTTCACT GACAGCCATC 7260 

AATTTAGGCA TTGGTTCGAA CGGTTCGCCC CTAAAATCCA TATCTAATCC AGCAACAATA 732 0 

ACACGGTGTC CATCTGCTGA TAGTTTTTCT ACTATACTTA CAATTTCATC GTCAAAAAAT 73 80 

TGCAcTTCGT CTATTCCTAT AACATCAACA TTAGTTAAGT CGTG CGTCAT AATTTCACTT 744 0 

GCTTTAGAAA TATTAATCGC TTCAATGGCA TTACCATTAT GAGAGACCAC TTTTTCTTTA 7500 

TGATATCGAT CATCAATCGC CGGTTTAAAT ACAACGACTT TTTGTTTAGC GTATATACCC 7560 

CTTCTTAGAC GTCTTATTAG TTCTTCGGAT TTACCGCTAA ACATACTACC TGTAATACAT 7620 

JTOTATCCAAC CGGAATGGTA -AGTTTCATAC-ATTGAGAGTn— CCAGCTTTT'T GAAAACATAA 7680 " 

TCGCTTTATT ATATCATATT TCAAATATTC ATAAATGTCT TTnTCATAAT TATATCGATA 774 0 

TTGTACATGA ACAATTATTT TA 7762 
(2) INFORMATION FOR SEQ ID NO: 125: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
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TAAAAAAATT ATTATCAATG ATGAACTAGA ATTGACTGAA TTCCACCAAG AACTTACTTA 120 
TATTTTAGAC AACATAnAAG GGAATAATAA TTATGGTAAG GAATTTGTTG CAACCGTTGA 180 
AGAAACATTC GACATTGAAT AaAGCGGGGT GgaAGCACTA TGAATCAATG GGATCAGTTC 24 0 

TTAACACCTT ATAAGCAAGC GGTTGATGAG TTGAAAGkGA AcTTaAAGGC ATGCGCAAAC 3 00 

AATATGAAGT TGGTGAACAA GCGTCGCCAA TAGAATTTGT TACTGGTCGT GTTAAACCAA 360 
TCGCTAGTAT TATAGATAAG GCAAACAAAC GACAAATACC ATTTGATAGG TTAAGAGAAG 420 
AAATGTACGA TATCGCTGGT TTAAGAATGA TGTG CCAATT TGTTGAAGAT ATTGATGTTG 4 80 

TCGTCAATAT TTTAAGACAA AGAmAAGATT TTAAAGTAAT TGAAGAACGA GATTATATTC 54 0 

GTAACACTAA AGAAAGTGGT TACCGCTCGT ATCATGTCAT TATTGAATAT CCAATTGAAA 600 
CATTACAAGG CCAAAAATTT ATATTGGCTG AGATTCAGAT TCGTACATTA GCAATGAATT 660 
TCTGGGCAAC GATTGAACAT ACTTTACGAT ATAAATATGA TGGTGCTTAT CCGGATGAAA 720 
TTCAACATCG TTTGGAAAGA GCGGCAGAAG CAGCGTATTT ACTTGATGAA GAGATGTCTG 780 
AAATTAAAGA TGAAATTCAG GAAGCTCAAA AATATTACAC GCAAAAACGT TCTAAAAAAC 840 
2S ATGAAAATGA TTAACGAGGT GTTATAAATC ATGCGTTATA CAATTTTAAC TAAAGGTGAC 900 

TCCAAGTCTA ATGCCTTAAA GCATAAAATG ATGAACTATA TGAAAG r TTT TcGCATGaTT 960 

GaGGATrGTG AAAaTCCTGA AATTGTTATT yCAGTTGGTG GTGACGGTAC ATTACTACAA 1020 

30 GCATTCCATC AGTATAGCCA CATGTTATCA AAAGTGGCAT TTGTTGGAGT TCATACAGGT 108 0 

CATTTAGGAT TTTATG CGGA TTGGTTACCT CATGAAGTTG AAAAATTAAT CATCGAAATT 114 0 

AATAATTCAG AGTTTCAGGT CATTGAATAT CCATTGCTTG AAATTATTAT GAGATACAAC 1200 

GACAACGGCT ATGAAACAAG GTATTTAGCA TTAAATGAAG CAACGATGAA AACTGAAAAT 1260 

GGCTCAACAC TTGTTGTGGA TGTTAACTTA AGAGGGAAAC ACTTTGAGCG ATTTAGAGGC 1320 

GATGGATTAT GTGTATCAAC ACCTTCGGGT TCAACGGCTT ATAACAAAGC GCTAGGTGGC 1380 

GCACTGATAC ATCCTTCACT TGAAGCAATG CAAATTACAG AAATTGCCTC GATAAATAAT 144 0 

CGTGTGTTTA GAACGGTAGG ATCACCACTT GTATTACCAA AG CAT CATAC ATGTTTAATA 1500 

TCACCAGTTA ATCATGATAC CATTAGAATG ACGATAGATC ATGTTAGTAT CAAACATAAA 1560 

AATGTTAATT CAATACAATA CCGTGTAGCA AATGAAAAAG TGAGGTTTGC ACGTTTTAGA 1620 

CCATTCCCAT TCTGGAAACG TGTGCACGAT TCTTTCATAT CAAGTGATGA AGAACGATGA 16 80 

SO AATTTAAGTA TCATATATCA CAACAAGAAA CTGTTAAAAC TTTTTTAGCA CGACATGATT 174 0 

TTTCTAAGAA GACAGTGAGC GCCATTAAAA ATAATGGCGC TTTAATTGTT AATGATGAAC 1800 
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AAATACCGAG TGTTAATTTA AT AC CTTATG CTCGTAAGCT AGAAGTATTG TATGAAGATG 1920 

CTTTTATCAT CATAGTTACT AAACCAAACA ATCAAAATTG TACGCCTTCG AGAGAACATC 198 0 

CTCATGAAAG TTTAATCGAA CAAGTACTAT ATCATTGTCA GGAACATGGT GAAAATATTA 2040 

ACCCACATAT TGTTACG CGT CTAGATCGTA ATACAACTGG TATTGTGATA TTCGCTAAAT 2100 

ATGGACATAT CCATCATTTA TTTTCTAAAG TAAACTTGAA AAAAATATAT ACTTGCCTTG 2160 

TATATGGTAA AACCCATACA TCTGGTATTA TTGAAGCTAA TATTAGACGG TCAAAGGATA 2220 

GGATTATAAC TAGAGAAGTT GCCTCGGATG GTAAATACGC TAAAACATCT TATGAAGTAA 2280 

TAAATCAGAA TGATAAATAC AGTTTATGCA AAGTTCATTT GCATACGGGA CGTACACATC 234 0 

AAATTCGTGT ACATTTTCAA CATATTGGGC ATCCAATTGT GGGAGATTCT TTGTATGATG 2400 

GTTTTCATGA CAAAATTCAT GGTCAAGTAC TGCAATGTAC GCAAATATAT TTTGTTCATC 2460 

CAATCAATAA GAACAATATT TATATTACAA TTGATTATAA GCAATTACTT AAATTATnCA 252 0 

ATCAACTCTA ATnCACACAG GGGGTGTAAG TATGTCAATG AnCACAGATG AAAAAGAGCG 2580 
TGT 

(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1818 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATCAAGTGAT ACATTTAACT GGTAAAGGAT TAAnAGATGC TCAAGTTAAA AAATCnGGAT 60 

- ATA^ACAATA^ TCAATTTGTT-AAAGAGGATT^TnACAGATTT -ATTnGCAATT- ACGGATACAG — . - - 120- 

TAATAAGTAG AGCTGGATCA AATGCGATTT ATGAGTTCTT AACATTACGT AT AC CAATGT 18 0 

TATTAGTACC ATT AGG TTT A GATCAATC CC GAGGCGACCA AATTGACAAT GCAAATCATT 24 0 

TTGCTGATAA AGGATATGCT AAAGCGATTG ATGAAGAACA ATTAACAGCA CAAATTTTAT 30 0 

45 TACAAGAACT AAATGAAATG GAACAGGAAA GAACTCGAAT TATCAATAAT ATGAAATCGT 360 

ATGAACAAAG TT AT AC G AAA G AAG CTTT AT TTGATAAGAT GATTAAAGAC GCATTGAATT -4 2 0 

AATGGGGGGT AATGCTTTAT GAGTCAATGG AAACGTATCT CTTTG CTCAT CGTTTTTACA 480 

SO TTGGTTTTTG GAATTATCGC GTTTTTCCAC GAATCAAGAC TTGGGAAATG GATTGATAAT 54 0 

GAAGTTTATG AGTTTGTATA TTCATCAGAG AG CTTT ATT A CGACATCTAT CATGCTTGGG 600 
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CTCATGTTAA AGCGCCACAA AATTGAAGCA TTATTTTTTG CATTAACAAT GGCATTATCT 
GGAATTTTGA ATCCAGCATT AAAAAATATA TTCGATAGAG AAAGACCTAC ATTGCTGCGT 
TTAATTGATA TAACAGGATT TAGTTTTCCT AGCGGTCATG CTATGGGATC AACTGCATAT 
TTTGGAAGTG GTATCTATCT ATTAAATCGA TTAAATCAAG GTAATTCAAA AGGTATTCTT 
ATAGGGTTAT GTGCAGCTAT GATTTTATTG ATTTCCATAT CACGTGTATA TCTAGGTGTA 
CATTATCCAA CAGATATTAT TGCCGGCATT ATTGGTGGAT TATTTTG c AT TATTTTATCA 
ACGTTATTAC TTAGAAATAA ATTAATAAAT TAAATAGTAA AAAAACAAAA GCAGTAAACC 
TAAAGTGTCG TAAGGGTTTA CTGCTTTTAT AAAACGTTGT TATAACGTAT ATTGTCTTTT 
ACGGGCATAT AAnAGGGGAA TATTTGAnAA TGACCAATCC AACAAGAACG AAACGTTGTG 
GGGGGGATGT TCTATGTGGT ATTGATAATC ATTTTCAACT ACTATTATAC ATTAGTGAGA 
ATCATTGTCA ATT AG AAA CT AAAACTTTTT TTGAATATTT TTTAAGAATA GTAAATAAAA 
CGCATGATTA CG CT ATTTT A GAAAATAAAA AAATTTGTAT TTCTCATTAG AATTAGAATA 
TTTAAAAGTG ATGAGGTTTA AACATTATAT TGTTTACATA CTCCTTTTGA ATTCATACAT 
TATGAAATGT tACTTCCAAG TTCAAAATCG CACATTGAAA TGATGTGTGA AATGTTTAAA 
CTACGGTCAT t TTGTGmAAA TAAAG r TAAT AACTATTCAT TTTACAATAG TGAAAAGTCA 
GTATATGACA ACAATTAATA TTG CGGTAAG GCCTTGTGTT ACAGTATTCT ATATTTAAGT 
ACTG CAATCA GAATTAACAG AATGC CATTA ACTGATTATT AAATATTTGA GTTAATAAAT 
AATTAATGAT TGTAGCTTGA AAAATTTAAA ACATGGTTAT TGATTTGTGA TAAAATTTAA 
ACGTAAACAA ACTAATTTAA AAAGCAACTA TTG TAT AG AA AAATACAAAA TTTAAAATAT 
ATTACCTTAT TAGAAAAA 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12658 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
TGTTTAAACA ATAGGGGGAA TCTTATGATT GAAAAATTAG TAACCTTTTT AAATGAGGTT 
GTTTGGAGTA AGCCATTAGT TTATGGTTTG CTAATTACTG GTGTGCTATT TACATTGCGT 
ATgCG ATTTT TTCAAGTTAG ACATTTTAAA GAAATGATTC GATTAATGTT TCAAGGAGAG 
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GGTACAGGTA ATATTGTCGG TGTATCTACT GCAATATTTA TAGGAGGACC TGGTGCAGTA 300 
TTTTGGATGT GGATTACTGC GTTTTTAGGT GCAAGTAGTG CTTTTATTGA ATCTACACTT 360 

5 

GGTCAAATAT TCAAGAGAGT TGAAAATAAT GAATACCGTG GTGGACCAGC GTATTATATT 42 0 

GAATATGGTA TTGGTGGTAA ATTTGGTAAA ATTTACGGAA TTATCTTTGC TATTGTTACG 48 0 

ATTATCTCAG TAGGTCTATT GCTTCCTGGT GTGCAATCTA ACGCTATAGC AAGTTCTATG 54 0 

10 

CATAATGCGA TTCATGTTCC ACAATGGTTA ATGGGTGGTA TTGTTGTAGT TATTTTGGGA 60 0 

TTAATTATTT TTGGTGGTGT ACGTATTATT GCCAATGTTG CAACAGCCGT TGTACCATTT 66 0 

15 ATGGCAATTA TTTACATACT GATGGCTGTC ATTATCATTT GTATCAATAT ACAAGAAGTG 720 

CCAGCGTTAT TTGCATTAAT TTTCAAATCA GCATTTGGAT TACAATCTGC TTTTGGTGGT 78 0 

ATCGTTGGCG CAATGATAGA GATTGGTGTT AAACGTGGAT TATATTCAAA TGAGGCTGGT 84 0 

20 CAAGGTACAG GTCCACACGC AG CAGCGGCa gcAGaAGTAT CACATCCAAG TAAACAAGGT 9 00 

CTAGTACAAG CATTTTCAGT TTATATTGAT ACATTATTTG TATGTACTGC AACTGCTCTG 96 0 

ATTATACTTA TTTCTGGTAC ATATAATGTG ACTGATGGTA CGGTTAATGC GAATGGCACA 102 0 

25 CCGCATTTAA TTAAAGATGG CGGTATTTAT GTTgAAAATG CAACAGGTAA AGATTATTCA 1080 

GGTACTGCGA TGTATGCACA AGCCGGCATt GATAAAGCGT TCCATGGCAG TGGTTATCAA 114 0 

TTTGATCCTA CTTTCTCTGG CGTAGgTTCG TACTTTATTG cATTTGCTTT ATTCTTCTTT 1200 

30 

GCATTTACTA CAATTTTGTC GTACTACTAC ATTACAGAAA CAAATGTTGC TTATTTAACG 1260 

CGTAATCAAA ATAATCAAGT TTCATCGATA TTTATTAATA TTGCTCGTGT GATTATTTTG 13 20 

TTCGCTACAT TTTACGGTGC AGTTAAAACA GCTGATGTAG CATGGGCATT CGGTGATTTA 13 80 

35 

GGTGTAGGTC TAATGGCTTG GTTAAATATC ATTGCGATTT GGATTTTACA TAAGCCTGCC 1440 

GTAAATGCTT- TAAAAGATTA- -TGAAATTCAA -AAGAAACGTT — TAGGGAACGG— TTATAATGCA 1500 

GTTTATCAAC CTGATCCGAA TAAATTACCT AATGCTGTCT TTTGGTTGAA GACGTATCCA 1560 

40 

GAACGTTTAA AACAAGCACG TGCCAAAAAG TAATCTACTT TTGTTTATAG TATATGTAGT 1620 

GATCATTTGA TAAAAAAGAA AAGTATTGAG AATTTTAGGt GCTCAGAAAT TTGAATTTTA 1680 

45 AAAATATAGT GTCTCTTGGT ACAATAACAA TACAACTACT AGGGGCACTT TTTTATGTCA 174 0 

GAATTTAAAA CTGGTAAGAT TAATAAACAT GTTTTATATA GTAATATTTT AAATAGAGAT 1800 

GTCACGTTAA GTATTTATTT AC C AG AATCT TATAATCAAC TTGTTAAATA TAATGTCATT 1860 

So CTTTGCTTTG ACGGATTAGA TTTTTTACGT TTCGGGAGAA TACAACGTAC ATATGAATCG 1920 

TTAATCAAAG AAGCGCGTAT TGATGATGCG ATCATTGTTG GATTCCATTA TGAAGACGTT 1980 
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GTCGGTAAAG AAATATTGCC ATTTATTGAC TCGACGTTTT CTACACTGAA AGTAGGTAAT 2100

GCAAGGTTAT TAGTAGGGGA TAGTTTAGCG GGTAGTATTG CCTTATTAAC GGCGTTGACC 2160

TATCCAACGA TTTTTAGTCG TGTAGCAATG TTAAGTCCAC ATTCAGATGA AAAAGTATTA 2220

GATAAGCTAA ATCAATGTGC AAATAAAGAA CAATTGACAA TTTGGCATGT CATTGGTCTA 2280

GATGAAAAAG ATTTTACTTT ACCAACAAAT GGTAAGCGTG CCGATTTCTT AACACCGAAT 2340

AGAGAATTAG CTGAACAAAT TAAGAAATAT AATATAACTT ATTATTACGA TGAATTTGAT 2400

GGTGGTCACC AATGGAAAGA TTGGAAACCA TTGCTGTCAG ATATATTATT GTArrrriTA 2460

AGTAAAAACA CAGATGATCA ACTTTATGAA TAATTTACAT TAGTAGATTT AGTATGAATT 2520

GTCTTCATAT AGTCTGGTCT ATAATATAAT TTATAAAAGA TTTTACTGTT TAATTTAATT 2580

TAAATTTGAC GAAATTGCAA AAGATGTATA ATGAATTATT TTTAATGTAA CGGTTTTCAA 2640

AGAAATTTGA TATAATAGCA ATAGGTTAAA CAAAGGAGGA ATTCAGATGA TTTTAGGATT 2700

AGCATTAATT CCATCAAAGT CATTTCAAGA AGCGGTGGAT TCTTACCGTA AAAGATATGA 2760

TAAACAGTAT TCACGAATTA AACCACATGT GACAATTAAA GCGCCATTTG AAATTAAAGA 2820

TGGTGATTTA GATTCTGTCA TTGAACAGGT TAGAGCTCGT ATTAATGGTA TACCAGCAGT 2880

AGAAGTTCAT GCTACAAAAG CTTCTAGCTT CAAACCAACG AACAATGTGA TTTACTTTAA 2940

AGTTGCGAAG ACGGACGACT TAGAAGAATT GTTTAATCGC TTTAATGGAG AAGATTTCTA 3000

TGGAGAAGCT GAACATGTTT TTGTGCCACA CTTTACAATA GCACAAGGAC TATCTAGCCA 3060

AGAATTCGAA GATATTTTTG GTCaAGTAGC ATTAGCTGGG GTAGACCaTA AAGAAATTAT 3120

CGATGAATTA ACTTTGTTAC GTTTTGACGA TGACGAAGAT AAATGGAAAG TTATTGAAAC 3180

GTTTAAATTA GCTTAAGTAA CATAATAGTA TTGTTAATCG TAGTATGTTT GAATTAATAA 3240

GAAAATGGTC ATTTTTATTG AATGTAATAA AAATGACCAT TTTCTTTATT TTAAAATACG 3300

TTTTAACCTT ACTTAGCTTT TTCTCTATTT ACTATAAAGT rGCTTCCATA AAATACAGCT 3360

AAGACTAAAA AGATTAATGC CGAGAAATAA AATGTATTGT TTAAATTGTT GGTAAATTGT 3420

GTAATTAATC CGCCAAATAA TGGCCCTATC ATTGAGCCGA ATCCTTGGAT ACTATTAAAA 3480

ACACCCCAAG TTTCTTCTTG TTCATCTGAT TTGATAAATC GTGCCATAAA GGTATTCCAT 3540

GCTGGTAATA PlI agctaaag CGATAATCCA CAAGATGTGA 3600

ATATTAACAA TCATAGATAG AGTAAAAATT AATATCATGT ATAAAATAAA TCCGCTTAGA 3660

ATAACACCAT ACATAAAGTT TCTGCTGCGG TTATCTATTA GTTTCGATAA AAATAGCATC 3720

GAAACTGCAC AGCCGATACC ACCAATAATG ATTGCAACAG TATATTCAAT TGTGCTTACG 3780
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TGTAAAAGAA TACCAGGGAA CaACAATAAA TGGcGCTTTG TCACATCAAC AATTTGTCTC 3900

AATTGAGCTT TAACTGGACG AGTATTATAA TTTGTTAACT TTACATCGAC AAAATAATAT 3960

AATATCCATG CAATTAAAAC GACTAAAGAC ATCATGAAGG CAAAGCGTGT TGGGTGCACT 4020

TTGATAAGTA GATTCATAAA AACCATACCT ACCAATAGGC CTAACAACCA TGAAAAATAA 4080

ACATAGCCCA TTTGTTTGCC ACGTTTATCT TCTTCAACAC TGGATAACAT AATGACCCAA 4140

ATAGGACTAA CTGCAATACC GAGCATCATA GCACTAAATA TGATTACAAA AGGTGATGCT 4200

GGAAACCAAA TAACTAAAAA TAAACTTGTA AATGCTAAAA TAAATCCAGT CGTTAAAACG 4260

ATTTTTGTGC CGAATTTTTT CAGTAAAAAT CCTATAACAA AGTTTGTAGA TGCATCAGCA 4320

ATAAAATGTA TTGAAAATGC TAGAGACGTT ATTGCTACAG CAATGGATGT AACTGTTGGC 4380

AAGAAATTAA TATAGCTTAG GATATACATG CCTCTCGCAA ATTCCATTAA AAATAAGATA 4440

ATAAGCaTTA AAATGAAATT TTTATGATTA GCGTAATTAT TTAACGAAGA ATCTTGCATA 4500

TAAAGGAACC TTTCCATAAA TCTCTTGTGG TTGTGATGAA 4560

GTCTCGACAT ATTGTCTGTG TAGCATACTT AATTTTATCT lol 1LLH1 Ivj 4620

GTTAGTTAAT TGCTCATTAC CGTTAGTTAA ACTTGCTACA 4680

AGTATCAGCG ATTTTACCAA AACCTTTTTC TTCAAAGTAA AGGGCATTTT CAAGCTCTTG 4740

ACCAGGTGCA GGATTTAGGA AAATCATTGG AATACAACGG GCGAAACCTT CAGTTATTGT 4800

GATACCACCA GGTTTCGTAA TCATAAGTTG ACTTGATGCC ATCCATTCAT TCATGTGTTT 4860

GGTATAACCT AGAATCAATA CATTCTCGTT AGATTTAAAC TTAGCTGTTA AAGAACGCTT 4920

TAGCTCTTTG CTCTTACCAC AAATCATAAC TACTTGTGCA TTTGCaCTTT tCGCTAATAT 4980

ATCAGTAATC ATCGTGTCAA AACCTTTAGA TACACCAAAT GCACCAGCTG aCATTAAAAT 5040

AGTCTGCTTA TCTGGATCTA AGTTCTTGTC TA1TWVCCAC TGCTTTTGAT TAATAGGCGT 5100

TTCAAATTTG TTATCAATAG GAATACCTGT CaCTTTAACT GTTGAAGGAT CAATACCTAC 5160

GTCTATGAAG TCTTGTTTCG TTTCTTTTGT TGCCACATAA TATCTTGTTG AATACGGCGT 5220

AATCCAGTTT TTATGTAAGC GATAGTCTGT CATCACTGTA GCAACTGGAA TATTAATGTT 5280

AAATTGCTCA GTTAGTACCG ACATAACTGG TGTAGGAAAC GTTAATAATA TTAAATCTGG 5340

CTTTTCTTTT ATCAATAAAT TAATTAACTT ATTAAGTCCA TAGTATTTGT AAAAACATTT 5400

GTCTAGTTTA TCTGGGCGGC TGTAATAAAA CCCTTTGTAC ATATTTCTAA AATATTTAAA 5460

GCTATTGATA TACCATTTTT TACAAATAGA AGTCAAAATT GGATGAGCTT CCATAAATAA 5520

ATCGTGCTCA ATGACGCTTA AATGGTCTAG ATTCATATCA TTAAGTTGAT TAACGATACT 5580
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TTGAGTAACC ATTAATAGCC ACCCTCCGTT AGTTTGAAAA TTTTATTTAA GTGTAACTTA 5700 

TTTTACGGCA TTATAAAAGA AATAAAGACG CAAAGTCGTT ACATTTATAG CAATTTTAAT 5760 

CTATAGATGA ATTGATACAA AATAAAACGT TATTTTATAA AGCAATTTAT TGTTCTATGT 5820 

TTTATTTGTA TATTTAAAAT TATCCAGTAT ACAATTATAG CATATTTTTG GAAACAATTA 5880 

TGATATTATA CCATGTTACA AGATGGTTTT AATAATTTAA GATGAGCCAT AATTGTAAAA 5 940 

CTAATT CAT A ATACCGTATG TTTTATTTTT AATAGTAGAA ATTAGAAAAT GCTGATTAGT 6000 

AGGATATAAC AGTGAAATTA TAAATTTATT AACATCAACA AAACGTGTAT AATAAACATA 6060 

TTGTAGAAAA AGGAGCGGTT CAGTTTGGAT GCAAGTACGT TGTTTAAGAA AGTAAAAGTA 6120 

AAGCGTGTAT TGGGTTCTTT AGAACAACAA ATAGATGATA TCACTACTGA TTCACGTACA 6180 

GCGAGAGAAG GTAG CATTTT TGTCGCTTCA GTTGGATATA CTGTAGACAG T CAT AAGTTC 6240 

TGTCAAAATG TAGCTGATCA AGGGTGTAAG TTGGTAGTGG TCAATAAAGA ACAATCATTA 63 00 

CCAGCTAACG TAACACAAGT GGTTGTGCCG GACACATTAA GAGTAGCTAG T ATT CT AG CA 6360 

CACACATTAT ATGATTATCC GAGTCATCAG TTAGTGACAT TTGGTGTAaC GGGTACAAAT 642 0 

25 GGTAAAACTT CTATTGCGAC GATGATTCAT TTAATTCAAA GAAAGTTACA AAAAAATAGT 64 80 

GCATATTTAG GAACTAATGG TTTCCAAATT AATGAAACAA AGACAAAAGG TGCAAATACG 654 0 

ACACCAGAAA CAGTTTCTTT AACTAAGAAA ATTAAAGAAG CAGTTGATGC AGG CGCTGAA 6600 

30 TCTATGACAT TAGAAGTATC AAGCCATGGC TTAGTATTAG GACGACTGCG AGG CGTTGAA 6 660 

TTTGACGTTG CAATATTTTC AAATTTAACA CAAGACCATT TAGATTTTCA TGGCACAATG 6720 

GAAGCATACG GACACGCGAA GTCTTTATTG TTTAGTCAAT TAGGTGAAGA TTTGTCGAAA 6780 

GAAAAGTATG TCGTGTTAAA CAATGACGAT TCATTTTCTG AGTATTTAAG AACAGTGACG 684 0 

CCTTATGAAG TATTTAGTTA TGGAATTGAT GAGGAAGCCC AATTTATGGC TAAAAATATT 6 900 

CAAGAATCTT TACAAGGTGT CAGCTTTGAT TTTGTAACGC CTTTTGGAAC TTACCCAGTA 6960 

AAATCG CCTT ATGTTGGTAA GTTTAATATT TCTAATATTA TGGCGGCAAT GATTGCGGTG 7020 

TGGAGTAAAG GTACATCTTT AGAAACGATT ATTAAAGCTG TTGAAAATTT AGAACCTGTT 7080 

GAAGGGCGAT TAGAAGTTTT AGATCCTTCG TT AC CTATTG ATTTAATTAT CGATTATGCA 714 0 

CATACAGCTG ATGGTATGAA CAAATTAATC GATGCAGTAC AGCCTTTTGT AAAGCAAAAG 7200 

TTGATATTTT TAGTTGGTAT GGCAGGCGAA CGTGATTTAA CTAAAACGCC TGAAATGGGG 7260 

50 CGAGTTGCCT GTCGTG CAGA TTATGTCATT TTCACACCGG ATAATCCGGC AAATGATGAC 7320 

CCGAAAATGT T AACGG CAGA ATTAG CCAAA GGTGCAACAC ATCAAAACTA TATTGAATTT 738 0 
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GTTTTAGCAT CAAAAGGAAG AGAACCATAT CAAATCATGC CAGGGCATAT TAAGGTGCCA 7500 

CATCGAGATG ATTTAATTGG CCTTGAAGCA GCTTACAAAA AGTTCGGTGG TGGCCCTGTT 7560 

GATTAATAAA AGATTTATTG ATGAAGGTAA AACTATTGAT GTTTATTTAT T CG AAGCATT 762 0 

AAATAACCAG ATAATCATTG CTATACCAGA TTGGTTTTGG TCATATCAGA TGGCAATGAC 76 80 

ATTAGATGAA GAAACTTGTT TTGAAGCAAT ACTCATGCAA TTGTTTGTTT TTAAAGAAGA 7740 

GG AAGAGGCA GAATCGATTG CATCACAACT AACAGATTGG ATAGAAACAT ATAAAAAGGA 7 8 00 

GAAAGACTAA TGAACTTAAA GCAAGAAGTT GAGTCTAGAA AGACTTTTGC GATTATTTCA 78 60 

CATCCCGATG CAGGGAAAAC AACGTTAACT GAAAAACTAT TGTACTTCAG TGGTGCTATT 7920 

CGTGAAGCGG GTACAGTTAA AGGGAAGAAG ACTGGTAAAT TTGCGAGAAG TGACTGGATG 79 80 

AAAGTTGAAC AAGAGCGTGG TATTTCTGTA ACTAGTTCAG TAATGCAATT TGATTACGAT B040 

GATTATAAAA TCAATATCTT AGATACACCA GGACATGAAG ACTTTTCAGA AGATACGTAT 8100 

AGAACATTAA TGGCAGTTGA CAGTGCTGTC ATGGTCATAG ACTGTGCAAA AGGTATTGAA 8160 

CCACAAACAT TGAAGTTATT TAAAGTTTGT AAAATGCGTG GTATTC CAAT CTTTACATTC 8220 

25 ATTAATAAAT TAGACCGAGT AGGTAAAGAA CCATTTGAAT TATTAGATGA AATCGAAGAG 82 80 

ACATTAAATA TTGAAACATA CCCTATGAAT TGGCCAATTG GTATGGGACA AAGTTTCTTT 834 0 

GGCATCATTG ATAGAAAGTC TAAAACAATT GAACCATTTA GAGATGAAGA AAATATATTA 84 00 

30 CATTTGAATG ATGATTTTGA GTTGGAAGAA GATCATGCAA TTACAAATGA TAGTGATTTT 84 6 0 

GAACAAGCGA TTGAAGAATT AATGTTGGTT GAAGAAGCGG GTGAAGCCTT TGATAATGAC 8520 

GCGCTGTTGA GTGGAGACTT AACACCTGTA TTTTTCGGTT CAGCTTTAGC TAACTTTGGT 8580 

GTACAAAATT TCTTAAATGC ATATGTTGAT TTTGCGCCAA TGCCAAATGC GAGACAAACA 864 0 

AAAG&AGACG TTGAAGTAAG CCCGTTTGAT GATTCATTTT CAGGATTTAT CTTTAAAATT _ 8700 

CAAGCCAACA TGGACCCTAA ACACCGTGAT AGAATTGCCT TTATGCGTGT CGTTAGTGGT 8760 

GCATTTGAAC GTGGTATGGA TGTTACTTTG CAACGTACTA ATAAAAAGGA AAAGATCACA 8820 

CGTTCAACGT CATTTATGGC AGACGATAAA GAAACTGTGA ATCATGCTGT AGCAGGCGAT 88 80 

ATCATTGGAC TATATGATAC TGGTAATTAT CAAATTGGAG ATACTTTAGT TGGTGGAAAA 8940 

CAAACCTACA GTTTCCAAGA TTTACCACAA TTTACGCCAG AAATTTTTAT GAAAGTTTCT 9000 

GCTAAAAACG TCATGAAACA GAAGCATTTC CATAAAGGTA TTGAACAATT AGTACAAGAA 9060 

GGTG CGATTC AATACTATAA AACATTACAC ACAAACCAAA TTATTTTAGG TGCTGTTGGT 9120 

CAGTTACAAT TTGAAGTTTT CGAACATAGA ATGAAAAACG AATATAATGT TGATGTTGTT 9180 






EP0 786 519 A2 





AAGATGAACA CATCAAGATC GATTTTAGTG AAAGATAGAT ATGACGATTT AGTATTCTTA 9300

TTTGAAAATG AATTTGCAAC AAGATGGTTT GAAGAGAAAT TCCCTGAAAT TAAATTGTAT 9360

AGTTTACTTT AACAGCTCAA TTGTATAATC GAATTTGTTA CATTAAAAAT AATTGTTTCG 9420

TTGAAGAAAA ATAAATTGTA TATTTTAAAA GAAAAAGGTA TACTATGATG TATCAAATGA 9480

ATAACCTATG GCATTTTGTC AGAGGGGAGT AACTTAAGAA TCATGACCGT ATAAATGaTT 9540

CGACACTTTA TCGTCATTAC GArGATATCT TCCGGTAAAG TGGGCAATTT AAATTGCTTA 9600

GTGAGACCTT TGCTATTTAT TTAGCATAGG Tcrri-iTGTT TGTACTTAAC TTATTTATTT 9660

AAAGGAGTTG TACATGTTAA TGGATCCAAG TTTGATCTTA CCTTATTTAT GGGTACTTGT 9720

CGTTTTAGTA TTT1TAGAAG GCTTATTAGC AGCAGATAAC GCGATTGTTA TGGCTGTAAT 9780

GGTTAAGCAC TTACCACCCG AACAACGTAA AAAAGCTTTG TTTTACGGTT TGTTAGGTGC 9840

ATTTGTATTT AGATTTTTAG CATTATTCTT AATTAGTATT ATCGCGAACT TTTGGTTTAT 9900

TCAAGCTGCA GGAGCGGTTT ACTTAATTTA TATGTCAATC AAAAATCTGT GGCAGTTCTT 9960

TAAACACCCA GAAATTGAAA GTCCTGAAGC TGGAGATGAT CATCATTATG ATGAATCTGG 10020

TGAAGAGATT AAAGCAAGTA ACAAATCATT CTGGGGAACT GTGTTGAAAA TAGAATTTGC 10080

AGATATCGCA TTTGCCATTG ATTCTATGCT TGCTGCTTTA gCTATTGCTG TAACACTTCC 10140

TAAAGTTGGT ATTCACTTTG GTGGTATGGA CTTAGGTCAG TTCGTAGTCA TGTTCCTAGG 10200

TGGAATGATT GGTGTTATTC TAATGCGTTA TGCAGCAACA TGGTTTGTAG AGCTATTAAA 10260

CAAATATCCA GGACTTGAAG GTGCAGCCTt CGCGATCGTT GGTTGGGTAG GTGTTAAATT 10320

AGTTGTCATG GTATTAGCGC ACCCAGACAT CGCTGTATTG CCTGAGCACT TCCCACATGG 10380

CGTATTATGG CAATCTATTT TCTGGACAGT ACTAATTGGA TTAGTAATTA TCGGTTGGTT 10440

AGGTTCAGTT GTTAAAAATA AAAAATCGCA TAAATAATTG ATGTGAAGCG GACAATCTTA 10500

ATTTAGTTTA AGGTTGTCCT TTTTCATTTA ATTGAGTGAT TTATGAAAAA TGGATTTTGA 10560

AGAATGTGAA TCAAAAGATG CGATATAGTA TTAAGAAAAT GTGCCTTTTA TATTTAGCAT 10620

TTTTTCAATA GAAATTATAT AGATTTTAAA GCAAATTAGG TGTTAATGTG TCATAATGAT 10680

AAGTGATTTT ATTGAATGGA GTGGACATTA GTGGATATTG GTAAAAAACA TGTAATTCCT 10740

AAAAGTCAGT nACCsaCGTA AGCGTCGTGA ATTCTTCCAC AACGAAGACA 10800

TTTAAATCAA CATCAAGATA AACAAAATAT AGATAATACA ACATCAAAAA AAGCAGATAA 10860




GCAAATACAT 


AAAGATTCAA 


TTGATAAGCA 


CGAACGTTTT 


AAAAATAGTT 


TATCATCGCA 


10920 




TTTAGAACAG 


AG AAA C CGTG 


ATGTTAATGA 


GAATAAAGCT 


GAAGAAAGTA 


AAAGTAATCA 


10980 
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AAATTCATTA GATTCAGTGG AC CAAGAT AC AGAGAAATCA AAATATTATG AGCAAAATTC 111 00 

TGAAGCGACT TT AT CAACT A AATCAACCGA TAAAGTAGAA TCAACTGAAA TGAGAAAGCT 11160 

AAGTTCAGAT AAAAACAAAG TTGGTCATGA AGAGCAACAT GTACTTTCTA AACCTTCAGA 11220 

ACATGATAAA GAGACTAGAA TTGATTCTGA GTCTTCAAGA ACTGATTCAG ACAGCTCGAT 11280 

GCAGACAGAG AAAATAAAAA AAGACAGTTC AGATGGAAAT AAAAGTAGTA ATCTGAAATC 11340 

TGAAGTAATA TCAGACAAAT CAAATACAGT ACCAAAATTG TCGGAATCTG ATGATGAAGT 114 00 

AAATAATCAG AAGCCATTAA CTTTACCGGA AGAACAGAAA TTGAAAAGAC AGCAAAGTCA 114 60 

AAATGAGCAA ACAAAAACCT ATACATATGG TGATAGCGAA CAAAATGACA AGTCTAATCA 11520 

TGAAAATGAT TTAAGTCATC ATATAC CATC GATAAGTGAT GATAAAGATA ACGTCATGAG 11580 

AGAAAATCAT ATTGTTGACG ATAATCCTGA TAATGATATC AATACACCAT CATTATCAAA 11640 

AACAGATGAC GATCGAAAAC TTGATGAAAA AATTCATGTT GAAGATAAAC ATAAACAAAA 11700 

TGCAGACTCG TCTGAAACGG TGGG AT AT CA AAGTCAGTCA ACTGCATCTC ATCGTAGCAC 11760 

TGAAAAAAGA AATATTTCTA TTAATGACCA TGATAAATTA AACGGTCAAA AAACAAATAC 11820 

AAAGACATCG GCAAATAATA ATCAAAAAAA GGCTACATCA AAATTGAACA AAGGGCGCGC 11880 

TACGAATAAT AATTATAGTG ACATTTTGAA AAAGTTTTGG ATGATGTATT GGCCTAAATT 11940 

AGTTATTCTA ATGGGTATTA TTATTCTAAT TGTTATTTTG AATGCCATTT TTAATAATGT 12 000 

GAACAAAAAT GATCGCATGA ATGATAATAA TGATGCAGAT GCTCaAAAAT ATACGACAAC 12060 

GATGAAAAAT GCCAATAACA CAGTTAAATC GGTCGTTACA GTTGAAAATG AAACATCAAA 12120 

AGATTCmTCA TTACCTAAAG ATAAAGCATC TCaAGACGAA GTGGGATCAG GTGTTGTATA 12180 

TAAAAAATCT GGAGATACGT TATATATTGT TACGAATGCA CACGTTGTCG GTGATAAAGA 12240 



AAATCaAAAA ATAACTTTCT CGAATAATAA AAGTGTTGTT GGG AAAGTGC TTGGTAAAGA 12300 

TAAATGGTCA GATTTAGCTG TTGTTAAAGC AACTTCTTCA GACAGTTCAG TGAAAGAGAT 12360 

AGCTATTGGA GATTCAAATA ATTTAGTGTT AGGAGAGCCA ATATTAGTCG TAGGTAATCC 12420 

ACTTGGTGTA GACTTTAAAG GCACTGTGAC AGAAGGTATT ATTTCAGGTC TGAACAGAAA 12480 

TGTTCCTATT GATTTCGATA AAGATAATAA ATATGATATG TTGATGAAAG CTTTCCAAAT 12540 

TGATGCATCA GTAAATCCAG GTAACTCGGG TGGTGCTGTC GTCAATAGAG AAGGAAAATT 12600 

AATAGGTGTA GTTGCAGCTA AAATTAGTAT GCCAAACGTT GAAAnTATGT CATTTGCA 12658

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6048 base pairs
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(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
TGAAATnGAA TAGTACTATT GCAAGTGTAA AGAGGTTAAT TTTTGCCnCA CGCGGGACTT 
AAAAAGGCAA CCACTGGTTG TGACATATCC TTATTTACAT TTATAAATAT AAGGAGGAGG 
TAGTAGTGAA AGACTTATTG CAAGCACAGC AAAAGCTTAT ACCGGATCTC ATAGATAAAA 
TGTATAAACG TTTTTCTATT CTTACTACTA TCTCAAAAAA TCAGCCTGTC GGACGTCGAA 
GTTTAAGCGA ACATATGGAT ATGACTGAAC GTGTACTGCG TTCTGAAACA GATATGCTTA 
AGAAACAAGA TTTGATAAAA GTTAAG CCTA CCGGAATGGA AATTACAGCT GAAGGTGAGC 
AACTGATTTC GCAATTGAAA GGTTACTTTG ATATCTATGC AGATGATAAT CGTCTGTCAG 
AAGGTATTAA GAATAAATTT CAAATTAAGG AAGTTCATGT TGTTCCTGGT GATGCTGATA 
ATAGTCAATC TGTTAAAACA GAATTAGGTA GACAAGCAGG TCAATTACTT GAAGG CAT AT 
TACAAGAAGA CGCGATAGTT GCTGTAACTG GCGGATCCAC GATGGCATGT GTTAGTGAAG 
CAATTCATTT ATTACCATAT AATGTATTCT TCGTACCAGC CAGAGGTGGA CTAGGCGAAA 
ATGTTGTCTT TCAGGCAAAC ACAATTGCAG CCAGTATGGc aCAACAAGCT GGCGGTTATT 
ATACGACGAT GTATGTACCT GATAATGTCA GTGAAaCAAC ATATAATACA TTGTTGTTAG 
AGCCATCAGT CATAAACACT TTAGACAAAA TTAAACAAGC AAACGTTATA TTACACGGCA 
TTGGTGATGC GCTGAAGATG GCGCATCGAC GTCAATCACC TGAAAAGGTC ATTGAACAAC 
TTCAACATCA TCAAGCTGTC GGAGAGGCAT TTGGTTATTA TTTTGATACA CAAGGTCAAA 
TTGTCCATAA GGTTAAAACA ATTGGACTTC AATTAGAAGA CCTTGAATCA AAAGACTTTA 
TTTTTGCAGT TGCAGGAGGC AAATCGAAAG GTGAAGCAAT TAAAGCATAC TTGACGATTG 
CACCCAAGAA TACAGTGTTA ATCACTGATG AAGCCGCAGC AAAGATAATA CTTGAATAAG 
AGATAAAAAG TTTAATACTT TTTAAATATC ATTTTAAAGG AGGCCATTAT AATGGCAGTA 
AAAGTAGCAA TTAATGGTTT TGGTAGAATT GGTCGTTTAG CATTCAGAAG AATTCAAGAA 
GTAGAAGGTC TTGAAGTTGT AGCAGTAAAC GACTTAACAG ATGACGACAT GTTAG CGCAT 
TTATTAAAAT ATGACACTAT GCAAGGTCGT TTCACAGGTG AAGTAGAGGT AGTTGATGGT 
GGTTTCCGCG TAAATGGTAA AG AAG TT AAA TCATTCAGTG AACCAGATGC AAGCAAATTA 
CCTTGGAAAG ACTTAAATAT CGATGTAGTA TTAGAATGTA CTGGTTTCTA CACTGATAAA 
GATAAAGCAC AAG CT CAT AT TG AAG CAGG C GCTAAAAAAG TATTAATCTC AGCACCAGCT 




ACAGTTGTTT CAGGTG CTTC ATGTACTACA AACTCATTAG CACCAGTTGC TAAAGTTTTA 16 80 

AACGATGACT TTGGTTTAGT TGAAGGTTTA ATGACTACAA TTCACGCTTA CACAGGTGAT 1740 

CAAAATACAC AAGACGCACC TCACAGAAAA GGTGACAAAC GTCGTGCTCG TGCAGCGGCA 1800 

GAAAACATCA TCCCTAACTC AA CAGGTG CT GCTAAAGCTA TCGGTAAAGT TATTC CTGAA I860 

ATCGATGGTA AATTAGATGG TGGTGCACAA CGTGTTCCTG TAGCTACAGG TTCATTAACT 1920 

GAATTAACAG TAGTATTAGA AAAA CAAG AC GTAACAGTTG AACAAGTTAA CGAAGCTATG 1980 

AAAAATGCTT CAAACGAATC ATTCGGTtAC ACTGAAGACG AAATCGTTTC TTCAGACGTT 2040 

15 G TAGGTATG A CTTACGGTTC ATT ATT CGAC GCTACACAAA CTCGTGTAAT GTCAGTTGGC 2100 

GACCGTCAAT TAGTTAAAGT TGCAGCTTGG TATGATAACG AAATGTCATA TACTGCACAA 2160 

TTAGTTCGTA CATT AG CAT A CTT AG CTGAA CTTTCTAAAT AATTTTAGTA TAGTTTTTAT 2220 

20 TCAAATACGC TAGTG CTCAG AACTATTTAG CATTAATTAA AG CTT ATG AG TAAG CGGGG A 2280 

GCACAAACGC TTCTCCGCTT ATTTTTATAT AAAATTTCCT AATTACAAGG AGGAAACACC 2340 

ATGGCTAAAA AAATTGTTTC TGATTTAGAT CTTAAAGGTA AAACAGTCCT AGTACGTGCT 24 00 

GATTTTAACG TACCTTTAAA AGACGGTGAA ATTACTAATG ACAACCGTAT CGTTCAAGCT 24 6 0 

TTACCTACAA TTCAATACAT CATCGAACAA GGTGGTAAAA TCGTACTATT TTCACATTTA 2520 

GGTAAAGTGA AAGAAGAAAG TGATAAAGCA AAATTAACTT TACGTCCAGT TGCTGAAGAC 2580 

TTATCTAAGA AATTAGATAA AGAAGTTGTT TTCGTAC CAG AAACACGCGG CGAAAAACTT 264 0 

GAAGCTGCTA TT AAAG AC CT TAAAGAAGGC GACGTATTAT TAGTTGAAAA TACACGTTAT 2700 

35 GAAGATTTAG ACGGTAAAAA AGAATCTAAA AATGATC CAG AATTAGGTAA ATACTGGGCA 2760 

TCTTTAGGTG ATGTGTTTGT AAATGATGCT TTTGGTACTG CGCATCGTGA GCATGCATCT 2820 

AATOTTGGTA TTTCTACACA TTTAGAAACT GCAG CTGG AT TCTTAATGGA TAAAGAAATT 2880 

40 AAGTTTATTG GCGGCGTAGT TAACGATCCA CATAAAC CAG TTGTTGCTAT TTTAGGTGGA 2940 

G CAAAAG TAT CTGACAAAAT TAATGTCATC AAAAACTTAG TTAACATAGC TGATAAAATT 3000 

ATCATCGGCG GAGGTATGGC TTATACTTTC TTAAAAGCGC AAGGTAAAGA AATTGGTATT 3060 

TCATTATTAG AAGAAGATAA AATCGACTTC GCAAAAGATT TATTAGAAAA ACATGGTGAT 3120 

AAAATTG TAT TACCAGTAGA CACTAAAGTT GCTAAAGAAT TTTCTAATGA TGCCAAAATC 3180 

ACTGT AG T AC CATCTG ATT C AATTCCAGCA GACCAAGAAG GTATGGATAT TGGACCAAAC 3240 

ACTGTAAAAT T ATTTG CAG A TGAATTAGAA GGTGCGCACA CTGTTGTATG GAATGGACCT 3300 

ATGGGTGTAT TCGAGTTCAG TAACTTTGCA CAAGGTACAA TTGGTGTATG T AAAG C AATT 3360 
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TCTTTAGGTT 


TTGAAAATGA 


CTTCACTCAT 


ATTTCAACTG 


GTGGCGGCGC 


GTCATTAGAG 


34 80 


TACCTAGAAG GTAAAGAATT GCCTGGTATC AAAGCAATCA ATAATAAATA ATAAAGTGAT 3540

AGTTTAAAGT GATGTGGCAT GTTTGTTTAA CATTGTTACG GGAAAACAGT CZACAAGATGA 3600

CATCGTGTTT CATCACTTTT CAAAAATATT TACAAAACAA GGAGTGTCTT TAATGAGAAC 3660

ACCAATTATA GCTGGTAACT GGAAAATGAA CAAAACAGTA CAAGAAGCAA AAGatTCGTC 3720

AATACATTAC CAACACTACC AGATTCAAAA GAAGTAGAAT CAGTAATTTG TGCACCAGrA 3780

ATTCAATTAG ATGCATTAAC TACTGCAGTT AAAGAAGGAA AAGCACAAGG TTTAGAAATP 3840

GGTGCTCAAA ATACGTATTT CGAAGATAAT GGTGCGTTCA CAGGTGAAAC 3900

GCATTAGCAG ATTTAGGCGT TAAATACGTT GTTATCGGTC ATTCTGAAOG TPGTfiAATTA 3960

TTCCACGAAA CAGATGAAGA AATTAACAAA AAAGCGCACG CTATTTTPAA ArATGGAATG 4020

ACTCCAATTA TATGTGTTGG TGAAACAGAC GAAGAGCGTG AAAGTGGTAA AGCTAAPGAT 4080

GTTGTAGGTG AGCAAGTTAA GAAAGCTGTT GCAGGTTTA1* CTGAAGATCA 4140

GTTGTAATTG CTTATGAACC AATCTGGGCA ATCGGAACTG GTAAATCATC AArATPTflAA 4200

GATGCAAATG AAATGTGTGC ATTTGTACGT CAAACTATTG CFGACTrTVATT* 4260

GTATCAGAAG CAACTCGTAT TCAATATGGT GGTAGTGTTA AACCTAAPTIA 4320

TACATGGCAC AAACTGATAT TGATGGGGCA TTAGTAGGTG GCGCATCACT TAAAGTTGAA 4380

GATTTCGTAC AATTGTTAGA AGGTGCAAAA TAATCATGGC TAAGAAACCa 4440

TTATTTTAGA TGGTTTTGCG AACCGCGAAA GCGAACATGG TAATGCGGTA AAATTAGCAA 4500

ACAAGCCTAA TTTTGATCGT TATTACAACA AATATCCAAC GACTCAAATC GAAGCGAGTC? 4560

GCTTAGATGT TGGACTACCT GAAGgACAAA TGGGTAACTC AGAAGTTGGT CATATC1AATA 4620

TCGGTGCAGG ACGTATCGTT TATCAAAGTT TAACTCGAAT CAATAAATCA ATTGAAGAPG 4680

GTGATTTCTT TGAAAATGAT GTTTTAAATA ATGCAATTGC ACACGTGAAT TCACATGATT 4740

CAGCGTTACA CATCTTTGGT TTATTGTCTG ACGGTGGTGT ACACAGTCAT TACAAArATT 4800

TATTTGCTTT GTTAGAACTT GCTAAAAAAC AAGGTGTTGA AAAAGTTTAC GTACACJGPAT 4860

TTTTAGATGG CCGTGACGTA GATCAAAAAT CCGCTTTGAA ATACATCGAA RAfiAPTnAAri 4920

CTAAATTCAA TGAATTAGGC ATTGGTCAAT TTGCATCTGT GTCTGGTCGT TATTATGCAA 4980

TGGATCGTGA CAAACGTTGG GAACGTGAAG AAAAAGCTTA CAATGCTATT CGTAATTTTG 5040

ATGCCCCAAC TTATGCAACT GCCAAAGAAG GTGTAGAAGC AAGCTATAAT GAGGGCTTAA 5100

CTGACGAATT CGTAGTACCA TTCATCGTTG AGAATCAAAA TGACGGTGTT AATGATGGAG 5160
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CGAACAGAGC ATTCGAAGGC TTTAAAGTTG AACAAGTTAA AGACTTATTC TATGCAACAT 5280 

TCACTAAGTA TAATGACAAT ATCGATGCGG CTATCGTCTT CGAAAAAGTT GATTTAAATA 5340

ATACAATTGG TGAAATTGCA CAAAATAACA ATTTAACTCA ATTACGTATT GCAGAAACTG 5400

AAAAATACCC TCACGTTACT TACTTTATGA GTGGTGGACG TAACGAGGAA TTTAAAGGTG 5460

AACGCCGTCG TTTAATTGAT TCACCTAAAG TTGCAACGTA TGACTTGAAA CCAGAAATGA 5520

GTGCTTATGA AGTTAAAGAT GCATTATTAG AAGAGTTAAA TAAAGGTGAC TTGGACTTAA 5580

TTATTTTAAA CTTTGCTAAC CCTGATATGG TTGGACATAG TGGTATGCTT GAGCCGACAA 5640

TCAAAGCAAT CGAAGCGGTT GATGAATGTT TAGGAGAAGT GGTTGATAAG ATTTTAGACA 5700

TGGACGGTTA TGCAATTATT ACTGCTGACC ATGGTAACTC TGATCAAGTA TTGACGGaTG 5760

ATGATCAACC AATGACTACG CAwACAACGA ACCCAGTACC AGTGATTGTA ACAAAAGAAG 5820

GCGTTACACT TAGAGAAACT GGTCGCTTAG GTGACTTAGC ACCTACATTA TTAGATTTAT 5880

TAAATGTAGA ACAACCTGAA GATATGACAG GTGAaTCTTT AATTAAACAC TAATATTGTA 5940

AAAGATGTTA AGTAAACGCT TAATGACACT TATTTTTTGA AAATAATAGT AATATCnTTT 6000

TGTTAAATGA AAGAATAAAG CTATAATAAT TATAGAATAA CTATTTAn 6048

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5602 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:



AAAGAAGTGC AAGATATCAT CGCATTAATT AAGTCGTTAC AAAgTGTAAT TGTAGACaTC 60

GCTTCCAATA ATGTTGATAC AATTATGCCT GGTTATACTC ATTTACAGCG TGCACAGCCA 120

ATTTCATTTG CACATCATAT TATGACTTAT TTTTGGATGT TACAACGAGA CCAACAACGA 180

TTTGAAGATA GTTTAAAACG AATCGATATT AATCCTTTAG GTGCAGCAGC CTTAAGTGGT 240

ACCACATACC CTATCGATAG ACACGAGACA ACAGCATTGT TGAACTTTGG CAGTCTCTAT 300

GAGAATAGCC TAGATGCTGT TAGTGACAGA GACTATATTA TTGAAACATT GCATAATATT 360

TCTTTAACGA TGGTTCACTT ATCACGCTTT GCAGAGGAAA TTATTTTCTG GTCCACAGAC 420

GAAGCTAAAT TCATTACATT ATCAGATGCA TTTTCAACTG GCTCATCTAT TATGCCACAA 480

AAGAAAAATC CTGATATGGC AGAATTAATT AGAGGTAAAG TTGGTCGAAC GACTGGTCAT 540
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GAAGATAAAG AAGGTTTATT CGATGCTGTC CATACAATTA AAGGTTCTTT ACGTATTTTC 
GAAGGTATGA TTCAAACGAT GACAATTAAT AAAGAACGAC TCAATCAAAC TGTTAAAGAA 

GATTTTTCAA ATGCAACGGA ACTAGCAGAT TATTTAGTAA CTAAAAATAT TCCATTTAGA 78 0 

ACTGCACATG AAATTGTAGG AAAAATCGTC TTAGAATGTA TACAACAAGG TCATTATTTA 84 0 

TTAGATGTTC CTTTAGCAAC ATATCAACAA CATCATTCTA GTATTGATGC CGATATTTAC 900 

GATTATTTGC AGCCTGAAAA TTGTTTAAAA CGACGTCAAA GTTACGGTTC AACAGGTCAA 96 0 

TCATCGGTCA AACAACAACT TGATGTTGCT AAACAATTAC TATCACAATA AATACGTTAA 102 0 

15 TCTACCTACC CACAATGTCT ATTAAAATTA CATTGTGGGT ATTTTAATGC TCTCTTCGTC 108 0 

TTGTTGAACA TCACATTTTT AAGATTCCTA AAATGTTTGA TAATTCTTTT AAATTTATAT 114 0 

TACAAAAATG TTATAAATTG TAAAAGAAAT GTGTAAAGCG TTTTCACAAG CAGGTTTTTG 12 00 

TAGTATTTTA AAATTGTTAG ACTACAAATA AAGAGATGAA AGGATAAAGA CTATGACTAA 12 60 

CTCTTCGAAA AGCTTCACTA AATTTATGGC TGCTTCTGCT GTTTTTACTA TGGGATTTTT 13 2 0 

ATCAGTACCT ACTGCTGGCG CTGAACAAAC AAATCAAATT GCAAATAAAC CTCAGGCTAT 13 80 

TCAATGGCAT ACAAATTTAA CGAATG AG CG ATTCACTACT ATCGCACATC GTGGCGCAAG 144 0 

TGGCTATGCA C C CG AG CAT A CGTTTCAAGC ATATGATAAG AGTCATAATG AGTTAAAAGC 1500 

ATCTTATATC GAAATTGATT TACAACGTAC CAAAGATGGC CATTTAGTTG CTATGCATGA 1560 

TGAAACTGTT AACCGTACAA CAAATGGACA CGGTAAAGTT GAGGATTATA CCCTTGATGA 16 20 

ATTAAAACAG TTAGATGCAG GAAGTTGGTT TAATAAAAAA TATCCAAAAT ACGCAAGAGC 16 80 

35 AAGTTATAAA AATGCTAAAG TACCCACTTT AGATGAAATT TTAGAACGTT ATGG CCCGAA 1740 

TGCAAACTAT TATATTGAAA CAAAGTCACC TGATGTATAC CCAGGAATGG AAGAACAATT 1800 

ATTAGCTTCA TTGAAAAAGC ATCACCTTTT AAATAACAAT AAATTAAAAA ATGGACATGT 1860 

40 AATGATTCAA TCATTTTCTG ACGAAAGTTT AAAGAAAATT CATCGTCAAA ATAAGCATGT 192 0 

G CCATTAGT A AAATTAGTTG ATAAAGGTGA ACTACAACAA TTTAACGACC AACGCTTAAA 1980 

AGAGATACGC TCTTATG CG A TTGGATTAGG TCCTGATTAT ACAGATTTAA CTGAACAAAA 2 04 0 

TACCCATCAT TTAAAAGACT TAGGATTTAT AGTACATCCT TATACAGTGA ATGAAAAAGC 2100 

TGATATGTTA CGATTAAATA AATATGGCGT TGATGGTGTC TTTACAAATT TCGCTGATAA 2160 

ATATAAAGAA GTCATTAAGT AGTAATGTTA AACTAGAAAA CATAAATACA AAAATAT AG C 2220 

TATTACTATA AAAAACAGCA GTAAGATATT TCCAAATTGA AATTATCCTA CTGCTGTCTT 22 80 

TTTGGGAGTG GGACAGAAAT GATATTTTCG CAAAATTTAT TTCGTCGTCC CACCCCAACT 234 0 
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TTGTCTGTAG AAATTGAGGA GCTAATTTCT CTGTGTCGGG GCTCCACCCC AACTTGCACA 24 60 

CTATTGTAAG CTGACTTTCC GCCAGCCTCT GTGTTGGGGC CCCGCCAACT TGCACACTAT 2520 

5 

TGTAAGCTGA CTTTCCACCA GCCTCTGTGT TGGGGCCCCG ACTATTTTTG AAAAGAGCGT 25 80 

GTTACACGGG CATTGTTTTA CAGTCAACTA CTGCTAAAAT AAAATTAACG AGCTTAGGGC 264 0 

TTTGTTTTCT GTCCCAAGCT CGTTAAATCA CATATGATAA TTAATTATGC CCAACCACGA 2700 

10 

TATCTAGCTG CTTCTGCTGT ACGTTTAATA CCTATGATAT ATGCTG CAAG TCTCATATCT 276 0 

ATTTTTCGGT TTTGAGACAA TTCGTAAATC GTATCAAATG CCGCTTCTAA TTTTTCACGT 2820 

75 AGCTTTTCAT TAACTTCTTC TTCAGACCAA TAATAACCTT GATTATTTTG TACCCATTCG 2880 

AAGTAAGAAA CCGTtACACC ACCAGCACTT GCTAATACGT CTGGAACTAA TAATATACCA 294 0 

CGTTCAGTTA AAATACGTGT TGCTTCTGGT GTTGTAGGTC CATTAGCAGC TTCAACAACG 3000 

20 ATACTAGCTT T AAT AT CATG TGCATTGTCT TCTG TAATTT GGTTTGAAAT AGCCGCTGGT 3060 

ACTAAAATGT CACAATCTAA TTCAAACAAT TCTTTATTTG AGATTGTTTC TTCAAATAAA 3120 

TTTGTTACCG TACCAAAACT ATCACGACGG TCTAATAAAT AATCTAT AT C TAAGCCATTT 3180 

25 

GGATCGTGTA ATGCACCGTA AGCATCAGAG ATACCTACAA TTTTTGCACC TAAATCATAT 324 0 

AAGAATTTAG CTAAGAAACT TCCGGCATTA CCGAAACCTT GAATAACAAC CTTGGCACCT 33 00 

TCAATTTGCA TATTACGACG TTTTGCAGCT TGTTCAATTG CAATAACTAC ACCTAGTGCA 33 60 

30 

GTTGATCTGT CGCGTCCATG AGAACCACCC AATACAATTG GTTTACCTGT GATGAAACCT 3420 

GGTGAATTAA ATTTATCTAA TG CACT AT AT TCATCCATCA TCCAAGCCAT AATTTGTGAG 34 80 

35 TTTGTAAATA CATCTGGTGC TGGAATATCT TTGTTCGGAC CTACGAATTG TGAAATTGCT 354 0 

CTTACATATC CGCGTGATAA ACGTTCAACT TCATGAATGC TCATTTGACG TGGATCACAA 3600 

ACGATACCAC CCTTACCACC ACCGTATGGT AAGTTTACAA TGCCACATTT CAAAGTCATC 366 0 

40 CACATTGATA ATGCTTTTAC TTCTTCTTCA TCAACATCTG GGTGGAAACG CACGCCCCCT 3720 

TTTGTTGGTC CAACAGCATC ATTATGTTGC GCACGGTAAC CTGTGAATGT TTTTACTGTG 3780 

CCATCATCCA TTCGTACAGG GATACGCACT TGTAACATTC TTAAAGGTTC TTTAATTAAA 384 0 

45 

TCGTACATTC CTtCGTCAAA TCCCAATTTA TGCAATGCTT CTTTAATAAT TCCTTGAGTA 3 900 

GAAGTTACTA AATTATTGTT CTCAGTCATG ATCCTTTTCG CCTCTTCTTT ACCTAATGAT 3960 

TTCGCTTTCA AACATATTGT AACATAACGT ATTCCTTTTT AAAGCCCTTA CAAACTGATT 4 020 

SO 

GTTACAACTT TTTGACATTA TTGAAATACA TGTCTTATTT TTTCAAGTGC AAGGTCCAAT 4 080 

TCTTCTTTAG TAATAATTAA TGGTGGTGCA AAACGAATGA CAGTATCATG CGTTTCTTTA 414 0 
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ACACCTATAA ACAAACCACG TCCACGGACT TCTTTAATTG ATGGATGATC AATTTGCTTT 4260 

AATTGTTCTT TAAAATAATC TCCTAATTCT AAAGAGCGGC CTGGTAAATC CTCATCAACG 4320 

ATAACATCTA ATGCAGCAAT TGATGCAGCA CAAGCAAGTG G ATT AC CAC C AAATGTTGAA 4330 

CCATGTGAGC CAGGTGTAAA GACATCTAAT ACTTCTTTAT CTGCTAATAC AACAGAAATT 444 0 

GGGAAGACTC CACCACCTAG TGCTTTACCT AAAATATAGA CATCAGGTTT TACATTATCC 4 500 

CAATCCGTAG CAAATAATTT ACCCGAACGA CCTAATCCTG CTTGGATTTC GTCAGCAATA 4 560 

AATAAGACAT TATGTTCATC ACATAATTCT CTAATTGCTT TCAAATATCC TTCTGGCGGT 4620 

ATATTTATAC CCGCTTCACC TTGAATTGGT TCTACTAAAA CTGCTGCAGT ATTTTCATTA 4 680 

ATTGCAGCTT TCAATGCATC TACATCTCCA AAATCAACTT TTCTAAATCC ATCTAATAAC 474 0 

GGACCATAAC CACGTTGGTA TTCTGCTTCT GAAGATAATG AAACTGGCGC CATTGTTCGA 4800 

CCATGGAAGT TACCATTAAA TGCAATGATT TCTG CTTTAT TTGGCTCAAT TCCTTTAACA 4 860 

TCGTATGCCC AGCGTCGTGC TGCTTTCAAA GCTGTTTCTA CTGCTTCAGC ACCTGTATTC 4 920 

ATTGGTAAAG CTTTATCTTT ACCTG CCAGT TTACAAATTT TTTCGTACCA TTCACCTAAG 4 980 

25 TTATCACTAT GAAAAGCACG TGAAACTAAA GTCACTTTAT CAGCTTGATC TTTTAATGCT 504 0 

TGAATAATTT TCGGATGTCT ATGAC CTTGG TTAACAGCGG AATATGCAGA TAACATATCC 5100 

ATATATTTAT TGCCTTCAGG ATCTTTAACC CATACCCCTT CAGCTTcTGa AATGaCAATT 5160 

GGcAATGGTA AATAATTATG TGCTCCGTAA TGATTTGTTA ACTCAATAAT TTTTTCAGAT 5220 

TTAGTCATCA TATCTCCCCT TTTCATCATT TATAACTATT ATACATGAAA CATTATCCAA 5280 

ATAATTACAT TAGTTTTCAA AG CAGATACT TTTCCACCAA AAAAGATGAA ATAATCACTA 5340 

AG TTTCATTA AATTTGTCTA TTTTGAAAAC CCTTACATTT ATAATGACAT AATTACTTAA 54 00 

ATGaOTACAA GCAAAAGAAT TGATAATTTT ACACTTAAT C AAAAGTATAT TTTACTAAGA 5460 

ATATTTTTAT TTATAAATAT TGAAAACCAC TAACAAATTG CATACACAAT ATCATTAGTG 5520 

GTAACAGTTA AACACTTATT TATCTTTACG GGGTAATGGG TTAAAACCCT TnCATTAAAA 5580 

TTGGATGnCC ATAAAATTAG GG 5602

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear
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TAACCCCATT TTACCTGGAA AAATCgTTTG 
TTTACGTATa GAATTATAAA AgGTTTCATT 
5 TTTTATGTCA AATTTAAAAC AGTAACACTT 

AAAGGAAGTG TCGCGTATTT TAACTTTTTC 
TTTAATGAAT ACTTATTGCA ATCAATGTCC 

10 

GAAAACGAAG GCGCATCATT TTTGTATCAA 
ATTAGGAAAT GAACTTCAAT AGGAGGAAGT 

15 CCGAACCACA ACACAATGAA GATTACACTT 

ACGTATACTA AAGTTGATGA TTCACAGCCA 
GGCGTTAAAT CAATTTTCCA TGTTATGGAC 

20 AATTGGGAAA CAGTATTGCC AAAAGTAGAG 

GTATTCGGGG GGAATAAAGT ATATGGAAAT 
TACAATGAAA GTTGTTTTGT CATATACAAG 

25 AGTAGAAGAA ACACAACCAA GATTTATAAA 

CATTTTTCAT GTCATGAACT TCTTAGCTGT 
CATATTACCT GATATTAAAG CTGCTTTTTC 

30 

TGAACCTCAA ATTGACAATC ATTTTGGTGA 
TATACCGTAT CAAATTAAGC TAACTTCTGC 
AACATATGTT GACCATATGA CTCAAGCGCA 

35 

TAAATGGCTA GATTTAGGAA ATCGCTATGG 
AGAASi^GT^ — CTAGCTACCT AT^ 

40 AGAAGAAAAT CACGCAACTA ATAATTATCA 

TCATGCAACT GATAATTGGA AGACTCGATT 
TTTTGAAGAT ATACCGCTGC TTGATTTAGC 

45 TCAAGCGATT GTATTATTAG GTATGATTGA 

GGGGCTTCGT GATAAAAGTC CTGCTGTAAG 
AGGGTATCCA GAGGCACTAC CAGAAATGGT 

so 

TAGGTGGCGT GCTGCTATGT TTATCTTTGA 
AAAAGCCCAT ATTAATGACA ATGCGTTTGA 

55 
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CGATGCaATm GCaTTtGaAT ATAaATACAT 6 0 

CaAATCTTAG GGTCAAAAAT GTTATAATAT 120 
ATTTACAAGG TTGCAATATT TTGAAGTAAT 180 
AGAGCAAAAT GCACTCGCGA AAATAGATGA 240 
AATCAAAACT CGTCTGCGTA AATTAGAGGG 3 00 

TGAGTGTTCA ATAGGGAAAG AAATAAAACA 3 60 

CAAATGAAAA TTATATCTAT ATCAGAAACA 4 20 

AGTGAAAGCA GAGAAGGTAT GACATCAGAT 4 80 

GCATTTATTA ATGACATCTT AAAGGTTGAA 540 
TTTATTTCAG TAGATAAAGA AAATGACGCA 600 
GCTGTATTCG AATAAATTTT TCATCAACTA 660 
TTTACGTATA GAGCCAACAC CAAGTCCAAA 720 
AGAAGACAAG TTATCTAATA CTTATAAAAA 780 
TCAGTTGTTA TCTATAGATG GTATCACTTC 84 0 

TGATAAGGCA CCAAAAGCTG ATTGGGAAGT 900 
TGATGCGAAT AAGGTTTTAG AATCTGTAAA 960 

AATTAAAGCT GAATTATTAA CTTTTAAGGG 1020 

TGACCAAGAA TTAAGAGAAC AATTACCACA 10 8 0 

AACAGCACAT GACAATATTG TTTTTATGCG 114 0 

AAATATTCAA GAAGTAATGG ATGGTGTCCT 1200 

ACAGTTACCC GTATTGGTAA~ AACATGCTTT 12 60 

TTTCTATCGA CATGTCTCTT TGGATGAATA 1320 

ACGAATGTTA AACCATTTTC CAAAGCCGAC 1380 

TTTATCTGAT GAAAAAGTAC CGGTTAGACG 144 0 

AAGTAAAGAA ATTTTACCGT ATTTATATAA 1500 

AAGAACAGCA GGGGATTGCA TAAGCGATTT 1560 

GCTACTATTA GATGATCCAC AGAAAATCGT 1620 

TGAAGGTAAT GCAGAGCAGC TTCCCGCACT 1680 

AGTTAAATTA CAAATTGAAA TGGCCATATC 1740 
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AATTTAATTG GAGGAATTAA ATATGAATGC ATATGATGCT TATATGAAAG AAATTGCGCA i860 

ACAAATGCGT GGCGAATTAA CTCAAAATGG TTTTACAAGT TTAGAAACGA GCGAACAGct i 92 o 

ATCGGAGTAT ATGAACCAAG TAAATGCTGA TGACACTACT TTTGTAGTTA TTAACTCTAC 1980 

ATGCGGCTGT GCAGCTGGAT TAG CAAG AC C AGCTGCAGTA GCAGTTGCAA CACAAAATGA 204 0 

ACATAGACCT ACAAATACAG TTACAGTTTT TGCTGGGCAA GATAAAGAAG CAACTG CT AC 2100 

AATGCGAGAA TTCATTCAGC AAGCACCATC TAGTCCTTCG TATGCTTTAT TCAAAGGTCA 216 0 

AGATTTAGTT TATTTTATGC CTAGAGAATT TATCGAAGGT AGAGATATTA ATGACATTGC 2220 

AATGGACTTA AAGGATGCCT TTGACGAAAA TTGTAAATAG TACACATAAA TAAATATAAA 2280 

GGTTAACACA TTTTATAATA TTAAAAATGG TGTCTGTCAT TGAAAATAGA GAATATAGTT 234 0 

GTATTCTATT TGTTAAATAA AGTCCGTTTT TACCaACTAT ATTTTCTAGA AATTTAACTG 2400 

20 TTTTAATAGG ACATCAAACA TAATATT CaA ATCaTGTGTT AACCTCTTTT TTAAAATTTT 2460 

TTAGCATTAA AGTTATAGAT TTGGGTAAAC AATTACCAAT TGGAAACATA TATCACGTTA 2520 

CGATGGGGTA GGTACTTAAT CAGCATTTTA TAAATAAAGT AACGGAATTC ATGATATTAA 2580 

TATCATATTC CTAAAATGAG TGATAACAAA ATGCTACATA AAGTTAAGTT ATATCAAACT 264 0 

AAATATACAT ACTATAAATA ATGAAAATGA GGTGTTATCG CATATGTTGA ATTCATTTGA 2700 

TGCAGCATAT CACAGTCTTT GTGAAGAAGT TTTAGAAATA GGAAATACAC GAAATGATCG 2760 

CACAAATACA GGTACGATTT CGAAATTTGG TCATCAACTT CGCTTTGACT TATCTAAAGG 282 0 

ATTTCCACTA TTAACGACAA AGAAAGTTTC TTTTAAATTA GTAGCAACCG AATTATTATG 2880 

GTTCATTAAA GGAGATACAA ACATCCAATA CTTATTAAAA TATAATAATA ATATATGGAA 2940 

CGAATGGGCT TTTGAAAATT ATATCAAATC AGACGAGTAT AAAGGTCCAG ATATGACAGA 3000 

TTTC25GGCAT CGTGCATTGA GTGATCCTGA ATTTAACGAA CAATATAAAG AACAAATGAA 3060 

40 ACAATTTAAG CAACGTATTC TTGAAGATGA TACATTTGCG AAG CAATTCG GGGATTTAGG 3120 

AAATGTTTAT GGTAAACAAT GGCGAGATTG GGTTGATAAA GATGGTAATC ATTTTGATCA 3180 

ACTTAAAACA GTAATTGAAC AAATTAAGCA TAATCCAGAT TCAAGGCGAC ACATCGTATC 324 0 

45 TGCATGGAAT CCAACAGAAA TTGATACAAT GGCACTTCCG CCTTGTCATA CCATGTTCCA 33 00 

GTTTTATGTC CAAGATGGTA AGTTAAGTTG CCAGTTATAC CAACGTAGCG CAGATATCTT 3360 

TTTAGGTGTG CCATTTAATA TCcGCagctA CGCTTTATTG ACACACCTTA TTGCCAAAGA 3420 

SO 

ATGTGGACTT GAAGTGGGTG AATTTGTGCA TACATTTGGA GATGCACATA TTTATTCAAA 34 80 

TCATATTGAT GCGATTCAAA CACAATTAGC ACGTGAAAGC TTCAATCCTC CAACATTAAA 354 0 

55 
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TGAATCACAT CCAGCAATAA AAGCTCCAAT 
CATATAGACA TCAAAATGAC ATCATAGTAT 
5 TAAACGTTTT CATAAATTAT GCAAAATCAT 

TGTTAAATTA AAGATAACTT AGTAATAAAA 
TGACTTTATC CATTCTAGTt GCACATGACT 

10 

TACCTTGGcA CCTACCAAAT GATTTGAAGC 
TAGTAATGGG TCGTAAGACA TTTGAATCGA 
TTGTACTTAC TTCAGATACA AGTTTCAACG 

15 

AAGATATTTA CCAACTACCG GGCCATGTTT 
AAATGATTGA TAAAGTGGAC GACATGTATA 

20 ATACGTTCTT TCCACCTTAT mCATTkGAgr 

ACTAGATGAG AAAAATACAA TTCCACATAC 
GGGAAAACGA CCATGACAAA ACAGATTATA 

25 GAATACTTAG AAGCAAACAA CATTCATGTA 

TCATACGTTG ACCAAGTAGA TATTACATCA 
GAAGATGTAA AGACAAGTCA GCCAGCCATA 

30 

GGAAAAGATG GCTCTGAAAT CATAAGTATT 
AACACTGCTT ACCAAGCAAG TCAAATGGTA 
TCTATTTCTT TTGGTTTAGG GTATCAAATA 

35 

GtCTCAACTT CTGAAATAGT TAAAAAGTTA 
GTAGTTATAG GG CAATTGAA TCAATTAATT 

40 TTGATTGGTA ATCTTATGAA AATTAAACCA 

CTTGTGCmCA ATGCGAGAAC TCaAAATTCk 
GAATTTATAG GAGATCATGA AATCAAATCC 

45 TATGTTGATA AATTGAAGAA AGTTTTTAAT 

AATGTAACTA CACCAGTTAT TTCTGCACAT 
CTTAAGAAGT AAATTTAATC TTTTCAGTGT 

50 

TAAATTTATA ATTAGATAGA TAGAGGAGGT 
AGCAGGAGGA TGTTTCTGGT GCATGGTTAA 

55 
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AGCAGTGTAG TCATTGCATA GTTAGCTAAC 3660 

TTTCAAGTGC AAAAAAGTAC TTTTTTGTGT 3720 

TATTTCTATC ACACTTTATG ATAAAAATTG 3780 

AATGAAATGA TAGAAGAAGG AGGATAATTA 384 0 

TGCAACGAGT AATTGGTTTt GAAAATCAAT 3900 

ATGTTAAAAA ATT AT CAACA GGTCATACTT 3960 

TTGGTAAACC ACTACCGAAT CGTCGAAATG 4 02 0 

TAGAnGGCGT TGATGTAATT CACTCTATTG 4 080 

TCATATTTGG AGGGCAAACA TTATTTGAAG 414 0 

TTACTGTTAT TGAAGGTAAA TTCCGTGGTG 4200 

CTGGGAAGTT GCCTCTTCAG TTGAAGGTAA 4 260 

CTTTCTACAT TTAATTCGTA AAAAATAAGG 4320 

GTAACAGACT CAACATCCGA TTTATCTAAA 43 80 

ATTCCTTTAA GTTTAACTAT TGAAGGAGCT 444 0 

GAAGAATTTA TTAATCATAT TGAAAATGAT 4 500 

GGTGAATTTA TATCTGCTTA TGAAGAACTA 4 560 

CATCTTTCTT CAGGATTAAG TGGTACATAT 4 620 

GATGCTAATG TAACTGTTAT TGATTCAAAA 4 680 

CAACACCTAG TAGAGCTTGT AAAAgAaGGT 474 0 

AATCATTTAA GAGAAAACAT TAAATTATTT 4800 

AAAGGTGGCA GAATTAGTAA AACAAAAGGT 4 860 

ATTGGTACAC TAGATGATGG TCGCTTAGAG 4920 

AGTATCCAAT ACTTGAAAAA GGAAATTGCT 4 980 

ATTGGTGTCG CACATGCTAA CGTCATTGAA 5040 

GAAGCTTTTC ATGTGAATAA TTACGATATA 5100 

ACTGGTCAAG GTGCGATTGG CCTCGTAGTC 5160 

TAATTACTTC CATTTCAATC CTTTATAGAC 5220 

AATTCATATG ACAAAAGAAT ATGCAACATT 5280 

ACCATTTACA TCATATCCAG GCATCAAGTC 5340 
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GAATCAAACC GGCCATGTCG AAGCAGTACA AATTACGTTT GATCCAGAGG TTACTTCCTT 54 60 

TGAAAATATA TTAGACATAT ATTTCAAAAC ATTTGACCCA ACTGATGATC AAGGGCAATT 5520 

TTTCGATAGA GGCGAAAGCT ATCAACCAGT CATTTTCTAT CATGATGAAC ATCAGAAAAA 5 580 

GGCTGCTGAG TTTAAAAAGC AACAATTAAA TGAACAAGGT ATTTTCAAGA AACCAGTGAT 5640 

TACAC CT ATT AAACCATATA AAAATTTCTA TCCAGCTGAA GACTACCATC AAGATTATTA 57 00 

CAAAAAGAAC CCGGTACATT ATTACCAATA TCAACGTGGT TCAGGTAGAA AAGCGTTTAT 5760 

AGAATCACAT TGGGGGAATC AAAATGCTTA AAAAAGATAA AAGTGAACTA ACAGATATAG 5820 

15 AATATATTGT TACACAAGAn AACGGCACTG AACCACCATT TATGAATGAA TATTGGAATC 58 80 

ATTTTGCTAA AGGATTTATG TAGATAAAnT TCnGGTAAAC CTTG 5924 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

GGCCGTTnAA AATCTCCAAA ATAnAAAAAC CCATCTTGTT CCAATGTTTT AAAATCGCCa 60

TCCaACACTT GaTCaATAGC TTGCAACAAC GTTGAACGTG TTTTaCCAAA AGCATCaAAC 120 

GCTCCCACTA AAATCAGTGC TTCAAGTAAC TTTCTCGTTT TGACTCTCTT CGGTATACGT 180 

CTAG CAAAAT CAAAGAAATC TTTAAATTTG CCGTTCTGAT AACGTTCATC AACAATCACT 24 0 

TTCACACTTT GATAACCAAC ACCTTTAATT GTACCAATTG ATAAATAAAT GCCTTCTTGG 300 

GAAGGTTTAT AAAACCAATG ACTTT CGTT A ATGTTCGGTG GCAATATAGT GATACCTTGT 360 

40 TTTTTTOCTT CTTCTAT CAT TTGAGCAGTT TTCTTCTCAC TTCCAATAAC ATTACTTAAA 420 

ATATTTGCGT AAAAATAATT TGGATAATGG ACTTTTAAAA AGCTCATAAT GTATGCAATT 480 

TTAGAATAGC TGACAGCATG TGCTCTAGGA AAAC CATAAT CAGCAAATTT CAGAATCAAA 540 

TCAAATATTT GCTTACTAAT GTCTTCGTGA TAACCATTTT GCTTTGCACC TTCTATAAAA 600 

TGTTGACG CT CACTTTCAAG AACAGCTCTA TTTTTTTTAC TCATTGCTCT TCTTAAAATA 660 

TCCGCTTCAC CATAACTGAA GTTTGCAAAT GTGCTCGCTA TTTG CATAAT TTGCTCTTGA 720 

TAAATAATAA CACCGTAAGT ATTTTTTAAT ATAGGTTCTA AATGCGGATG TAAATATTGA 780 

ACTTTGCTTG GATCATGTCT TCTTGTAATG TAAGTTGGAA TTTCTTCCAT TGGACCTGGT 840 
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ACACTTCTTA 

AATTCAAACA 

TCTTTTTTGA 

TCTATTTTTA 

GTATCCCCTT 

GCCGCATGTG 

CGTTCATGTC 

TCAAGTGTAA 

TCAAACCCCA 

GTCACAATTC 

TCTCGGCGTG 

AAAAAACGTT 

TAACTGACCA 

TTCGCATAAT 

ATAACTTTAT 

TTCAATTGTG 

TCATATTGAG 

ACATCAACCT 

AAAAAATGTG 

ATAGCTGCTA 

ATC»AACAT- 

TGGGTTTCAA 

ATCATATTGT 

TTTATTTCCG 

ACTGTTTCGA 

ATGTTATTTG 

GTTATGGCAA 

AAGCTTGAAT 

TGGCCCCTTT 



CACCGTCAGA 
CTTTTTGATC 
CTTGTGTTAA 
ATAACCCAAT 
TCGTTAAAGG 
TAGATGTATG 
GATGGTTTCG 
TTCCTAATTT 
TAATTCTTCC 
CAGATACATG 
TATCTTCAAA 
GAAATAATAG 
GTGAGCCAGC 
GGATTAAATC 
ACTCATATTT 
TAACTAAGAC 
GAAGTAGAGA 
GCGTTAAATA 
CACCAAAATC 
ATGCAGAAAT 
GTTTTCTAGC 
ACACTTGTAC 
TAGAAAATCG 
ATGATAGTTG 
CTGTATTTAA 
CTATACATGC 
GTGCATCAAC 
TTAACAAATC 
CTATTAGTTA 



CTCTAATTGG 

ATCAAACGGA 

GATTTGATGA 

ACGTTCGGCT 

GGCATATTCA 

TCTTGGTAAA 

ATGTACAAAC 

ATGTGGGATT 

AACATCTCTA 

TAGCTCGCCA 

GTCAATATCA 

ATTGAATTTA 

TGAAGAACCA 

ACTTACTATT 

CAATCGCTCT 

ACGCCACAAA 

TTGATGATAT 

TTCTTGGTTA 

TTCTTGATCA 

CGTATCGGCA 

TTGAATCGAA- 

AATATCACGA 

TTTTAATAAT 

ATACAAATCT 

TCCATTTGTC 

ATCATAAAAT 

ATTTTCAGAC 

ATAAGCCGTA 

AGTTTTGTGC 



AATATGCCAG 
ATCTTTTCGA 
ATAATCGATA 
TCAGTCATTG 
TATAATGGAT 
CCTTCTAACT 
TCTTTAAAAT 
AAACTTGAAA 
GCAACTGCTC 
TATTTTTCTT 
ATATCAGGCA 
ATAGGATCAA 
CGACCAGGAC 
AAGAAATAAT 
AAATAGACGT 
TATTTTTTAG 
TTTAATTCTG 
ATATCTAATT 
TGAATTAAGT 
TCTTGACGTG 
-ATACTAAGGT- 
TGTTGATCAC 
TCAAACGACA 
TTTAATCCAT 
ACATATATTG 
TTAGGAAAAC 
ACAGCAAGTC 
TGAATATTTA 
GTAAAGCTGT 



TCGTATCTCC 
TATCAATATT 
AGTTTCTCAA 
TCCATTGCGT 
GGTCATTAAT 
TTTTACAAAT 
CGTCAATTTG 
TTTCATTTAA 
TTGCAAGCAG 
GGACGTACTG 
TTGTTACACG 
TCGTTGTAAT 
CTACCATCAC 
CTTCAAAACC 
CATAATTAAG 
CTGATTCATC 
CATCACACTT 
GATTAATTTC 
CTAATTTTGT 
TTTGGTAACA 
GGT CCATATA 
CGACTTTTTT 
CATGTTCTAA 
CATTATTTTT 
TCATACCAAA 
CATACAATAC 
TTACgGCATC 
AATATGCCAC 
AGCAAGTTGC 



TTGCGACAAC 
AATACCTAAA 
CCCTAGAAAA 
TAATAATCCT 
AATAATTCCT 
ACTGAACCAG 
ATATGCTTCA 
TGTAACTTCA 
ATGACCGAAA 
AATGACCCTT 
TTCTGGGTTT 
TCCCAATAAA 
ATCATTCGTT 
CATATTAGTA 
TTCTAATTTT 
ATTAGGTGTC 
TTGAGCTATA 
CTTTTCAGTT 
ATTGTCTCTA 
AACATtTTGA 
-TGTGTCATTA 
AAAAATGATA 
TGCATTCATT 
AGCTAGAACA 
AATCGGTTTA 
ATTGGTGTCA 
TTCTATTTTT 
CATGATTGAA 
TCAAATTCAT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
-2160- 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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CAATATCATT AATAATCAAT TGCCCTTTAG AACGTAATCG ACATCTGATT TCATTACCTT 
CATCGACTGC AAATACCCAT ATTTTCAAGC CTTTGATGTC AGCAATTGTA TTAACAAACT 
GAGATGCTTC ATTTGGCTGA ATACCGAATT GCTCCAATAC ATCTTCAGTT ATTTTAACTT 
GGCAGAATCC ATCATCCATA AGTTCGAAAT GTTGTAAAAC ATAACCTTGA AACGGCAACA 
TTTTTGGGTC CTTCTCCATC ATTTTATTTA AAAGCGCATT ATGATCAATA TCATGCCCAA 
TTAACTTTCC AGCAATTTCC ATAGTATGTT CTGAGGTATT GTTAAAAAGG AATCGCCCAG 
TATCACCGAC GATACCAAGA TATAAAACGC TCGCGATATC TTTATTAACA ATTGCTTCAT 
CATTAAAATG TGAGATTAAA TCGTAAATGA TTTCACTTGT AGATGACGCG TTCGTATTAA 3180
CTAAATTAAT ATCACCATAC TGATCAACTG CAGGATGATG ATCTATTTTA ATAAGTTTAC 3240
GACCTGTACT ATAACGTTCA TCGTCAATTC GTGGAGCATT GGCAGTATCA CATACAATTA 3300
CAAGCGCATC TTGATATGTT TTATCATCAA TGTTATCTAA CTCTCCAATA AAACTTAATG 3360
ATGATTCCGC TTCACCCACT GCAAATACTT GCTTTTGCGG AAATTTCTGC TGAATATAGT 3420
ATTTTAAACC AAGTTGTGAA CCATATGCAT CAGGATCTGG TCTAACATGT CTGTGTATAA 3480
TAATTGTATC GTTGTCTTCG ATACATTTCA TAATTTCATT CAAAGTACTA ATCATTTTCA 3540
TACTCCCTTT TTTAGAAAAG TTGCTTAATT TAAGCATTAG TCTATATCAA AATATCTAAA 3600
TTATAAAAAT TGTTACTACC ATATTAAACT ATTTGCCCGT TTTAATTATT TAGATATATA 3660
TATTTTCATA CTATTTAGTT CAGGGGCCCC AACACAGAGA AATTGGACCC CTAATTTCTA 3720
CAAACAATGC aAGTTGGGGT GGGGCCCCAA CGTTTGTGCG AAATCTATCT TATGCCTATT 3780
TTCTCTGCTA AAGTTC CTATA cTTCGTCAAA CATTTGGCAT ATCACGAGAG CGCTCGCTAC
TTTGTCGTTT TGACTATGCA TGTTCACTTC TATTTTGGCG AAGTTTCTTC CGACGTCTAG
TATGCCAAAG CGCACTGTTA TATGTGATTC AATAGGTACT GTTTTAATAT ACACGATATT
TAAGTTCTCT ATCATGACAT TACCTTTTTT AAATTTACGC ATTTCATATT GTATTGTTTC 4020
TTCTATAATA CTTACAAATG CCGCTTTACT TACTGTTCCG TAATGATTGA TTAAAAGTGG 4080
TGAAACTTCT ACTGTAATTC CATCTTGATT CATTGTTATA TATTTGGCGA TTTGATCGTT 4140
AATTGTTTCA CCCATCTGAG GCTGTCTTCC TAAAAGTTGC ATAGACTTTA AAACATCTTG 4200
TCTATTAATC ACACCCACTG TCTTTTTATT ACTCGAAACG ACAGGAATCA ATTCAATACC 4260
TTCCCAAATC ATCATATGCG CACAACTTGC TACTGTACTC ATAGCATTTA CATAAATAGG 4320
ATTTCGCGTC ATCACTTTAT CTATTTCGTC GTCGTCCTTT GTATTAATCA TCTCTCGACT 4380
TGTTACAATA CCTACTAATT TATACGACTC ATTGACTACC GGAAATCTTG TATGGCCAGT
ATCTAATGGC GTCATTATAT CTTGAACTAT TAAGATATCT TTTCGTATTT TCTGATTAAA 4560
AAGTGCTTTG TTGATAATAT TTGCAACTAG GAATGTATCA TAACTTGATG ATAGAACAGG 4620
TAAATCATGT TCATTCGCAA AATTAATAAC TTTATTAGAT GGCTTAAATC CACCAGTAAT 4680
TAATATAGCC GTACCTCTTT TTAAAGCTTC AATCTGCACA TCTTCACGAT TTCCGACAAT 4740
CAATAATGTC TTTGGACCAA TATACTTTAA AATATCTTTG AGTTCCATTG CTCCAATTGC 4800
AAATTTAGAT ACCATCTTAG TGATACCTTT GTTGCCACCT AACACTTGGC CATCAATAAT 4860
ATTGACAATT TCATTAAAAG TTAAATGTTC AATTTCATTA CGATTACGTT TTTCGATTCG 4920
AACCGTACCA ACACGATCTA TCGTTGCGAC CATGCCCATT TTATCAGCAT CTTTtnATTGc 4980
ACGATATGCT GTCCCytCaG ATACGTTTAA AAATTTAGCG ATTTTACGCA CCGAAATTTT 5040
AGAGCCTATA GATAACGATT CAATATAATC TAAAATTTGT TCATGTTTTG TCATTCTTTA 5100
CCTCTTCTTT TCGAACAGTA TTAACTACAT TATAACTTTA TTTTGGATAA AAAGCATTGA 5160
AGTGAAATGA AATAATGATC GTTtCACCTA TTTTATTTTT TGAAAATATA CAACAAACAC 5220
AAAGATCACA AAATCTTTAA TTTTAAATGG AAAAATCCAT TATTATTTAT TAGAATGTAA 5280
GTGAGGAGGG ATGTACTAAT GTATAAAAAT ATATTACTTG GTGTAGACAC TCAGTTAAAA 5340
AATGAAAAAG CACTAAAAGA AGTGTCTAAA TTAGCTGGCG AAGGTACAGT CGTAACAGTT 5400
TTAAACGCAA TCAGCGAACA AGaTGCTCAA GCATCAATTA AAGCAGGTGT TCATTTAAAC 5460
AAACTTACTG AAGAACGAAG CAAGCGATTG GAAAAAACAC GCAAAGCTTT AGAAGATTAT 5520
GGTATTGATT ATGACCAAAT AATTGTTCGT GGTAATGCAA AAGAAGAACT ATTAAAACAT 5580
GCTAATAGCG GTAAATATGA AATTGTTGTT TTAAGTAACC GTAAAGCAGA AGACAAAAAG 5640
AAATTTGTAC TTGGAAGTGT CAGCCACAAA GTAGCAAAAC GTGCGACTAT CCCTGTATTA 5700
ATCQ£TAAAT AAAATTTTTA TCCAGAATCA CAAATAATCT TTC^TCATG JVTCa^GTCTC 5760
AAACGACTGA GTAAATACAA GAAACGATTA TGACTGTGGT TCTGGATTTT TTATATCGTA 5820
GTAAATTTAT AATCAATGTC TAATTGTATA AAACTAAAAT TACGAGAGTA GGTCAGAAAT 5880
GATAAAGAAC CACTGATGTC CCCCGTCCAC GTCGTAACTG AATCAGTAGA ATATAAAAAC 5940
ACCCACTAAA AATATGCAGA CGATAACTTC CACATAGATT AGCGAGGTGT TTTTTAGTGT 6000
AAAATCTATA TTCTATTTAA AACTGAACAG ATTCACCTGG TTTTAAAATT TGCACGTCCC 6060
CTACATTAAC AGCATCTTTA AATTGTTGTG GATCTTGTTC GATTAATGGG AATGTATCAT 6120
AATGAATCGG TACAGAAATT TTTGGTTTAA TAAATTCATT AATAGCATAA CTTGCATCAT 6180
CAATACCCAT CGTAAAATTA TCTCCAATTG GTACAAAACA TACATCAACT GGATGACGTT 6240
TTCAACTTCA AACACGATAC CCATTGGCAT ACCTAAATAA ACTGGgAATA CCATTTTCAT 6360
GTGTAAAACT TGAACTATGA AATGCTTGAA CAAATTTAAC GCTTCCGAAA TCAAaGTTTG 6420
crrrAccAcc AaTArrcATA ccatgaacat tttcaacacc GTGATATGAA GAAAGATAGT 6480
CAGCCATTTC TGCACTTCCA ATTACTGTTG CTCCTGTTTT CTTTGCTAGT TCCACAACAT 6540
CACCAAAATG ATCAAAATGA CCGTGCGTTA AAACGATATA GTCTACCTGC ACTGTTTCAA 6600
TATTCAAATC ACACTTAGGG TTATTTGAAA TAAACGGATC TACGATAACC TTTTTGTTGT 6660
TCCCTTCTAA ATAAATCGTT GATTGACCAT GAAATGATAA CTTCATTTGA GCATCCTCCT 6720
ATCAATTACT ATATAAATTT AGTACCCTTT TGCCACTTAA TTATAACAAA TTCTCAAATT 6780
TTAAAAATTG AAAATCTAGT TAATGTATTA GCTCGATTTT GAAATCTAAT AATAATTGGC 6840
ATAAAATGGA AGTAATATTA TGTTGAGGAG TGTTTATAAA ATGACAAAAA TATCAAAAAT 6900
AATAGACGAA TTGAACAATC AACAAGCTGA TGCAGCATGG ATTACAACAC CGTTGAATGT 6960
ATATTATTTT ACTGGATACC GTAGCGAACC CCATGAAAGA TTATTTGCAT TATTGATTAA 7020
GAAAGATGGT AAACAAGTAC TATTTTGTCC AAAAATGGAA GTCGAAGAAG TCAAAGCATC 7080
ACCTTTCACA GGTGAAATCG TTGGATATTT AGACACTGAA AACCCTTTTT CACTTTATCC 7140
TCAAACAATC AATAAATTAC TAATTGAAAG CGAGCACTTA ACAGTAGCAC GCCAAAAACA 7200
ATTAATCTCT GGTTTCAATG TCAATTCATT CGGAGATGTT GATTTAACAA TCAAACAATT 7260
GAGAAATATT AAATCCGAAG ATGAAATTAG CAAAATACGT AAAGCTGCTG AGTTAGCAGA 7320
TAAGTGTATC GAAATAGGTG TTTCTTATTT AAAAGAAGGT GTGACTGAAT GTGAAGTAGT 7380
CAACCATATT GAGCAAACTA TCAAACAATA TGGCGTCAAT GAAATGAGTT TTGATACGAT 7440
GGTTTTATTT GGAGATCATG CCGCATCACC TCATGGCACA CCAGGAGATC GCAGATTAAA 7500
AAGCAATGAA TATGTACTAT TTGATTTAGG TGTAATTTAT GAGCATTATT GTAGCGATAT 7560
GACACGTACT ATTAAATTTG GTGAACCTAG CAAAGAAGCA CAAGAAATTT ATAATATTGT 7620
ATTAGAAGCA GAAACATCTG CAATCCAAGC AATTAAACCT GGAATACCAT TAAAAGATAT 7680
CGATCATATC GCTAGAAATA TTATTTCAGA AAAAGGTTAT GGTGAATATT TCCCTCATCG 7740
CTTAGGTCAT GGCCTAGGAT TACAAGAACA TGAATATCAA GATGTTTCAA GTACTAATTC 7800
TAATTTGTTA GAAGCTGGCA TGGTTATTAC AATCGAACCA GGTATTTATG TACCTGGTGT 7860
TGCAGGTGTA AGAATTGAAG ATGACATACT TGTCACTAAT GAAGGATATG AAGTATTAAC 7920
ACATTACGAA AAATAAGGAG TGGGATAAAA ATGAAAAGCT TGTTACAAGC GCATTCTCAT 7980
TCAGTCAAAC ACTGCCAATA TAACATTGTA GCGCCTAAGA CATAAATTTT TATCCAAGTC 8040
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TGTAATGAAT CAAATCAATA TCATTCATGT TCGATGATTT CTTCGCATTG TTTCTAGCTT 8160 

TAATTTATCA TTATTTAATT TTAATAACCA AGGAGATGAT AACGTCATTC TTTAGTACGC 8220 

TGTAATCCAT TCCCTTTTCA TCAAATTCAA ATTATAATTG TAATGCTTCT TCTACAGATT 8280
TATATTCCAT TTCAAATGCC TCTGCAACGC CTTTATTGGT TACGTGACCT TTGTAAGTAT 8340
TTAAACCTAA TGATAATGGT TGATTTGATT TAAATGCTTC TCTATACCCT TTATTAGCTA 8400
GCATGAGCGC ATAAGGTAGC GTAgCATTAT TTAAAGCTAA CGTCGAAGTA CGCGGTACTG 8460
CACCTGGCAT ATTTGCAACT GCATAATGAA CCACACCATG CTTAATATAT GTAGGATCAT 8520
CATGTGTCGT AATTTTATCA GTTGtTTCAA AAATACCGCC TTGATCAATA GCAATGTCAA 8580
TAATAACTGA CCCATTTTTC ATTTGTTTAA TCATGTCTTC TGTTACAAGT CTTGGCGCTT 8640
TAGCACCTGG AATTAAAACT GCACCTATTA CTAAATCACT TTGTTTAACA TACAACTCAA 8700
TATTCAACGG ATTTGACATA ATTGTATGTA CACGTCCACC GAATAAATCA TCTAATTGTT 8760
GTAAACGCTT TGGATTAACA TCTAAAATCG TAACATCTGC ACCTAGTCCT AGTGCAATTT 8820
TAGCTGCATT TGTTCCTGCT TGACCACCAC CGATAATAGT TACTTTACCC TTAGGTACTC 8880
CTGGGACACC ACCTAGTAGA ATTCCCATAC CACCATTAAG TTTTTGTAGG AACTCTGCGC 8940
CAACTTGAGC TGACATTCTT CCTGCTACCT CACTCATTGG TGATAACAAT GGTAAAGATC 9000
GGTCTGGTAA CTGCACAGTC TCATATGCAA TACTAATTAC TTTTCTATCT ATCAAAGCTT 9060
GTGTTAATTT TTCTTCATTT GCTAAATGAa gatAaGTGAA TAATACAAGC CCTTCTTTAA 9120
AATATGGATA TTCAGATTCA AGTGGTTCTT TAACTTTAAT AACCATATCC ACATCCCAAA 9180
CTTTTGCTTG TTCAGCAACA ATCTCAGCAC CTGCTTCTTT GTAATCTACA TCTTCAAAGA 9240
ATGATCCTGA ACCCGcATTT GTTTCCACTA AAACAGTATG 9280

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

CTGATTAATC TCTTGTTGTC GTGTATTTAC TAATTGAATC GTTGGTGTCT GAACACGTCC 60 

CAGGGATAGC TGTGCATCAT ACTTTGTTGT TAGTGCACGC GTTGCATTAA TCCCAACAAT 120

CCAAtCTGCC TCACTTCTCG CTAACGCTGC ATAATACAAA TCGTTATATT GACGACCGTC 180
ACGGATTGGC TTTTTGTTAC CAACTTTATC CAAAATCAAT CTTGCAACTA GTTCACCTTC 300
TCGTCCaGCA TCTGTTGCAA TAATAATATC TTTCACTTTA TTATCTAAAA TTAACGCTTT 360
TACTGTTTTA AATTGTTTGC TTGTTTTACC AATAACAACA GTTTTCATAT ATTTAGGTAT 420
AATTGGAAGG TCTTCTAATC GCCATTCCTT TAAATTTTTA TCGTATTGTT CAGGTGTCGC 480
ATTTGTCACT AGATGACCTA ACGCCCACGT GACAATATAT TGGTTATTTT CAAAGTAACC 540
ATTACGCTTC TGATTTATTT GTAAAGCATC AGCAATATCT CTTGCGACTG ATGGTTTTTC 600
AGCTAATATT AAAGATTTCA TAAATTATCC TTTCTCATAC GTTCTTTTAT TTCGAACGTG 660
CTTCATCTAT TCCACTAATC TTTGATTTAA ATTCAATGAT TGCAAATGAT GTGTTAAATG 720
TATTGTAACA TGTTAATATC ACTATTAACT TTCATTTCAG TTGAAATACT ATATAATAAA 780
AGTAACAAAA AGTACGGAGG TAATGACATG AGCATAGTTC AGTTATATGA TATTACACAA 840
ATAAAATCGT TCATTGAACA TTCGAATTAT GAATGAGCAT CATACTTATA TAAACTTCCT 900
CAACAGTACA ATGAAATAGA TGTATTAATA ACCGATGCGA TTGAATCACC TGGTGTATTT 960
TCGATTAAAG AAAACGATTC AATCAAAGCA ATCATATTGT CTTTTGCATA CGATAAAAAT 1020
AAATTCAAAG TCATAGGCCC TTTCGTGGCT GACAATTATG TATTATCTGT CGATACGTTT 1080
GAAACGCTAT TTAAAGCAAT GACTTCGAAC CAACCTGACG ATGCCGTCTT TAACTTTTCT 1140
TTTGAAGAAG GCATTCAACA ATACAAACCA TTAATGAAAG TTATTCAAGC AAGTTATAAC 1200
TTCACTGACT ATTACATAGA AGCCCGTACA AGATTAGAAG AAGATATGCA CCAACCAAAT 1260
ATCATTCCTT ATCACAAAGG GTTTTATCGT GCTTTCAGCA AATTACACAC AACTACATTT 1320
AAATATCAGG CACAGTCACC ACAAGATATC ATTGATAGTT TAGACGACCA TCATCATTTG 1380
TTTTTATTTG TTAGCGAAGG TTTACTTAAA GGTTATTTAT ACCTTGAAAT TGATTCACAA 1440
CAGTCAATCG CCGAGATTAA ATACTTCAGT TCTCATGTAG ATTACCGTTT GAAAGGTATC 1500
GCTTTCGAGT TGCTTGCGTA TGCATTGCAA TATGCTTTTG ATAATTTTGA TATTAGAAAA 1560
GTTTATTTTA AAATTCGTAA TAAAAATAAT AAACTCATCG AACGATTTAA TGGTCTAGGT 1620
TTCCATATCA ACTATGAGTA CATTAAATTC AAATTCGAAT CACGTAACGT AAAAGATCAA 1680
ACAATCCCTG AATAAAACAC CAAGCAAATA CCCTACAGTA CATCATTAGC ATGTATTGTG 1740
GGTTTTTCTA CTTTTTGTAA ATATTGAAAA TTATAAGTAG TTGTTTTTTA CTATTAGGGC 1800
AGAATGCTTT ACAATAACAT GCAAGTGTCA ATTAAGGGGA GCACTTGCAT AAATAGTATA 1860
GGAGAGTGAG TAGTCTTGCA ATTTCTTGAT TTCTTAATCG CACTTTTACC TGCTTTATTC 1920
TGGGGAAGTG TCGTTCTTAT TAATGTGTTC GTCGGCGGTG GACCTTACAA CCAAATTCGT 1980
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TTCAATAATC CTACTGTAAT TATTGTCGGT CTTATTTCTG GTGCATTATG GGCGTTTGGA 2100 

CAAGCGAATC AGCTTAAATC TATTAGTTTA ATCGGTGTAT CAAATACTAT GCCAGTTTCT 2160 

ACAGGTATGC AATTAGTTGG TACAACATTA TTCAGCGTTA TCTTTTTAGG TGAATGGTCT 2220 

TCAATGACTC AAATTATCTT TGGTTTAATC GCCATGATAT TATTAGTTAC TGGTGTAGCA 2280 

CTTACTTCAC TTAAAGCTAA AAATGAACGT CAATCAGATA ATCCTGAATT TAAAAAAGCA 2340
ATGGGTATTT TAATTGTATC TACAGTTGGA TATGTAGGTT TCGTTGTACT TGGTGACATC 2400
TTTGGTGTTG GTGGAACTGA TGCATTGTTC TTCCAATCTG TCGGTATGGC AATTGGTGGC 2460
TTTATCCTAT CCATGAATCA TAAAACATCA CTTAAATCAA CAGCACTTAA TCTATTGcCA 2520
GGTGTGATTT GGGGAATTGG TAACTTGTTC ATGTTCTATT CTCAACCAAA AGTTGGTGTA 2580
GCTACAAGTT TCTCATTATC ACAGTTACTT GTTATCGTTT CAACCTTAGG CGGTATTTTC 2640
ATTTTAGGAG AAAGAAAAGA TCGTCGTCAG ATGACGGGTA TTTGGGCAGG TATTATTATT 2700
ATCGTGATAG CTGCTATAAT TCTAGGTAAT TTGAAATAGA AAGTTAAATA CTCATGTAAC 2760
GTAAAAATGT AATCACTTCT GAAAATAACC ATTCACTTAT AGAATGATTA AAATTAATTT 2820
TOGGGAATTT TACGTTGAAT GTTCCTCTAT ATGTCCTAGG AAATACGTGG CTCTAAAAAC 2880
AAAACGCAAT AACACATCAT GACATTAATC ATGCGTTTTA AGACTTTAAA ATTAGCGATA 2940
CTTTTAAAAT CTTGATGATA TTCATATATC AAGTATGCGC CATACATATG AAGTGGATAG 3000
CTGCATAACG CACTGCATTA TCAACTTGAA TGTATGAGTT GAACAACTAT GTCATAAATA 3060
AAAGCCCCCT TTTCACAATA TACATTTACA TATTGTGGTA AAGGGGGCTC TCATTTTCTA 3120
CGAATACTAA AATGGATTTT ATTTTCAAAT GTGTAAACTA GACAAACACT GCCTGATACA 3180
CGTACAAAAT AATGATACTA ATAATGATTG TCAAATTGGT CGTCATACCT ATAAATGGCA 3240
GTGTTGGATA TTTAAACTGA ATACCATAAG AAATAATTGC AACACcTACC J3GGAACATCC 3300
AAGTGACCAA CAATGTCGTC TTAATCATAT CATCTGATAC TGGTAACAAO ACATATACTA 3360
ACAATCCCGC AACTAATGCT AATCCATAAT GCAAACATAA ATATTTAATA GTAGCAGGTA 3420
TATACTTTCT TTCCAGAGTA AAATTCAACA TGACACCTAG CAAAATCATT GATAACGGCA 3480
TATTTGCATG GGAAAGTATG CTAAAGAAAT CGATTGCCAC ATGTGGTAAA TGGATGTGAC 3540
TTATATTCAA TATAAACATT ACAATGTATG TAACGAGTGG CACTGATTGT AATAATTTCT 3600
TACCTAAATA TTTAAAATCG AATTGATCAC TACCTTCACT AAAGTAGCTA CCTACAAAGT 3660
AAGTAATTCC AAACATCACA AAGGCACCAC CTATATCAGC CATAACAAAA TAAATAAGTC 3720
CCGTTTTAGG CCATATCACT TCAATTAGTG GATATGCAAA CAATCCAATA TTCATAGCAC 3780
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CAATCATTTT CGCCACAATA CCATATATAA TCATTAAAAT TGGTAAAATG GAGAATGACA 
ATTTTAATTC TGCACTGTTT AAATTCACAA TAACTAAAGA TGGGAGTGTG ACATTAAGAA 
CTAATGTAGC AATGACTTGA CTATCTGTTG CTTTTATAAA ATTAATGCGC TTCAAAAAGT
AACCAAGCGC AATTAATAAA ATAATCATAG TAAATTGTTC TGTCACTGTT ATCCCTTCTT
TCAATAATCT TCATAATTTA TAACTTTAAC ATACTCCACA GATATTTTAG AAGTCTACTG
TTTCATGCTA TAATCTACAT TAAATGCACT TAATTATATT TCAAAGGAGT GTTATAGTAT
GTCTTTAGAA AACCAACTAG CCGAACTTAA ATATGATTAT GTTCGTCTTC AAGGTGACAT
AGAAAAACGG GAATCTTTGA ATTTAGATAC TTCCGCACTT GTTCGTCAAC TTAAAGATAT
TGAAAATGAA ATTAGAAACG TTCGTGCTCA AATGCAAGAT TAATAATCTA TCATTCAAGC
AATAAATGCT TTTTGTTACA TAAATTTGAC TAGCATTGCT CTGAATACGT TATATTGATG
AATTGCTTCA TTTTTCGCTC AATTACATCT AGAATCACAA GATGTTGTCG TGTTATGATT
TAGTGTTTCA TTAACAACAT ACACGCATAT CTATCCCAAC ACTGCTATTT ATGTTTTCTA
CGCTGnTGTA CTACATGAAC CCTTTGAAAC GGAGAGGAAG TTATCATATG CAATTTTAnC
TGATTTTACT AGCAATACTT TAACnAATTG nTAGTTTAAT AGAATTTTA

(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2785 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:
TTTOCACCCA TCTGaTACAA TGCACCATGC GGTTTAACAT GATTAATTTT AACTTGATGA 60
ATGCGACAAA ACCCTTGTAA TGCACCTAAT TGATAAATCA TCAAATTATA AATCTCGTCG 120
TTAGAGATAT CTATATTTCG TCTGCCAAAG CCTTTCAAAT CAGGTAAACC AGGATGTGCA 180
CCTACTGCAA CATTATGTGC TTTGGCAAGT TTTACCGTTT CATTCATTAC ATTTTCATCA 240
CCAGCGTGAA AACCACAAGC AACATTCGCA CTTGTAATTA ACGGAATAAT TTGATGATCA 300
CCACCAAAGG AATAATTTCC AAATGCTTCG CCTAAATCAC AATTCAAATC AACTCGCATT 360
ATAATTCCAC CCCTTTAACA ATTTGATGTT TTTCTAAAAA TTTAATATCA ACATCTTTTG 
CATCTCCATC ACGATATAGT GGATAATTTA AAACTGCATA TAAAAAATCG GCAGTTGTAG

AAAATCCATC TATCACCATT TCATCTAAGG TGACTTTCAA CTTATCAATT GCTGAAGCTC 
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AACCGTGATA TAGTAAAGAA TCGACTCGCA CATTAAAGCC TTGAGGTAAA TGTAACGCTG 660 

TCACTTTACC TGGTGTTGGT TGAAATTTCT TTTCaGGATT TTCGGCATTT ATTCTCGCTT 720 

CTATCACATG ACCATTAAAT TGAATATCGC TTTGTGAAAA AGGTAAATGA TTATGTTCCA 780 

ATAAATACAG TTGTGCTGCA ACCAAATCAC GTTCTGCTCG CATCTCTGTA ACAGTATGTT 840
CAACTTGTAT TCGAGCATTC ATTTCAATAA AGTAATGTGC GGTATCAGTT ACTAAAAATT 900
CAATCGTACC TGCACTTCTA TAATTTGCTG CACGTGCAAC TTTAACAGCA TCGTTACATA 960
TTTGTTGTCG TCTTTCTTCA GTTAATGCTG CACAAGGAGA TTCTTCGATT AATTTTTGAT 1020
TTTTACGTTG TACAGAACAA TCACGTTCCC CTAAATGTAC ATAATTATCC TGCCCATCTC 1080
CCaTAACTTG AACTTCAACA TGTTTTGcAA CAGGTATAAA AGCCTCAACA TAAACACGAT 1140
CATCATCAAA GTATTTTTTT CCTTCACTTT TAGCTTCTTT AAATGCCTTT TCTAAATCTT 1200
CAGCTTTCTT TACAATACGT ATACCTTTAC CACCACCGCC ACTGGCAGCT TTGATAACAA 1260
CTGGATAACC GATGTCTTTG GCAAGATTCT CAATTTCAGA CACATGATTC ACAGCACCAT 1320
TTGATCCTGG AATCACAGGA ACACCTGCAT GATGAACTGT TTGTCTTGCT GTTATTTTAT 1380
CCCCCATCAT TTCCATCGTT TTTTTAGTAG GCCCTATAAA CGCTATGCCT TGTTCCTCAA 1440
CGGTTTGAGC AAATTTTGTT GATTCTGATA AAAAGCCATA TCCTGGGTGA ATTGCATTAG 1500
CACCAGTGAT TTGTGCAGCA GATATGATGC GGTCAATATT TAAATAACTA TCTAAAgCAT 1560
TArcvrTCCCC AATACATATA GCTTGATCTG CTAAATGTAC ATGCAAGCTT TGCTCGTCCC 1620
CTTTTGCATA AACTGCTACA GTTTCAATCC CATATTCTCT GCAAGCTCTT ATAATCCTTA 1680
CAGCAATTTC ACCTCTGTTC GCAATTAAAC AACGAAGCAT TTACTTACCC CCTTTACTTA 1740
ATACGTACCA AAACTTGGTC GTATTCAACA TTTGTGCCAT GATCAGCTAC TATTTCAGTA 1800
ATrtCTCCAG CAACATCTGT TGTTACCTCG TTTAATACTT TCATCGCTTC AACATATCCT 1860
ATAATATCTC CCTTGTTAAC TTTGTCACCG ACATTCACAA TTGGTTCAGT TAATTCTTTA 1920
CTATCTTGTA AAAAGAATGT ACCTATCATT GGTGATTTAA TGTCATGATA ATCATTTGTC 1980
GAAACATCGG AGTTATCATT CGCTTTTGAA GCTGTCAAAT CATTATTGTT CATACTTTGA 2040
TTTGATTGAT TACTGTGTGC AGCCAAATGA TTCGAGTCAG TGAAGTCAAT TTCTATTTCA 2100
TCTTCAAAAT TTTTATATTT AAATTTCTTA ACATCATTTT CCTTCACTAA TTTGATTATT 2160
TGTTCGATTT nTTCAATATT CATTTTACAA ATCCCCTTTT AAAATTGTTG CTAATTTTTT 2220
CGAAGTATGT CGCAAGCTAG ATGTATCAAA AATTGGAGTC TTTTGATGAC TCTTAAGAAT 2280
TTCATTAAAC AGAGACATTT GTTCCCGATT CTTATCTACA GCTTCTTGGA ATGATATCCA 2340
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TACAGTTGCA ATTTTGGTAT AACCACCTAT CGTTTGTTTA TCATTAAGCA GAATAATAGG 2460 
TTGACCATCA TTTGGTACCT GAACACTACC AAGAGCAACC GGTTCAGAAA TGATATCTGC 2520 
TTGATTAAtT GGTGCAACGC TGTCACCTTC CAAACGATAG CCCATACGGT CTGATTGTTC 2580
AGTAATTAAA TATGGATGAT TTACAATTTT CGCTCTAGCC TCTTCAGAAA ATGCCTCGAA 2640
TTGAGGTCCT TGAAGAATGT GTATAATATT ATTTTCTGGC AATAAATCGT CCTGTAAATG 2700
AATCGTCTTT CCAATGTTTT CTTTAAAGTC ATTATTTATT TTCACTGTTA TTACATCATC 2760
AGCTAATAAC TTTCTACCTT TGAAT 2785

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1010 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

AATGGAAACG GTTGAAACAG CAATTATTAC TATTTCTATG GGTGAAGGTA TTTCAGAGAT 60
ATTTAAATCA ATGGGTGCCA CACATATCAT TAGTGGTGGA CAAACGATGA ATCCTTCTAC 120
AGAAGATATC GTTAAAGTCA TTGAACAATC AAAATGTAAA CGTGCAATTA TTTTACCGAA 180
TAATAAAAAT ATCTTAATGG CAAGTGAACA AGCAGCGAGT ATTGTTGATG CAGAAGCTGT 240
TGTTATTCCA ACGAAATCTA TTCCTCAAGG TATAAGCGCA CTATTCCAAT ATGATGTGGA 300
CGCAACACTT GAAGaAAATA AAGCGCAAAT GGCTGATTCA GTAAATAACG TTAAATCTGG
TTCATTAACG TACGCTGTTC GTGATACGAA AATTGATGGC GTTGAGATTA AAAAAGACGC
GTTTATGGGC TTGATTGAAG ATAAGATTGT AAGCAGCCAA AGTGATCAAT TAACAACGGT
TACTGAGTTG TTAAATGAGA TGTTAGCAGA AGATAGTGAA ATATTGACTG TGATTATTGG 540
TCAAGATGCA GAGCAAGCAG TTACAGATAA CATGATAAAC TGGATCGAAG AGCAATATCC 600
AGATGTAGAA GTGGAAGTTC ATGAAGGTGG ACAACCAATT TATCAATATT TCTTTTCAGT 660
AGAATAAAAA TTTAAAATAA AAAACTACCA ATGATAAATC ATCAGTTGGT AGTTTTTTAT 720
TTTGCTATTT TAGTGATATT GCGGGTTAAA AGTATCGTTC TCGAGTTGCT AACAATGTCA 780
TGTTCAACTT AGTCATGATA AAATAAATAA CATACTAAAT GATACGTAAA ATCAAATAAA 840
ACATAGGTGA TTTATTTTGG CTAAAGTAAA CTTAATAGAA AGTCCATATT CTCTTTTACA
ATTAAAAGGT ATAGGTCCTA AGAAAATAGA AGTATTGCAA CAACTAAATA TTCATACAGT
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1540 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

TGTAGTTGAA CATGAACAAC AAAAGAAAGA AAAGACAAAA AAGCAATACA AGCCATTTTG 60 

GATTGTCATG AGTTTTATAA TACTTATAGT TGTACTATTA CTCCCGGCAC CTTCAAGTCT 120 

GCCGATAATG GCTAAGGCAG TACTAGCTAT TTwAGCTTTT GCAGTTATTA TGTGGGTAAC 180
GGAAGCTGTA TCATATCCGG TGTCAGCAAC TTTAATTATT GGCTTAATGA TATTACTTTT 240
AGGATTTAGC CCTGTTCAAA ATTTAGGGGA GAAGCTAGGT AATCCGAAAA GTGGCAGTGC 300
TATTTTAGCT GGAAGTGACC TTCTAGGAAC TAATCATGCA TTATCATTAG CGTTTAGTGG 360
ATTTGCAACT TCAGCTGTAG CTCTCGTTGC AGCTGCATTA TTTTTGGCTG CTGCTATGCA 420
AGAAACGAAT TTGCATAAAA GACTAGCTCT TTTAGTGTTA TCAATTGTTG GTAATAAAAC 480
TAGAAATATA GTTATTGGAG CAATTATCGT TTCAATTGTA CTTGCATTTT TCGTTCCTTC 540
TGCAACAGCT AGAGCAGGGG CAGTTGTACC AATCTTGCTG GGTATGATTG CGGCATTTAA 600
AGTTTCCAAA GATAGCAAGT TAGCGTCTTT ATTAATAATT ACTTCAGTAC AAGCTGTGTC 660
AATTTGGAAT ATTGGTATCA AAACGGCGGC AGCACAAAAT ATCGTAGCGA TTAATTTTAT 720
AAACCATCAA TTAGGATTTG ATGTTTCATG GGGCGAGTGG TTCTTATATG CAGCGCCTTG 780
GTCCATAGTT ATGTCCGTAG CTTTATATTT CATCATGATT AAAGTGATGC CTCCAGAAAT 840
TAATSCAATA GAAGGTGGTA AAGATTTAAT AAAAGAAGAA TTGCATAAAC TTGGCCCCGT 900
TAGCCCACGT GAATGGCGTT TAATTGTTAT ATCGATGTTA TTATTACTGT TTTGGTCAAC 960
TGAAAAAGTA TTACATCCGA TTGACTCTGC ATCCATTACT ATTATTGCTT TAGGTGTTAT 1020
GTTAATGCCG AAAATTGGTG TCATGACATG GAAACATGTT GAAAATAAAA TACCATGGGG 1080
AACAATTATC GTGTTTGGTG TAGGTATTTC ACTAGGTAAC GTTCTTTTGA AAACAGGTGC 1140
AGCTCAATGG TTAAGTGATC AAACTTTTGG TGTTTTAGGT TTAAAACATT TACCTATTAT 1200
CGCGACAATT GCACTTATCA CGCTTTTTAA TATATTGATT CATTTGGGCT TTGCGAGTGC 1260
AACAAGTTTA TCATCAGCGT TAATACCTGT TTTTATTTCG CTAACCTCTA CGTTACACTT 1320
AGGAGACCAG TCTATAGGAT TTGTTTTAAT TCAACAATTT GTTATTAGTT TTGGTTTCTT 1380
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AGATTTCTTG AAGGCAGGTA TACCATTGAC AATTGTAGGG aATAtctAgT GaTAGTTTTT 1500 
AGCATGACTT ATTGGAAATG GGTAAGGTTG CnTTAATTAA 1540 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:


ACTTCTCACA ATAAGAAATA TGAAATTGTT ATGTGTTAGT TGAGATTCAG TGATGAATTA 60
CTTTTATCAT TTAAAATGTT GTTATCATTG TCATGCGTTA CCAAATCGCT TACGTATACA 120
CGATTCCCAA TCTTAACATA GACGATTTGT ATATCAGAAT TTTCTGATTA CTAACAGTTT 180
ACCTAAGTTT AAATATCTGT TCAATGATTT TCAGTTATTT TTAAAAGAAA AATCGTAATG 240
CTGCCATGAT AACAATCCCA CTAATAATTG TAATAGTTAA AtACGCGTGA TTATAGATAA 300
AATAACCGTC GGAATGAGCG CGATAATGTA AGGGATGTTT AATGTATACC CCTCACCATG 360
AGGCGTCTGT TGAATAATGC TGTCAATGAC AAGTGCCGTA AATAGTGTGA TTGGGATAAA 420
TGATAGCCAT CGAACCACGA CATCAGGCAA TTGCACTTTT GAAATCATGA TAAAAGGTAT 480
AATTCGAATT AATAGCGTTA CGATACCACA CAATAAAATA AGTATTAACA TGTTCATATG 540
AGTTATCATT GTTCCATCAT CACTCCTAAC GCTGCTGAAA TTGTGGCTGC AATTAATATT 600
GCTAGATATG AAGGCATAAA CATACTTAGC GATAACATCA TTACTATGAC GGCAATAATG 660
AGTACTATGT AAATTCTTAA TCGCGATTTA GTAATTGATT CAAATTGCGC AATGGCCAAA 720
AAGATAAACA TAGCCGTGAT AGCAAAATCT AACCCTAGCG TTTGCGGATT TGAGATATAT 780
TCGCCAAATA AAGCCCCAGC TACACATGAA ATTGCCCAAA ATAAATATGC TGTGATGTTA 840
AGACCATGCA TCCAACGATC ATTGATAGCT TCTCCTTTTA AATAAGGTGT AATGGCGACG 900
CCAAACGTTT CGTCAGTTAC TAATGAACCT AATCCAACAC GGTTCCAAAA CCCATATGTC 960
TTGAAGTTTG GTGCAAGCGA CATACTTAAA AGGAACATTC TTGAATTTAC GATAAATACA 1020
GTTAGTACAA TCGCTGATAT AGGTGTACCT GCTATAAACA ACGCGCACAT AATAAATTGC 1080
GCAgcaCCGG CATATATAAC AAGACATAAC AAGACAATTT CTAAAATACT AAAGTTTTGA 1140
GACGAAGCCA CAATACCAAA TGAAATACCA ACACCGGCAT AACCCAATAA TGTTGGGATA 1200
CACTCTTGCA CGCCTTGTCT AAAACTTAAA TGTGTTGTCA TCTCAATTAC CTCCTTTGCC 1260
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TAAGCAATAA CATTAGACAT CAGTTTGTCT GAGGTTAGAC ATTCCGGAGT CTTTAGTCAG 13 80 

CTTCATATTA ACTTTTTATT TTTGAGAATT TTCAATTTTT TATTTAAGAC TACCTCCATA 1440
TTTTCTATGG aTTTGTAGTT GTTTTTAAGT ATCAATTTTA TAAATTTTTA TATCTGATGA 1500
TGAGTCTGGG aTATTGaTTC ATGTACCACT CCCTTaTaAT CATCCCCTCC CCCTaCCCTA 1560
CTCCATCGAT ATAACTCATA CTACATATCA ACGAAATCAG TATTTTATCG CTTCCTTTCC 1620
TATATTAGTG ATGCTCAAAC TTGTTACGTT TTAGATTGTT TTAGTTCATC ATAATTATCC 1680
CGTATTGTTG CTATAATGAA ATGCGTTCAC CCCATTAAAC CACAAACTTA ATTTATTGTT 1740
GTTATGTGCA TTGGCTCACT ATTATATTTT TACAGCACAA AAAAAGTGGC GACAGTTCGT 1800
CACCACTTTT TAAAATATTA TTTAAAGTAT CTTGCCCTTG CTTTAAGTAT ACGTAGATAT 1860
ATACTTTTTA AAGCTTGTAG CTAAAGCCTT TATTTAACTG GTTTTGAAAT TTGTGTTTTA 1920
CCACCCATAA ATGGTACTAA TGCTTCTGGA ATTGTTACTG TTCCATCTTC ATTTTGGTAA 1980
TTTTCAACAA TAGCAGCAAA TGTACGTCCA ACTGCTAAAC CACTACCATT TAATGTATGT 2040
GCTAATTCTG GTTTAGCTGC TTTGTCACGC TTGAAGCGGA TGTTAGCACG ACGCGCTTGG 2100
AAATCCGTAC AGTTTGAGCA TGAACTAATT TCTTTATAAT CATTGTAGCT TGGTAACCAA 2160
ACTTCTAAAT CATATGTTTT GCTTGCACTA AATCCAATAT CACCTGTACA TAAAATAACA 2220
CGACGGTATG GTAAACCTAA CTCTTCTAGA ATTGCTTCTG CGTTTGTTGT CATTTCTTCT 2280
AAAGCATTCC ATGAATCTTC AGGTTGTTCA AAACGTACCA TTTCCACTTT ATCGAATTGA 2340
TGTAAACGAA TTAATCCTCT TGTATCTCTA CCTGCTGATC CTGCTTCACT ACGGAAACAT 2400
GCAGATTGAC CAGTGAATTT TTCAGGAAGT ACACCTGGTT GAATAATTTC ATTACGGTAG 2460
AAATTCGTTA ATGGTACTTC AGCAGTTGGA ATTGTATATA ATCCTTCTTT TTCTACTTTA 2520
AAT^ATCTT GTTGAAATTT AGGTAATTGA CCTGTACCAT ACATTGTATC TGCGTTCACA 2580
AGCTGTGGTA CCATCATTTC TGTATAACCA TGTTGTGTTG TATGTTTTGT AATCATATAG 2640
TTCATTAAAG CACGCTCTAA TTGCGCACCT TCATTTGTTA AATATACAAA ACGCGCACCT 2700
GAAACTTTTG CTGCACGATC AAAATCAGCC ATTTTCAATT CTTCTACAAT ATCCCAATGT 2760
GCTTTGGGTT CAAATGAAAA CTCaCGTGGT GTACCCCACT TTTTAACTTC AACGTTATCT 2820
TCATCAGATT CACCTTGAGG TACATCATCA CTTATTAAAT TTGGAATACG ACAAAGGATA 2880
CCTGTCATTT TATTATCAAT TTCATTTAAT TGACTATCTT TTTCTTTAAT ATCGTCACCT 2940
AATGTGCGCA TTTCAGCAAT CACATCATCA GCATTTTCTT TATTACGTTT TTTTAATGCG 3000
ATTTCTTCGC TTACTTTATT ACGACGTGCT TTCATTTCTT CTGTTGCACT AATTAATTTA 3060
TCAATTTTGC TCTTAACTGT GTCAGGCTCA TTTCTGAATA ATCTAATGTC TAACATTAAC 3180
CTTCATCCTT TCCCAAATAA TTATCATTTA TTATGGAATG ACGTACGTCT TTATTTTTTA 3240
GAAAATAAAA AAAGACCACA TCCCTACAAG GGACGTGGTC TACGCGTTGC CACCCTATTT 3300
AACAATTTAA GTTATAAAGA TACACTAAAC CTAAATTGCA CTTCACTAAA ATAACGGTTA 3360
TCACCGATTG TTCTTTTAAA TTAAGTAGGT AGATTCATAT ATATGTTGAT TCTTGTTCAC 3420
ACTAACCACA AGCTCTCTGA TATCGAACAC TATATATTAC TTGTCCTACG AACAATGTCT 3480
TATTAAGTTA TTTTTAATAT AGCAAACTAT ATTTGC1"I"IT TCAAGTAACG ATTTCAAACA 3540
TCACTCATGT CGATTTAGTG ACATGCAGTC GTTTGATAAA TTGATTGCTT TAAATACTGT 3600
GCAACCGCTT CAATATCTTT ATGAAATTGA CGATCATGTG TAATGGATGG CACGATACTT 3660
CGAAACTCAT CATACTTGCG ACGTGTTTTT GGTGATAATC CTTCAACACC TTTTAAC'TCT 3720
GCTGCTTGTA ATGCAATAAC ACATTCGATT GCCAGCACAC GTCTTGCATT TTCAATAATT 3780
TGATAACCAT GTCTAGCAGC TGTAGTTCCC ATAGATACGT GATCTTCTTG GTTCGCAGAT 3840
GAAGTGATAG AATCAACACT CGCTGGATGC GCTAAAGTTT TATTTTCAGA AACGAGAPTT 3900
GCAGCAGCAT ATTGCATAAT CATCGCGCCA CTTTGCAATC CTGGCTCTGG ACTAAGAAAT 3960
GCTGGTAAAT CACCATTTAA TTGAGGATTT ACTAGTCGCT CTAGACGACG 4020
TTTGCTAATT CACTTACACC TAATTTAAGA TGATCTAATG CAAAAGCAAT AGGTTGTPCA 4080
TGGAAGTTAC CACCTGAAAT AACAAACGTT TCATTTGCTT CCTCAAATAT AAGTGGATTA 4140




TCATTAGCCG CATTCATTTC AAATTCTAAT TGCTGTTTAA CATAATTGAA TACTTGAAAA 4200
CTCGCGCCAT GGATTTGTGG TATACAACGC AACGTATATG CATCTTGTAC ACGTATTTCT 4260
GATTGTCGCG TCGTTAATGT TGATCCTTCT AACCAATCAC GCATACGCGC TGCCACATTA 4320
ATCT6TTCTT GAAAATTACG AACTGCGTGC ACATCATGTC GATATGCATC TATAATGCCA 4380
TTAAGAGACT GATGCGTTAA TGCAGCAATC CATTCAGATT GGTAACCTAA ATCTTCTGCT 4440
TCTATATAAC TAATGACACC TTGAGCTGTC ATAGCTTGCG TACCATTAAT CAATGCTAAA 4500
CCTTCTTTAG CCTGAAGGTT CAAAGGTTGT CTATTTAATT CTCTTAATAC ATCGTCACTA 4560
TCCTTTTCTT CCCCTCTGTA CAATACTTTC CCTTCACCAA TTAATGCTAA TGCTAAATGT 4620
GATAATGGCG CTAAATCTCC TGATGCACCG AGAGAGCCTT GCTGTGGGAT TATCGGTATA 4680
ATACGTTCAT TTATAAAAAA TTGTAATTGT CTCACTAATT CTAAAGTGGC ACCTGAATGA 4740
CCTTTTAATA ATGTATTCAA TCGTAAAATC ATCATGACTA ATGCTACTTC TTTTGAAAAT 4800
GGCTCACCTA GTCCACAGGC ATGTGAGCGT ATCAGATTCA CTTGTAATTC ATTATATTGC 4860
TCCTCATTTT CAATAATACG TTCAACTACC GCTCTACTTT TTTTGACACG TTCTAACGCA 4980
TCATCAATAA TTTCAATCTT TGATTGTTGT TGTAAAAATG ATTTAATATC CTCAATTGTT 5040
AGTGTTTCAC CATCTAAATA TAAAGTCATA TATGTTACCC CCTTGTTTAT ATTAAGTAAC 5100
CCATCCTTCT TGAAGTATAC GTTTTCATTT TTATTGAAAC AATGGTTTTA CGTACATTTA 5160
TAACCTATTA TCAGAGCACT ATTGTAGTGC GTTAAAGGAT ATTAAGATTG TTGTAAGCAT 5220
ATTTAATAAT TTATCTATTG ACGAATTGCA TATACAGGTA TAGTATTTTC TATTGTATTT 5280
AACGACAAAT AATAATGAAT TCAGAAATTT ATAATACATT TTGTTAAAAG TTACTATATA 5340
TTTTTAAAAT TGAATAAATT CGGAAAAGGC TTTTACATGG GAGGTTATAT CACTATGGAA 5400
ACGTTAAATT CTATTAACAT TCCTAAGCGT AAAGAAGATT CACATAAAGG TGATTATGGC 5460
AAAATTTTAT TAATTGGTGG ATCTGCTAAC TTAGGTGGTG CCATTATGTT AGCGGCTCGT 5520
GCATGTGTAT TTAGCGGTAG TGGTTTAATC ACTGTAGCTA CACATCCAAC AAATCATTCA 5580
GCATTACATT CTCGTTGCCC AGAAGCGATG GTTATTGATA TTAATGATAC GAAAATGTTG 5640
ACGAAAATGA TTGAAATGAC TGACAGTATA CTAATTGGTC CAGGTCTTGG CGTTGATTTC 5700
AAAGGAAATA ATGCCATTAC ATTCCTACTA CAAAATATAC AACCGCATCA AAATTTAATC 5760
GTAGACGGCG ATGCGATTAC AATCTTTAGT AAACTGAAAC CGCAATTACC TACATGTCGT 5820
GTGATCTTTA CACCACACCT CAAAGAATGG GAACGATTAA GTGGTATTCC TATTGAGGAA 5880
CAGACATATG AGCGTAATCG TGAAGCAGTT GATCGTTTAG GTGCAACTGT TGTACTTAAA 5940
AAACATGGTA CTGAAATTTT CTTTAAAGAT GAAGACTTTA AATTGACAAT CGGTAGCCCA 6000
GCAATGGCGA CTGGTGGTAT GGGCGATACA CTTGCTGGTA TGATTACAAG CTTTGTCGGT 6060
CAATTTGATA ACTTAAAAGA AGCGGTTATG AGTGCCACAT ATACACATAG TTTTATTGGC 6120
GAAA£CCTTG CAAAAGATAT GTATGTGGTG CCACCATCAA GACTTATCAA TGAAATACCT 6180
TACGCAATGA AACAATTAGA AAGTTAGTCA TTACTAATCA TTGAATATAG TAAAGCATTA 6240
CTTTCTAGCA TAAAAATAAG ACTCCCCTAC ATATAGGGAA GTCTTATTTT TTATTATTCT 6300
TCATCTGATG ATTGTTGTAT ATCTTCTTCA ACACGATCCA TGAAATCTTG TCTTACTTCA 6360
ATACGTCCAT CTTCATCATT TTCTTCTGAA TCAATCACTT CAGTATGAAT TGCATTTCCT 6420
GGTGTTTCAT CATTTaCAAC CGCTTCACGT TGTTGTTCAG TACCATCTTC AGATACAGTT 6480
GAAGTAGATT GCTCATCTTC ATTCGTTTCA TCTTCTGCAT CTTCTTTTAC TTTAGCAACC 6540
GTTGAAACAA ATTGATCATC ACCTAAGCGA ATTAAGCGAA CACCTTGTGC TGCACGACCA 6600
TTTTGAGAAA TATCTGCAAC ATCTAGTCGA ATAATGACAC CTGCATTAGT AACAATCATT 6660
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GTAGCTGTTT TAATACCTTT 


ACCACCACGA TTTGATAAGC GATAGTCATT AACTGGCGTA 


_ 

6780 




CGTTTACCAT 


' AACCATTTTC 




AATACTTCAT CAACACTGTT TG GATGAGC T 


6840 




ACATCAAGCC 


CTACAACTTC 


X V_HrVV_V_ X X V 


CGAAGTGTAA TACCTTTCAC ACCCGTTGCT 


6900 




GTACGGCCTA 


AAGGACGTAA 


xox iVjAI X LA 


GGGAATCGAA TTAATGATGC ATGTGATGTA 


6960 


10 


CCAATCAAGA 


TATCTTCITH 


n\L.v-n\— X lull 


AAGCGAACTG CAATTAACTC ATCATCTTCT 


7020 




CTGAACGAAA 


TCG CAATCTT 


RpplTTTPTA 
\~r\ X i ILln 


TTTATTCTTG AGAAGTTACT TAATGCTGAA 


7080 




CGTTTAACGA 


CACCACGTTT 




ACTAAGAAGT TGTCTTCACT TTCAAGGTCT 


7140 


15 


TTAACAGCAA 


TCATTGT A PT 


/vi X <aHL X X LA 


TCATTTTCAA GTTCAATAGC ATTCACTACA 


7200 




GGAATACCTT 




x W\i/\/\L 1 \JA 


GGCACTTCGT AACCTTTAAG TTTGTATACA 


7260 




CGACCTTTGT 


TAGTAAAfS A A 


LAA1AUA 


TCATGTGTAC TTAAAGTTAC CAATTGACTG 


7320 


20 


ACAAAATCTT 


x x ^— rvrlXVj X 


Ax i^-AIAC-CT 


TGAACACCAC GACCACCACG GTTTTGAGCA 


7380 




CGATATGTAG 


ATACCGTIP A A 




TAGTTATTAT GGCTTAGTGT AATTACTATT 


7440 




TGTTCTTCTG 


RAATT 2V finTP 

VJ/V\ X X xtMla X ^» 


TTCGTCCTCT 


AAGTCTTCAA ATCCACCTAA TTGAATTTCT 


7500 




GTACGACGAT 


CATrArmii a 

x v-t\^v**jAA 


ACGATCXCTA 


ATTTCAGTCA ATTCATCTCT AACTAACTGT 


7560 




AATAACACTT 


V X X x V__/A»J v_ 




TCTAATTCAC TAATATAATT TAATAACTCA


7620 




TTATATTCA d 


\- X X Lhh X 1 X X 


GTCTCTCTCT 


AAACCTGTTA GACGTCTTAA ACGCATGTCT 


7680 




AAAATAGCTT 


VJ.*"V^JV_ X x O 1 1 X 


X 1 l_Ai? AAAb I 


TTGAAGCGTT GTTGCAAGCT TTCCATTGCA 


7740 




ACTTTATCTG 


TATrTnarrr 

i 1U1 VjnL 1 \^ 


ALvjAA I L-Vj x 1 


GAAATAATTT CATCGATATG GTCAAGTGCG 


7800 




ATACGTAATC 


CTTCTAAAAT 


f3 Tr^irs P A a 

X VjVtVj *_AL-VjA 


1 L. 1 1 1 A(jCTT TACGTAAgTT GTATTGCGTA 


7860 




CGTCTTCTAA 


v^nnw x vj x w X X 


A XVxAX uL 1L1 


AAATAATGTA CCAACGCTTC TTTTAAATTA 


7920 




ATAAGCTTCG 


GTCTACHATT 


X AV^AAVj 1\3\JA 


ATCATATTCA CACCAAATGA TGTTTGAAGA 


7980 




GGTGTTTGTT 


TGTATAAnTT 

X VJ X f \ X /Tk/A\J X X 


AX ± XAAAAlVj 


ACACTAGCAT TTGCATCCTT ACGCACATCA


8040 




ATAACGACAC 


MW\wt^^nVj X 


7\ PV"2 r P 71 71 71 PTT 
X AAAL X X 


GmC^TCAC GTAAATCAGT GATACCGTCA 


8100 




ATTTTCTTGT 


CA rvs a a rvi iv n. 


v- X LTbCAAll 


TTTTCAATCA TACX5AGCCTT ATTCACTTGG 


8160 




AAAGGAATTT 


CAGTGACAAC 


AATACGTTGA 


CGTCCGCCTC CACX5TTCTTC AATAACTGCA 


8220 




CGAGAACGCA 


TTTGAATTGA 


ACCACGACCT 


GTTTCATATG CACGTCTAAT ACCACTCTTA 


8280 




CCTAAAATAA 


GTCCAGCAGT 


TGGGAAATCA 


GGACCTTCAA TATCCTCCAT TAACTCAGCA 


8340 




ATTGAAATAT 


CAGGGTTCTT 


ACTTAAGCTA 


AGTACACCAT TGATTAATTC TGTTAAGTTA 


8400 




TGTGGTGGAA 


TATTCGTTGC 


CATACCTACC 


GCX3ATACCTG ATGCACCATT GGCTAATAAG 


8460 
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AAATCTATTG 
GCTTCAGTAT 
CCATCAACAA 
ATAGATGAGT 
TTTTTATATG 
TGTACTGGTT 
GCATAATCTA 
TGAGGTAATT 
TCTAAGTTTG 
CCCATTAACA 
AGAGCGCGGT 
AGACCTTTGT 
TCAAGTTCCC 
GGTGGCTGTG 
AATGTTAATA 
ATTTTGTGAT 
GTGATCATTT 
TTTAATATCT 
GTAGACCCCC 
CTAGAGCAAT 
GTTACTTCAC 
ACCACTGTAC 
AATTTATCTA 
AATTGAGGAT 
CCAGAAAGTC 
ACACGCGTTA 
TTATTTGCGT 
ACTTCAATAT 
TTTTCGTTCA 



TATCTTTATT 
AACGCATTGC 
GCGGATAACG 
CACCATGAGG 
ATTTATCCGG 
TTAAACCGTC 
AAAATGATTC 
CAGCCATCAA 
CATAAACTGC 
TTTCAAATGT 
GCTCAGGGTT 
ATCGTGCAAT
TATCATTGTA 
CAATATACAC 
ACAATGTTCT 
ATCTTGCTTT 
GACGAATTTC 
TACCTCGTAA 
CGGCAGAGTC
CGGCTAATTT 
GOGCTTTTTT 
GTGCGACTTG 
CAACTTGACG 
CACCATGTTT 
TATCTTTTTC 
ATGCACGTTT 
AAGTTAAAAG 
CATCTTTAGA 
ATAACTCAAC 



AATATCACGT
TGCTGCGCCA 
ATAACTGAAA 
GTGATATTTA 
TGTCATACCT 
ACGAACATCT 
ACGCATTTCA
GAGTTCCTCC 
ATTATCTTCT 
TTGGTCCGCT 
CATTGTTGTT 
AGACCATTTT 
TACATAATAC 
ATAGCCTGCT 
AATATGCGCT 
CGCTAGATCA 
ATTGTTATTC 
TGGTAAAATC 
CCCTTCGACT 
ACCTGGAAGG 
CGCAGCAACA 
TGGATTTTCA 
CACTTCAGAA 
GATAGATATA 
TTCTTICATA 
GAATCCGTCT 
ATTTGTGGCA 
TTGATGAATA 
GTACGATTTA 



AACAGTTCAA 
TCTCCATCCA 
TCTTGAGCCA 
CCCATTACGT 
TGTTCATTTA 
GGCAATGCAC 
CTGGTAATAT 
TTCAAAAGTT 
ATAAATTGTC 
TCAATCGCAT 
TCCCAtAATT 
GGTGTTGGAT 
TTTTGTTTAC 
TCAATTAACG 
CCATCCACAT 
AAGTCGCCAC 
AAAATTCTAT 
GCCTGCGTTC 
AAGAAAATCT 
CTTGCTACAT 
CGTGCACGTG 
TATAAAAATC 
TTACCTAATT 
ATTGCTGTCA 
ATCTTGCTAC 
TCATGCGTAC 
TATCCTGAGT 
TAAATTGGCT 
ATACCGCCCT 



GTGTGATTTT 

TTGAACCAAA 

TACGTACCAT 

CACCAACGAT 

ATCCATATAG 

GAGCAACGAT 

TTCGTTCATT 

CAGTTCACAG 

TACGGTTTTC 

CTTCAAGTTT 

GATCTGCATT

TCAATTCAGA 

CTTGTGTCAG 

GTCTCATAAA 

CGGCATCAGT 

CGATTCCTGT 

CTAATCGTGC 

TAGAGTCACG 

CACATTCTTC 

CTAACGCTGA 

CCGCCATAAT

GTTCAAAGTG 

TTGTCTTCGT 

TACCTTCACG 

TTAAACCATA 

CACCTTCATA 

TATATTGAAT 

CATCATGAAT 

CATAGTGATA 



AGTCATACGC 
GTTACCTTGG 
TGCTTCATAA 
ACGTGCTGAT 
TATACGACGA 
AACACTCATC 
TATTCTTGAT 
CGCTTAGAAG 
TACAACGTCA 
TACTTGTAAA 
CATTTCTCCA 
TTTAAGTTTA 
TTTATACAAC 
TCGATAGAAG 
CATAATGACG 
ACCAAATGCT 
TTTTTCAACA 
ACCAGATTTT
AGGACTTTTA 
TTTACGACGT 
ACCTTTTTCA 
CTCTGAGAAT 
TTGACCTTCG 
TGTATCTTCA 
ACTATTTAAG 
CGTATGAATG 
CGCAATTTCT 
AGGTTCTTTA 
GGAGTCTTCT 



8580
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
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GCAAGCTCTC TAATACGCTG CTGTAATGTT TCATAGTTGT ATACAGTTGT CTCTGTGAAG 103 80 

ATTTCTCCAT CTGCTTTAAA ACGAAtGaCA GTACCTGTCT TAtCAGTnGT GCCAACTTCT 10440 

TTTAAGTCAA ATTGAGGTAC ACCTTTTTTA TATGCTTGAT GATATATAGT CTCATTTCTG 10500 

TGTACATATA CTTCTAAGTC TTGTGACAAT GCGTTTACAA CTGATGAACC AACACCATGT 10560 

AAACCACCAG ATACTTTGTA TCCGCCACCG CCAAATTTAC CACCAGCATG TAAAACAGTT 10620

AAAATAACTT CGACAGCTGG ACGTCCCATT TTTTCTTGAA TATCAACTGG GATACCACGT 10680 

CCGTTATCCG TTACTTTAAT CCAGTTATCT TTTTCAATAA CAACTTCAAT TTGATTTGCA 10740

TAACCaGCTA ATGCTTCATC GATACTATTA TCGACAATTT CCCACACTAA ATGGTGCAAA 10800 

CCTCTCTCTG AAGTCGATCC TATATACATA CCTGGTCTTT TACGTACTGC TTCTAAACCT 10860 

TCTAATACTT GTATTTGCCC AGCACCATAA TTATCCGTGT TGTTTACATC TGACAATGCA 10920 

GTCACCATCG CTTTCTGTTA CTTTATAATT TCACCTTGAT TAATACGATA CAATTTAGCG 10980

TTATTCATGA TTTCATGATC AATACCATCT ACAGATGTCG TAGTGACAAA TGTTTGTACT 11040

TTATGCTGAA TCGTACTTAA TAAATGCGTT TGACGCGAAT CATCTAATTC ACTGAGTACA 11100

TCGTCTAATA ATAAGATGGG ATATTCCCCA ACTTCGATAT TCATTAACTC AATTTCAGCT 11160

AATTTAATGG ACAAAGCCGT TGTACGTTGC TGTCCTTGAG AACCATATGT TTGAGCATCC 11220

ATGCCATTCA CATCAAAACT TATATCATCT CGATGTGGTC CGAATAAGCT AATGCCTCGT 11280

TCTTTTTCTC TTTGCATATT ATCGCTAAGA ATAGACATAA TTTCTTCAAG TCGTGCCGCT 11340

TCATTTTGAG CATAATCAAA TTTAAGACTA GGTAAATAAT TCAGCGACAA CGCTTCTTTA 11400

TCATTTGTGA TACCAGCATG AATCGGTTTA GCTAACGACT CTAGCTCTTG AATAAAATGT 11460

GCACGTTTAT CAGTTACTTT CATTGCATAT TCAGCAAACT GCTGATTTAA TACTTCCAAC 11520

ATTGTTAAGT CCTTTTTTTG GCCTAATTGT AACTGCTTTA AGTAATTATT CTTTTGCTTT 11580

AAAATACGTT GGTATTGAGC TAAATCATTT AAGTAAACAG CAGAAATTTG GCCCAACTCC 11640 

ATATCTATAA AGCGTCGTCT TATTtGrGGr GAGCCTTTTA CAATATTCAA ATCTTCTGGC 11700 

GCAAATAGAA CCACATTGAG GTGTCCAATA TATTGAGTTA GACGACTTTG CTCTAAGTGn 11760

ATTCACTTTG GACTTGTTTA CCTTTnTTAG TTATAAACAT TGTTAATGGG CATCGTGCCG 11820 
TGT 
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(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:






ATAATTATTA 


ACATGGTGTG 


TTTAGAAGTT 


ATCCACGGCT 


GTTATTTTTG 


TGTATAACTT 


60 


AAAAATTTAA 


GAAAGATGGA 


GTAAATTTAT 


GTCGGAAAAA 


GAAATTTGGG 


AAAAAGTGCT 


120 


TGAAATTGCT 


CAAGAAAAAT 


TATCAGCTGT 


AAGTTACTCA 


ACTTTCCTAA


AAGATACTGA 


180 


GCTTTACACG 


ATTAAAGATG 


GTGAAGCTAT 


CGTATTATCG 


AGTATTCCTT 


TTAATGCAAA 


240 


TTGGTTAAAT 


CAACAATATG 


CTGAAATTAT 


CCAAGCAATC 


TTATTTGATG 


TTGTAGGCTA 


300 


TGAAGTTAAA 


CCTCACTTTA 


TTACTACTGA 


AGAATTAGCA


AATTATAGTA 


ATAATGAAAC 


360 


TGCTACTCCA 


AAAGAAACAA 


CAAAACCTTC 


TACTGAAACA 


ACTGAGGATA 


ATCATGTGCT 


420 


TGGTAGAGAG 


CAATTCAATG 


CCCATAACAC 


ATTTGACACT 


TTTGTAATCG 


GACCCGGTAA 


480 


CCGCTTTCCA 


CATGCAGCGA 


GTTTAGCTGT 


GGCCGAAGCA 


CCAGCCAAAG 


CGTACAATCC 


540 


mTTATTTATC 


TATGGAGGTG 


TTGGtTTAGG 


aAAAACCCAT 


TTAATGCATG 


CCATTGGTCA 


600 


TCATGTTTTA 


GATAATAATC 


CAGATGCCAA 


AGTGATTTAC 


ACATCAAGTG 


AAAAATTCAC 


660 


AAATGAATTT 


ATTAAATCAA 


TTCGTGATAA 


nA 






692 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:




ATACTGTAGC 


GCAAATTTCA 


CAATGGCATG 


TTATAGAAGA 


TTTAGTTACG 


AATGAATTAG 


60 


GTATTAGTAT


TTTACCAACA TCAATTTCAG AgCAACTAAA 


TGGAGATGTG


AAGCTGtACG


120 


CATTGAAGAT 


GCTCATGTAC 


ATTGGGAATT 


AGGTGTTGTT 


TGGAAGAAGG 


ATAAACAATT 


180 


AAGTCATGCC 


ACAACGAAAT 


GGATAGAATT 


TTTGAAAGAC 


CGTTTAGGCT 


AACATATTAA 


240 


TAAAGCACTC 


ATTATTTAAG 


GCGCATCATT 


ACGTGGGTCA 


TTGAAATAAT 


gagtgttttt 


300 


TTGTGAAAAT 


GAAGTGAAAT 


TTAGAGAGCG 


TTTCCATAGA 


AAATAGTAAT 


ACAAACTATA 


360 


AAAAAAGAGT 


ATTTTTATAT 


TGTGTACGCC 


ATCTTTATAA 


TAGTTATTGT 


AACAATTTAG 


420 


ACATATTTAG 


AAAGGGATGG 


CGCCATGCAC 


AAAGTCCAAT 


TAATAATCAA 


ACTACTACTA 


480 


CAACTAGGAA 


TCATCATTGT 


GATTACTTAT 


ATTGGCACAG 


AAATTCAAAA 


GATTTTTCAT 


540 
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ATTGTACCGC TAACTTGGGT AGAAGACGGT GCAAACTTTT TATTAAAGAC GATGGTCTTT 660 

TTCTTCATAC CGTCAGTTGT AGGtATTATG GaTGtgCTTC CGAAATTACG CTAAATTATA 720

TACTCTTTTT CGCAGTCATT ATCATAGGAA CATGTATCGT TGCATTATCT TCAGGTTATA 780

TTGCTGAAAA AATGTCyGtT AAACwTAAAC ATCGTAAAGG TGTAGACGCt TATGAATGAT 840

TACGTGCAAG CCTTATTAAT GATTTTGTTG ACTGTCGTTT TATATTATTT CGCTAAAAGG 900

TTACAACAAA AATATCCGAA CCCATTTTTG AATCCAGCAT TAATTGCATC TTTAGGAATT 960

ATTTTTGTCT TACTTATCTT TGGAATTAGT TATAACGGGT ATATGAAAGG TGGCAGTTGG 1020

ATCAACCATA TTTTAAACGC AACGGTCGTA TGTTTAGCGT ACCCACTTTA TAAAAATAGA 1080

GAGAAAATTA AAGACAATGT CTCTATCATT TTTGCAAGTG TATTAAcTGG CGTCATGCTG 1140

AATTTCATGT TAGTGTTCTT AACACTTAAA GCATTTGGCT ATTCTAAAGA CGTCATTGTA 1200

ACGTTATTGC CCCGATCTAT AACAGCCGCA GTAGGTATCG AAGTGTCACA TGAACTAGGT 1260

GGTACAGATA CGATGACCGT ACTTTTTATT ATCACAACGG GTTTAATCGG TAGTATTTTA 1320

GGTTCGATGT TATTAAGATT TGGAAGATTT GAATCTTCTA TCGCCAAAGG ATTAACGTAT 1380

GGGAATGCGT CACATGCATT TGGCACAGCT AAAGCACTAG AAATGGATAT TGAATCCGGT 1440

GCATTTAGTT CAATTGGGAT GATTTTAACT GCAGTTATTA GTTCAGTGTT AATACCTGTT 1500

CTAATTTTAT TATTCTATTA ATTTAGATAT TTAAAATGAT AGACAGAAAG GGAGGCTATT 1560

AGTAATAATG GCAAAAATAA AAGCAAATGA AGCATTAGTT AAAGCATTAC AAGCaTGGGA 1620

TATAGATCAC TTGTATGGTA TTCCAGGAGA CTCAATCGAC GCATAGTCGA TAgTTTACGT 1680

ACAGTGAGAG ATCAATTTAA ATTTTATCAT GTACGTCATG AAGAAGTAGC AAGCTTAGCG 1740

GCTGCTGGTT ACACAAAATT AACTGGTAAA ATCGGTGTGG CATTAAGTAT CGGTGGCCCT 1800

GGTTTAATTC ATTTATTAAA TGGTATGTAT GATGCCAAAA TGGATAATGT ACCGCAATTA 1860

ATATTATCTG GACAAACGAA TAGTACAGCA CTTGGAACGA AAGCATTCCA AGAAACAAAT 1920

TTACAAAAAT TATGTGAAGA TGTAGCCGTT TATAATCACC AAATTGAAAA AGGTGACAAT 1980

GTGTTTGAAA TCGTTAACGA AGCAATTCGT ACGGCATATG AACAAAAAGG TGTAGCTGTT 2040

GTTATTTGTC CTAACGACTT ATTAACTGAA AAAATTAAAG ATACAACGAA TAAACCAGTA 2100

GATACATCAA GACCAACAGT AGTATCACCA AAATATAAAG ACATCAAAAA AGCGGTTAAA 2160

CTAATTAATA AAAGTAAAAA GCCTGTCATG TTAATTGGTG TAGGTGCGAA ACATGCGAAA 2220

GATGAGCTAC GTGAATTTAT TGAAATGGCT AAAATTCCTG TCATTCATTC ATTACCAGCT 2280

AAAACAATCT TGCCGGATGA TCATCCATAT AGTATCGGtA ACTTAGGTAA AATCGGTACC 2340
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LLAlAlu 1 vjo 


ATTACTTACC


TAAGAAAAAT 


ATTAAAGCCA 


TTCAAATTGA 


CACAAATCCT 


2460 




& & & A. TAT'IA'TW"* 


GACATCGTTT 


CAATATTAAT 


GTAGGAATTG 


TTGGAGATAG 


TAAAATTGCG 


2520 






TAACTGAAAA 


TATTAAACAT 


GTTGCTGAAA


GACCATTCTT 


AAACAAAACG 


2580 




X X MvjHrtLAj X A 


AAGCGGTTTG 


GGATAAATGG 


ATGGAACAAG 


ATAAAAATAA 


TAATAGTAAA 


2640 






O TV ^"1 TV TA TV llOtl 

CAGAACGATT


AATGGCATCA 


ATCAATAAAT 


TTATTAAAGA 


TGATGCAGTG 


2700 






ATGTAGGTAC 


AGCAACAGTT


TGGTCAACTC 


GATACTTAAA 


CCTTGGTGTA 


2760 




AA 1 AALAAviT 


TCATCATTTC


AAGTTGGTTA 


GGTACAATGG 


GTTGCGGTCT


TCCAGGTGCA 


2820 




Al luLAiLAA 


TV TV TV * 1 wtV^ m 

AAATTGCATA 


TCCAAATAGA 


CAAGCCATCG 


CAATTGCTGG 


TGACGGTGCA 


2880 






TAATGCAAGA 


CTTCGCTACA 


GCAGTACAAT 


ATGATTTACC 


TTTAACTGTA 


2940 




TTTGTACTTA 


ATAACAAACA 


GTTAGCATTT 


ATTAAATATG 


AACAACAAGC 


AGCTGGTGAA 


3000 




TTAGAATATG 


CAGTTGATTT 


TTCTGATATG 


GATCATGCAA 


AATTTGCTGA 


GGCAGCAGGT 


3060 


GGTAAAGGTT 


ATACAATTAA 


GAGTGCTAGC 


GAAGTAGATG 


CTATAGTCGA 


AGAGGCATTA 


3120 




GCACAAGATG 


TACCAACGAT 


TGTAGATGTA 


TATGTTGATC 


CTAATGCTGC 


GCCATTACCA 


3180 




GGTAAAATTG 


TAAATGAAGA 


AGCGCTTGGT 


TATGGTAAGT 


GGGCATTTAG 


ATCAATTACT 


3240 


TV TV TV ^»TV TA TV 

GAAGATAAAC 


ATTTAGATTT 


AGATCAAATT 


CCACCAATTT 


CAGTGGCAGC 


AAAACGTTTC 


3300 




' 1 " 1 1 TV q»K ft /' TV 

TTATAACTGA 


TTTAAAGGTT 


ATCACAATTG 


AATTGAACTA 


TAAAAACGGT 


AATTTCTATT


3360 




•P^TV TV /"» TV TV TV TV *V 

TCAACAAAAT 


GGGAATTGCC 


GTTTTGTTTA 


TTTATCACAA 


ATGATCGTAC 


TGAATTGATG 


3420 




TV *p Jv TV TV TV wtv-irn 

ATAAAATTGT 


GAAAAAGTTG 


TTGAAAACGC 


TTTTACAAAT 


ATGTATAATA 


GCTATGAATT 


3480 




AGATATCALi 


TGCGTGTTAC 


TGGTAATGCA 


GGCATGAGCA


AACAACCGCA 


CTATGAGAAT 


3540 




AGTCTTGTTT 


GTTCATGCCT 


GCTTTTTTTG 


TACATGGAAG 


CGGAAATTGA 


GATAGGGGAT 


3600 




GTTTATATGT 


TTAAGAAATT 


GTTTGGACAA 


TTGCAACGTA 


TCGGTAAAGC 


ATTAATGTTA 


3660 





CCTGTTGCGA 


TTTTACCAGC 


AGCTGGTATT 


TTATTAGCGT


TTGGTAACGC 


AATGCACAAC 


3720 




TV TV O TV TV nwp tv 

oAACAATTAG 


TAGAAATTGC 


ACCATGGTTA 


AAAAACGATA 


TCATTGTAAT 


GATTTCGTCG 


3780




PTV*TiT/VTi TV 

vy rCATGGAAG 


CAGCAGGACA 


AGTTGTATTT 


GATAACTTGC 


CATTATTATT 


TGCAGTTGGT 


3840 




TV /-» T\ f^r* TV /"^T^TV* 

ACAGCACTTG 


GATTAGCAGG 


AGGAGACGGT 


GTTGCAGCAT


TAGCAGCGCT 


AGTAGGTTAC 


3900 




TTAATTATGA 


ATGCAACAAT 


GGGGAAAGTG 


TTGCACATTA 


CAATTGATGA 


CATTTTCTCA 


3960 




TATGCCAAAG 


GGGCAAAAGA 


ATTAAGTCAA 


GCAGCGAAAG 


AACCAGCACA 


TGCTTTAGTA 


4020 




TTAGGTATTC 


CAACGTTACA 


AACGGGTGTG 


TTTGGTGGTA 


TTATCATGGG 


TGCTTTAGCC 


4080 




GCATGGTGTT 


ACAACAAATT 


TTATAATATT 


ACACTACCAC 


CATTTTTAGG 


ATTCTTTGCA 


4140 
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AGCTTTGCGT GGCCACCAAT TCAAGATGGA TTAAATAGTT TATCGAATTT CTTATTAAAT 4 260 

AAAAATTTAA CATTAACAAC GTTTATATTC GGTATTATTG AACGCTCATT AATTCCATTT 4320

GGTTTACATC ATATTTTCTA TTCACCGTTC TGGTTTGAAT TCGGAAGTTA TACAAATCAC 4380

GCAGGTGAAT TGGTTCGTGG TGACCAACGT ATTTGGATGG CACAATTGAA AGATGGCGTA 4440

CCATTTACTG CTGGTGCATT TACTACTGGT AAATATCCAT TTATGATGTT TGGTTTACCA 4500

GCGGCGGCAT TTGCTATTTA TAAAAATGCA CGACCAGAAC GTAAAAAAGT CGTGGGTGGT 4560

TTAATGTTAT CAGCAGGATT AACTGCATTT TTAACTGGTA TCACTGAGCC ATTAGAATTT 4620

TCATTCTTAT TTGTAGCACC AGTACTTTAT GGAATTCACG TATTATTAGC TGGTACATCA 4680

TTCTTAGTAA TGCATTTATT AGGCGTTAAA ATTGGTATGA CATTCTCAGG TGGTTTCATA 4740

GATTATATTT TATATGGTTT ATTAAACTGG GATCGTTCAC ACGCATTATT AGTTATTCCA 4800

GTCGGTATTG TATATGCTAT CGTGTATTAC TTCTTATTCG ACTTTGCAAT TCGTAAGTTT 4860

AAATTGAAAA CACCAGGTCG TGAAGATGAA GAAACTGAAA TTCGTAACTC TAGTGTCGCA 4920

AAATTACCAT TTGATGTCTT AGATGCAATG GGTGGAAAAG AAAACATTAA ACATTTAGAT 4980

GCATGTATTA CACGTCTACG CGTAGAAGTG GTTGATAAAT CAAAAGTAGA TGTAGCAGGT 5040

ATTAAAGCTT TAGGCGCATC AGGTGTATTA GAAGTTGGAA ACAATATGCA AGCTATCTTT 5100

GGTCCAAAAT CAGATCAAAT TAAACATGAT ATGGCCAAGA TTATGAGTGG TGAAATTACG 5160

AAACCAAGTG AAACGACAGT GACTGAAGAA ATGTCAGATG AACCAGTTCA CGTAGAAGCA 5220

CTTGGAACAA CAGACATCTA TGCACCAGGT ATCGGTCAAA TCATTCCATT ATCAGAAGTA 5280

CCTGATCAAG TATTCGCTGG TAAAATGATG GGTGATGGTG TTGGCTTTAT CCCTGAAAAA 5340

GGTGAAATTG TAGCACCGTT TGATGGTACA GTGAAAACAA TCTTCCCTAC GAAACATGCG 5400

ATAGGATTAG AATCTGAAAG TGGCGTCGAA GTACTTATTC ATATTGGTAT CGATACAGTG 5460

AAACTGAATG GTGAAGGATT CGAAAGTCTG ATTAACGTTG ATGAAAAAGT AACACAAGGT 5520

CAACCATTAA TGAAAGTGAA TTTAGCATAC TTGAAAGCAC ACGCACCAAG CATCGTTACA 5580

CCAATGATTA TTACAAATCT TGAAAATAAA GAACTTGTCA TTGAAGATGT ACAAGATGCT 5640 

GATCCAGGTA AGCTAATTAT GACAGTCAAA TAATGATTAA AAATGAAACA GCATATCAAA 5700 

TGAATGAACT TTTAGTCATT CGTAGTGCGT ATGCGAAGTA GCGAGTTGAA AGAGAATACG 5760

TTACAAAAGG CAGTAGCTTA AAATGAAGCT ACTGCCTTTT TAGTGCGCAA TGATGTATAG 5820

CAGGTGTGTT GATGrTAATA AGTTAAATAT TAGTGTTAGA TATAGAAAAC ATTGCTTATG 5880 

TTTTTGTCAC ATTTTAGAAA AATGCATCTT CGCGACTAGC CAAATTAATA GTCTCATTGA 5940 



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AATAAATTAA 


CATGATTTTA 


AATCTATTTG 


TAAGATAAGG 


AGATTTGTCA 


TTATGACAAC 


6060 




AGAAGGTCTA 


TTAGTTGCAG 


AGAAAGAAAT 


CGAAGTGAAT 


GGTTACGACA 


TTGATGCGAT 


6120 




GGGTGTCGTT 


AGTAATATCG 


TTTATATTAG 


ATGGTTCGAA 


GATTTGAGAA 


CAGCGTTTAT 


6180 




TAATCAGCAC 


ATGAATTACT 


CAACAATGAT 


CAATCAAGGC 


ATTTCACCTA 


TACTTATGAA 


6240 




AACGGAAGCA 


GAGTATAAAG 


TACCTGTCAC 


AATACATGAC 


AAACCAGTAG 


GTCGTATTTA 


6300 




CTTAGTTAAA 


GCAAGCAAGA 


TGAAATGGGT 


GTTTCAGTTT 


GAAATTGTGT 


CCGCACATGG 


6360 




CGTGCATTGT 


ATTGGTACAC 


AGACAGGCGG 


TTTTTACAGA 


TTGAGTGATA 


AGAAGATAAC 


6420 




CTCTGTGCCA 


CAAGTGTTTC 


AAGACATTTT 


AGCAACAAAA 


TAATGACTTC 


ATTTTAAAAT 


6480 




ATAAAAAGTA 


AGAAGGTGTT 


CGAAATGGTT 


AAGCAATTAA 


ATAGTGTCGA 


AGCATTCCGT 


6540 




GAATTTATTC 


ATCAATATCC 


GTTAGCAGTT 


GTACATGTCA 


TGCGCGATCA 


GTGTAGCGTG 


6600 




TGTCATGCCG 


TTTTACCACA 


AATTGAAGAC 


TTGATGCAAT 


CATATCCCAA 


TGTGCCATTA 


6660 




GCTGTGATTA 


ATCAAAGTCA 


GGTGGAAGCT 


ATTGCTGGAG 


AATTAAATAT 


TTTCaCTGTA 


6720 




CCTGTGGATT 


TAATTTTTAT 


GAATGGAAAA 


GAAATGCATC 


GTCAAGGGCG 


TTTTATCGAT 


6780 




ATGCAACGTT 


TTGAACATCA 


TCTTAAGCAA 


ATGAATGATA 


GTGTAAATAA 


CGATGTCGAT 


6840


GAGCATTAAT 


ATCGCAAATG 


ATTAGCATTG 


CTAAGATTAT 


GTAGACATCA 


TAACTTATTT 


6900 




CCCAGTAAAT 


ATTGGTAGTA 


ATTAGAATCA 


GCATGGTACA 


GTAGAACTAT 


AGTAGAAATC 


6960 




ATCAAAGAGG 


AGTGACGACA 


AATGCGTAAA 


AAATGGTCTA 


CACTTGCGTT 


TGGATTTTTA 


7020 


GTTGCAGCAT 


ACGCACATAT 


TAGAATTAAA 


GAAAAACGCA 


GTGTGAAAAG 


TTATATGTTA 


7080 




GAAGAAGGTA 


TACGATTATC 


TAGAGCTAAG 


CGTCGTTTTA 


TGTATAAAGA 


AGAAGCGATG 


7140 




AAAGCATTAG 


AAAAAATGGC 


GCCACAGACA 


GCAGGCGAAT 


ATGAGGGAAC 


CAATTATCAG 


7200 




TTTAAGATGC 


CAGTAAAAGT 


GGATAAGCAC 


TTCGGTTCAA 


CCGTTTATAC 


CGTTAACGAT 


7260 




AAACAAGATA 


AGCATCAACG 


CGTTGTATTA 


TATGCACATG 


GAGGCGCATG 


GTTCCAAGAC 


7320 




CCACTCAAAA 


TTCATTTCGA 


ATTTATTGAT 


GAACTTGCAG 


AAACACTCAA 


TGCTAAAGTC 


7380 




ATCATGCCAG 


TATATCCGAA 


GATTCCGCAT 


CAAGATTATC 


AAGCGACGTA 


TGTGCmTT 


7440 




GAAAAGTTGT 


ACCATGATTT 


ATTGAATCAA 


GTAGCAGATT 


CTAAACAAAT 


CGTTGTAATG 


7500




GGTGACTCTG 


CGGGCGGTCA 


AATTGCTTTA 


TCATTTGCTC 


AATTGTTAAA 


AGAAAAACAT 


7560 




ATTGTGCAAC 


CAGGACATAT 


TGTATTAATT 


TCACCAGTTT 


TAGATGCAAC 


GATGCAGCAT 


7620 




CCTGAAATTC 


CTGACTACTT 


AAAGAAAGAC 


CCAATGGTAG 


GTGTGGATGG 


CaGTGTGTTC 


7680 




TTAGCTGAAC


AATGGGCAGG 


GGACACACCT 


TTAGATAACT 


ACAAAGTATC 


ACCAATTAAT 


7740 
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CCAGATGCTT TGAACTTATC GCAATTGTTG AGTGCGAAAG GTATCGAACA TGACTTTATA 
CCTGGATATT ACCAATTCCA TATTTATCCA GTATTTCCGA 
(2) INFORMATION FOR SEQ ID NO: 139: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
GTCTAAATAA ACAAAATTAT CATTGATTaC TGAACTGGCA TTTCGAAGTA ATGCTTCAAT 60

ATCATTCGAA TATTTCTTCA ATTTATGATT GTGAAATAAT TCTTGCATCA AAAATGGTCT 120

TTGGTCACAT GAATGTGCAT CTGAAGCTAC AAAATGAGCC AAATTACATT CTATAAATTG 180

TAATGATAAC TTTTGAATGT TTTTACCAAA TCCACCAACT AAAGAACTCG ATGTTAATTG 240

ACTCAGTGCC CCATTTGCAA CCAATTCATA TAATATTTCC GGATTTTTGG CGATACTTCT 300

ATTTCTTTCA GGATGTGCAA TGATTGGTAT GTAACCTCTC GATTGTATTT CAAAAAACAA 360

TTGTTTTGTA TAATGTGGTA CTTCGCCCGT TGGAAATTCA ATTAATAAAT ATTTCGAACG 420

ATTAATACCT TGAATACTAC CATTATCTAA GCCTTTCAGA ATCGAATCTG TAATTCTAAT 480

TTCTTGCCCG GGAAATAATT TAATATCCAA TGCTTGAACT TCTGGATGCG TTCTTAACTC 540

CGCCAATTTC ACAAGCACTT GTTGAAATGT ATTATCATAT CTCGGATGCA AATGATGAGG 600

TGTCGCTACA ATACTTGTTA CACCTTCATC CTTAGCTTGC TTTAATAGTG CAATACTCTT 660

TTCAATTGTT TTAGGACCAT CATCTATATC AACTAATATA TGGTTATGAA TATCAATCAT 720

GATTCATCAG TCCCATAATA TGCATAGTAA CTAGCACTTT TATCTTTAGG CATTCTATTT 780

AAGACTACAC CTAATAATTT AGCACCTGTT GCTTCAATAA GTTCTTTTCC TTTTTTAACT 840

TCATCTCTAT TATTATTTTC CGAATTAACT ACGTAGACAA CATTGCCGGT AAACTTTGAA 900

AATAATTGCG CATCTGTAAC TGTGTTCACT GGTGGCGTAT CGATAATTAC AAAGTTATAA 960

TTCATCAATA ATGTGTCATA CAAATTTGCA AATGCCCTTG ATGTAATTAA CTCTGACGGA 1020

TTCGGTGGGA TTGGCCCAGA CGTCAAGACG TCTAAATCTT GAATTTCAGT TGAGATAATA 1080

CTGTCTTGAT AAGTTGACCA ATTTAGCAAT AAACTTGATA GGCCTTCATT GTTTGGCAAA 1140

TTAAAAATAT AATGCTGCGT AGGTTTACGC ATATCCCCGT CTACGATTAG TGTTTTATAA 1200

CCTGCTTGCG CATATGCAAC TGCTAAATTT GCTGCAATTG TAGACTTACC TGCGCCTGGT 1260
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GATCTTATGC 


CTCGAAATTT 


CTCGCTAATA 


GGTGACTTTG 


GTTGTTCATG 


GACAATTAAA 


1380 




CTTGATGTAC 


TTCyTCGTGT 


ATTCGTCATG 


GTAATTCCTC 


GTAAATTAAA 


ATTTTTGTAT 


1440 




TGAACCTAAA ATAGGTAATC 


CTAGTTGCGA 


TTCAACATCT 


TCTTCTGTCT 


TAATACGCTT 


1500 




ATCTAATAAT 


TCTTTTAAGA 


AAATAATCAA 


TATTGCTAAA 


ACAATACCAA 


CAATAATGCT 


1560 




GATAACTAAG 


TTGACAGATA 


CTATTGGAGA 


TAC1TTTACA 


GCATTATCAT 


GTGCTGAGGA 


1620 




AAGTATCGTA 


ACATTATCAA


CACTCATAAT 


TTTAGGCATG 


TCATGAGCAA 


AAACTTTAGA 


1680 




TATTTTATTA 


ACAA'rrnxjT 


CAGATTCAGA 


TTTATTCCCA 


GTGGTAACTG 


ATACAGTAAT 


1740 




AATTTGAGAG 


TTTGTTTGAT 


TGGTTACTTT 


TAAAAATGAA 


TTCAACTCAG 


CTGTTGAATA 


1800 


CTGACCATCA 


AnTTCTCTAG 


ATACTTTATC 


TAGAATTCTA 


GGACTTTTGA 


TAATTTCCGT 


1860 




ATATGTATTA 


ACAGACTGCA 


AACTACTTTG 


AACATTTTGG 


AAAGCTAAAT 


CACTTGAGGA 


1920 




ci"rrrrcATG 


TTCACTAATA 


TTTGAGTAGA 


AGCAGTATAT 


TTGTCAGGCA 


TAACAAAAAA 


1980 


GGTT 












1984 




(2) INFORMATION FOR SEQ ID NO: 140:










(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 








CAAATCCCTT 


GGTGATGAtA 


AAtGtATTGC 


TGTGTAGCCA 


AATAATCTTC 


GTATATATGA 


60 




CTGACGTTCA 


ACAACAGCTT 


GCAATCGTTT 


CGTTGGTACA 


GTTACTTTCT 


TCTTGTTAAA 


120 





GJ^C^TAT 


TCftATTTTAA 


GTTGCTCATT 


TTCAAGCATC 


ACCGAAAAGC 


CATAAAATCT 


180 




TATCATTGTT 


ATAATCGTTC 


CAATAATATA 


TGCCACTATT 


AATACTAGTA 


AAATGATGAT 


240 




TAATACTGAA 


ATACTTACAA 


TTTGAACCCA 


TTGACTAATT 


TCATGATTTA 


GCTTCGACCA 


300 




TGGGATCAAC 


TCTCTTACAG 


CCCCGTAAAT 


CGGTACTAAA 


GCTGCTAACG 


TTACACCAAT 


360 




GGCGCCACTG 


GTCATTGCCA 


TAAATAGTGA 


TTCTTTAAAA 


TTCATCTGAT 


ATATAGGAAT 


420 




GCGTTTATTT 


TTCTGATTAA 


GCATACTATC 


AGTGTTCTGC 


ACTTCATCTA 


AGCGACCTTC 


480 




TGCGATGTCT 


TCCACATTAC 


CTTCAATGTC 


ATGATTACAG 


TTGTCATTCT 


TCTCAGCACT 


540 




AGACTTTTGC 


GCCACTTCTG 


TCTTCAACTC 


TGTTTGCAAT 


TGATCAATAT 


ATCGTTCAAG 


600 




ATATTCACCT 


TGTTTTTTCG 


AAATAACACT 


TAAGACAATA 


CCATCACTTG 


GTGTTTTGAT 


660 
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AATACGTTTT ATATTTAATT CTTTACGCTT TTTATTAAAA ATACCTGTTG TTAAAATGAA 78 

ATAA1TATCC tCAATCCAAT ATCGCGTGTT CATAATTCCG ACAATTTGAG AAATGTATGA 840

TATTAAAAAG AATACAAATA CAATACCTAT CCATAAATAT GATTCGGGAT TCGTATAATC 900

AAAATCTTTC AATTGAAAGA TAATGAAAAT AAAAAAGACG ACTATGTTTT GTTTGATAGC 960

ATTGATTATG CCATTAAAAT ATGAAATCGG ATGTAATTTT TGAGGTTCAG ACATCACTTT 1020

CAACCCCTCT CAAATTCGAC ATAGTTCTCT CTTCGATTAT TTTAACATCG TCATGAGACA 1080

TCATCGGTAA ATAAATAGTA TGACCTGCAG TCATAAATCC AACTTTATAC AAATTAAGCA 1140

CTTTACTAAT TGGATTAGAT TTAATCGACA AGTATTGTAA ACGTTCAATT CGACTCGTTT 1200

CTTCTTTATA TATAAAAAAT GATGTACGAT ATTGTACACT TAGTTGATCA ACTTTATAAA 1260

AGCGACAATG ATATTGCCAT AAAGGCTTAA TAAATAATTT TAATGTACTC AGAGCACCTA 1320

AAACCAACAA AATATAAAGT AAGTAATGTG GCCATTCAAA TCTTAACCAT ATAAAATAAA 1380

AAATGACATA CACAGCTACA CTCAATATAA ATTCTAAGCC ATTCGTAATG TAGTAATACA 1440 

ACAATGCTGA CTTAGGACTC TTAGTCAACT TAGTATAATC TGACATATAC CCCTCTCCCC 1500 

AAATAAAAAA TTATACGGAT TTATAATCTA TTTCATTTTA TTTTTATATG ATGATAATTA 1560 

TAGCATATGG AATATTTCAT GCTAATTTAT TCTTCCTAAA GGTACATCTA AAAATTTAAT 1620 

TAAGCAGAAA GTGCTTGAAT TGCTAAAAAG ACACCATGTT ATAATTTTAT CAACATGATG 1680

CCTTTCATCT ATAATCAATC TTTCATCTTA TCAAGAGCGA TATTTAGTTC AAGCACATTC 1740 

ACATAATCAT TTGTTAACAC ACCACGCTGC TTACGATGTT GAATCAAGTC GGCCACTCTT 1800 

GAAGTAGATA CATGACGAGC ATCAGCAATA CGAGGTGCTT GCTTCAATGC ATTTTCGACC 1860

GTAATATGCG GATCTAAGCC CGACCCAGAA CTTGTTGCAG CATCTATTGT TACATTTGAA 1920 

TTCOCAAATT TAACATGATG TTTCATGCGT GCTATTAATT CGGTGTTTCC ATTCGATTCA 1980 

TTACTTGCAC CTGAAGATAC GCCGTTTTTA TATAATTTTT CAGGATTCAT ATTATAATCA 2040

ACTGCACTCG GTCTCCCGTG AAAATATCGT GTCTCTGTCC AGTGCTGTCC AATCAATTTT 2100 

GATCCAACTA TACGATTGTC ATACGTAATT AAACTGCCAT TTGCTTGTTG ATAAAAAAAT 2160 


ATTTGACCAA TTAACGTGAT AGCTAACGGG AATAAAAATC CACATAATAC CATAGTTATT 2220 

ATCGTTAAAC AAATACTATT TCTTATCGTA TTCATGGTAC AGGCTCCTTC CTCTTTACAC 2280 

AAAAAATTGT ACAATCATAT CTATTAATTT AATGCCTAAA AACGGGACGA TTAATCCACC 2340



TAATCCATAA ATCAACATAT TATTTATAAA GATTCTATCA ATGCTGTAAC CCTTTACTTT 2400

TACACCTTTC ATGGCAATTG GAATTAAGGC AACAATGATT AATGCATTGA ATATCAAAGC 2460
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AATTGTTGAC ATCATTAGTG CAGGTAAAAT TGCAAAGTAT TTTGCTACGT CATTAGCCAA 2580 

ACTAAATGTC GTTAATGCAC CTCTCGTCAT TAATAATTGT TTGCCTATTT TTACAACCTC 2640

TATTAACTTT GTAGGATTCG AATCTAAATC AATTAGATTA GCTGCCTCTT TAGCACTAAT 2700

TGTCCCTGAG TTCATAGCTA ATCCTATATT CGCTTtGTGc tAGCGCAGGT GCATCATTTG 2760

TACCATCTCC TGTCATCGCA ACAATATGGC CTTTCGCTTG TTCATCTTTG ATGACTTTAA 2820

TTTTATCTTC GGGTTTACAC TCTGCAACAA ATCTATCAAC CCCGGCTTCT TTTGCAATTG 2880

TAGCTGCTGT TAAAGCATTA TCACCTGTAC ACATAACTGT TTCAATCCCC ATTTTTCTCA 2940

ATTCAGTAAA TCGTTCTACA AGACCATCTT TAATCACATC TTTTAAATAA ATCACGCCAA 3000

GCATGACATT GTTTTCAATG ACTATTAAtG GnGTGCCACC TTTACTCGAT ACATCCATAC 3060

AGAGAGACTC AATATTAAGA GGAATATTGC CTTGTTGTTG TTTGACAAGA TTTATCATAC 3120

TATTAGGTGC ACCTTTGAAT ACCGATATTT CATTTGTAAT GATTCCGCTC ATTCTAGTTT 3180

CAGCTGTAAA AGGCTTATAT GTGCCATCAA TGTCTTTAGG CAGCTCATTT ATATACATcT 3240

GcttCGCTAA TCGTACAATA CTTTTTCCTT CTGGCGTATC ATCGTAGATT GATGACATAT 3300

AAGCAGCGAC TATCAATTTT TCAAGCATTT GTTGATTCAC TGGTAAAAAT TCACTAGCGA 3360

TTCGATTGCC ATAAGTGATT GTGCCTGTCT TGTCTAAAAT CATTACATCG ACATCTCCAC 3420

ATACTTCTAC AGCACGCCCA CTTTTCGCTA ATACATTGAA TTGAGTAACA CGATCCATGC 3480

CTGCAATACC AATCGCCGAT AACAAACCAC CGATTGTCGT TGGTATTAAA CATACTGTTA 3540

ACGCAATGAG CATCGCAATA GGTAAAATTA AATGCAGGTA AGATGCTATT GGATATAACG 3600

TTACAATAAC GACTAAAAAT ATAATTGTTA ACGTTGTTAA TAATGTAAAA AGTGCAATTT 3660

CATTTGGTGT TTTATTTCTT TCCGCCCCTT CAACTAAGGC AATCATTTTA TCTAAAAAAG 3720

ATGTACnCGC TTCACTCTCA ACACGTATTT CTAACCAATC AGATGTTACA AGTGTACCGC 3780

CAATGACTCC ATCAAAATCG CCACCTGATT CTTTTATCAC AGGTGCAGAC TCACCAGTAA 3840

TTGCAGATTC ATCAACGGTT GCTAATCCAT TTATTACAAC GCCATCAGCA GGGATTGTTT 3900

CTCCATTTTC TACCCGAATA TTTTGTCCGG CTTTTAACTC TGTGGCGTTC ACTATCCGAT 3960

ACGCACCATT TTCTTCTATC AATCGAGCAG TTAAATTTGA TTGTGCTTGT CTTAAACTAT 4020

CAGCTTGCGC TTTTCCACGA CCTTCAGCAA AGGCTTCTGA AAAATTAGCA AACAATATAG 4080

TTATTAATAA TATGATAAAA ATTGTAATCA AATAACCTCG CGATAGATAG CTAGTTCCAA 4140

ATATGTCAGG AAAACATATT AATATCAACG TTAAAATCAT TCCAACCTCA ACGACAAACA 4200

TTATCGGATT TTTTATTAAT TGTTTAAGAT TCAGCTTATA AAAACTCATT TTCAAAGCTT 4260
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TTTATTTTAA 


AGTTAAAAAT 


TCACCAATAG 


GACCAAGTAA 


TAGTACTGGA 


ATAAATGTPA 






AACCACTTAG 


TAAAACGATA 


AATACGATTA 


GTGATACGCC 


AAAATAAGGT 


TTATCAATCG 


4440




CTATTGTATA 


TTTATCTTGA 


TGGTATGATT 


TTTTATTCAC 


TAAACTTGAT 


GCAATCATTA 


^ C A n 




ATTGCAAAAT 


AATTGGTATA 


TAACGAGAAA 


GCAACATAAT 


GATTCCTGTA


GAGATATTPP 


>4 C ^ t\ 




AGAATGTTGT 


ATCATCTTTC 


AGTCCTTCAA 


ACCCTGATCC 


ATTGTTCGCA 


GCAGPTGATG 


^i *■> r\ 




TCATTTCATA 


CATAACTTGT 


GAAATACCAT 


GAAAAGACGG 


ATTCGTtATa


w x x v» x x vj 


^ /; q A 




CTCCAGGAAT 


CATAAAAGCA 


AGTGCTGAAA


ATACTAAAAT 


TAAAATTGGG 


TGT ATG AG A A 


** / *% U 




AGACTAAGAC 


AATACATTTC 


ATTTCACGGG 


CGCCAATTGG 


CATATTTAAA 


TATTTTGGTG 

X ^^X X W X X v7 


nOUU 




TTTTACCAAC 


CATCAAACTG 


CATATAAACA 


CCGTCAGTAA 


GACAAATATC 


AATAAATTPA 






TGAGTCCTAC 


GCCTTCGCCA 


CGAAATACAA 


CATTTAGCAT


CJVTTAAT AC C 


A'l'TGGTPPTA 


* y ^ u 




ATCCACCTAT 


AGGCGTTAAG 


CTATCATGCA 


TGTTATTAAC 


AGAACCPGTT 


GT AAA TYSPPn 






TCGTAATAAC 


TGTAAATAGT 


GCTGACAAAC 


CTGCTCCAAA 




r i " i ■ a pf^*i a \ , (~*c*Ti 


5040 




TATTCGGTCC 


ATAAATGCCT 


AAATTCGCTA


GTATTGGATT 


ACCACGATAP 


X X ^—Vw/\\ h\ 


bluu 


TAGTTAATGT 


AAGAATTGCT 


ATAAAAATGA 


AAAACATTGC 


GACAAATAAT 


ATP A A pgp A T 


blfeO 




GACGATGTAC 


TCGTTTACCA 


TGTCTACTTA


ACATGCGACC 


AAATAAGAAH 








TAGGAAGTAA 


CATCATACTG 


CCCATTTCTA 


TAAAATTGCT 


CCAAATATTT 




c o r\ 




AAGGTGTTGC 


AGAATTTCCT 


GCTAAAAATC 


CTCCACCATT 


CGTACCAAGA 


luiiilni Xvj 






ATTCAAGTGA 


TGCAATAGGT 


CCAAATGCAA 


TATGTTGAAT 


ATGTCCGCTT 


AAAGTPPG A A 






TCATTAAATT 


AGCATGCAAC 


GTTTGTGGTA 


CaCCTTGAGT 


CATCAATAAA 


ATA CTA ATT A 


J* D U 




AACATGATAA 


TGGTAAAAGT 


ACTCGGACAA 


TAAACCGAAC 


AATATPTTTGA 

X X *w X X wx\ 


TAA A A ATTAC 






CAATGATATT 


AGTTAATCCA 


GTTAAACGTC 


TCAACATCGC 


TATACAAACG 


GPGT A A CCTG 






ATGCACTAGA 


TGTAAACATT 


AAATATGTCA 


TTACAATCAT 


TTGCGTTAAA 


TATGTP A PAT 






CTGaTTCACC 


GTTATAGTGT 


TGtAAATTAC


TATTTGTTAA 


AAAAGATATT 


GPTGTATTAA 
vjv — x vj x *x x x 






ACGCTAAATC 


TATCGATTGG 


TTTAAATTAT 


GATTTGGATT 


TAAAAAAAGC 


CATTGCTGAA 


5760 


CTATTAGCAA


TACAAATGTT 


ATAAACCCCA 


TAAATP. CATT 


AAATGPPAGA 


A A ATGTTTG A 






t_A 1 A IXj I I I I 


AGCTGACATG 


TGTTCTAAAT 


CTGTGCCGAT 


AATTTTAAAA 


CACATATTTT 


5880 




CAAATCTAGT 


AAATATTAAA 


TCTACTCTTG 


ACGATTGCAC 


CAATGCTACG 


CGATATAGAT 


5940 




ATCCACTAAA 


AACATACGTA 


ATCATAACCA 


TCATTGTTAG 


AAACAAAATT 


ATTTCCATGA 


6000 




TAACCCTCAC 


TTAATATATT 


TCTAAAATTT 


TTCACTACGA 


ATTAAGGCAT 


AAAATAAATA 


6060 
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25 



ACACAACAAC ATCGTAACAA CTTGTTTATG AGAGAAATnT TAATTTTCAA ACTTAGTTAT 6180

TAAGAAAnCA TTAAGATGTG TATGCAGAAA TAAATTTTAT AGCATTTAAT TGTGAAGAAT 6240

ATTATGATAT TGCTATCGAG GTGAAGGTTA TG 6272

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1978 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

AAATGATGTT TTACAATAAA TATAnAAACG TATCAACATA TATCATCATA TTTTTAGTTT 6< 

20 CAAGTGCAGC CTTTGCAATA TTCTTGTTAA GTGCGnACAT TAGTGCTCAC TCGGAACAAG 121 

TGTACGAAAT GACTGACCAT CAAATTAAGA ACAATACGAT AAATAAAGCA TACGAACATA 18< 

AAGACCCTAC AAACAATAGC GAACAAAGAG ATGGGAAAGT GTTCGCTTTA ATAAATTGAT 24 ( 

ACATTGTCAC AACGTTATTT TG C CTATTTT TGCGmAATAG CGTTTTTTAT TACwTTTTTG 30< 

CTGATsTTAA ATTTGTTATA TTTTGTTAAA GTATTATAAT GATTGAATAA ACAAATTGAA 36< 

GGTAGGTTTT TTAATTGAGT AATTCTGATT TGAATATCGA AAGAATTAAC GAGTTAGCTA 42L 

AAAAGAAAAA AGAAGTAGGA TTAACTCAAG AAGAAGCAAA GGAGCAAACA GCCTTAAGaA 4 80 

AAGCTTATCT TGAGAGTTTT AGAAAAGGGT TTAAACAACA AATTGaAAAT ACTAAAGTAA 54 0 

TTGATCCAGr AGGTAATGAT GTAACACCTG AAAAAATTAA AGAGATACAA CAAAAAAGAG 600 

ATAATAAAAA TTAAATCACA AATCTGTAAA GAATTTTCTG ACATTATAAC TTGAAATAAG 660 

TATTTTACTT ATCTTTTTAT TTTAAAATAA GTTATAATGT ATTTGATAAA ATTGAAGAAG 720 

40 GGAAGATACA CAAGATGTTT AATGAAAAAG ATCAATTAGC TGTTGATACG CTACGTGCAC 7 80 

TAAGTATCGA CACAATCGAA AAAGCGAATT CTGGTCATCC AGGATTACCT ATGGGAGCTG 840 

CCCCAATGGC TTACACTTTG TGGACACGTC ATCTGAATTT TAATCCACAA TCTAAAGATT 900 

ACTTCAATAG AGACCGTTTC GTATTATCTG CAGGGCATGG TTCAGCATTA TTGTATAGCT 960

TGTTACATGT TTCTGGTAGT TTAGAATTAG AAGAATTAAA GCAATTTAGA CAATGGGGTT 1020 

CTAAAACACC AGGTCATCCT GAATACAGAC ATACAGATGG TGTAGAAGTT ACTACCGGAC 1080

CACTTGGACA AGGTTTTGCT ATGTCAGTAG GATTAGCTTT ACAGAAGATC ACCTAGCAGG 1140

gAAATTTAAT AAAGAAGGAT ATAATGTTGT AGATCATTAC ACATATGTAT TAGCTtCTGA 1200
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AAGTAAATTA GTTGTTTTAT ACGATTCAAA TGATATTTCA TTAGATGGCG AATTAAACAA 1320 

AGCTTTTTCT GAAAACACAA AAGCTCGTTT TGAAGCATAT GGTTGGAATT ACTTACTAGT 1380 

TAAAGATGGT AATGATTTAG AAGAAATTGA TAAAGCGATT ACTACAGCTA AATCTCAAGA 1440

AGGACCAACG ATTATTGAAG TTAAAACAAC AATCGGATTT GGTTCACCGA ATAAAGCAGG 1500 

AACTAATGGT GTTCATGGGG CACCTTTAGG TGAAGTTGAA AGAAAATTAA CATTCGAAAA 1560 

TTACGGTTTA GATCCTGAAA AACGTTTTAA TGTTTCAGAA GAGGTATACG AAATTTTCCA 1620

AAATACTATG TTAAAACGTG CTAATGAAGA TGAATCTCAA TGGAATTCAT TATTAGAAAA 1680 

ATATGCAGAA ACATATCCTG AATTAGCAGA AGAATTTAAA TTAGCGATTA GTGGTAAATT 1740

GCCTAAAAAT TATAAGGATG AATTACCACG TTTTGAACTG GGTCATAATG GTGCATCTCG 1800 

TGCTGATTCT GGTACTGTTA TTCAAGCAAT CAGTAAAACT GTCCCTTCAT TCTTTGGTGG 1860 

ATCAGCAGAC CTTGCTGGTT CAAACAAATC CAATGTAAAT GATGCAACTG ATTATAGTTC 1920

TGAAACACCT GAAGGtAAAA ATGTGTGGTT TGGTGTACGT GAATTTGCTA TGGGTGCT 1978
(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:
TAGTAGTATT TATTAAATTA TACGAAGGGA CCcAACACAG AAAATTCATT TTATTGAATT 60 
TTACATTTAT GTGCCAAGTT GGGAAAAATG TCTTATTTTT TCaAAGTATT TAAAAGTAAA 120 
ATTACATGTT AATACGTAGT ATTAATGGCG AGACTCCTGA GGGAGCAGTG CCAGTCGAAG 180 
ACCGAGGCTG AGACGGCACC CTAGGAAAGC GAAGCCATTC AATACGAAGT ATTGTATAAA 240

TAGAGAACAG CAGTAAGATA TTTTCTAATT GAAAATTATC TTACTGCTGT TTTTTAGGGA 300

TTTATGTCCC AACCTTTTTA GAATATTAAA TTTCTACAAT TTCGTCATCT TCAACAATAA 360

AGCCCATTGT ATTGACGCTG TTATTTAAGA AAGTCAGAAT ATAACGCATT ACTTCATCAC 420

GTTCTGGCTC ATTGTGAACC TCGTGGTAAA AACCTTGCCA AGCTTTAAAA TATAATTCAG 480

GTGTTTGATA TTTTTCTTTA AACTCATCAA TTGCCCTAGT ATCAACAATT AAATCCTTCG 540

TTCCATACAT TAATAGCGTT GGCATTGGTT GAATGTCATG AATATGAGCC ATCGTATCTT 600

TCATCGTCTC ATTAATTGTA TTATACCAAT GATACGTTGC TTTTTTTAAC ATTAAACCAT 660 
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CATTAAAACG TGTGTCTTTT GAAATTTTAC CTATATTTGA AACAAGTTTA TCTTTACGAT 780 

TTTTTCCATT CTTTTGAAGT TCTAGCATAG GAGAAATTAA CATCATCCCC TCGATTGGCA 840

ATTCTACTTT TTCAAGTAAA TTTAATAAAA TCAAACCGCC AAGTCCTACC CCTAATACAT 900 

AAGTAGGAAT TTTATATTCA TTAGCTATCT TTAACCAGTC TAGCAAACTT TCGTGATACG 960

TTTGAAAGTT TTCAATTTGT CCTTTATTAG CTCTTGAAGT TTGACCTTGA CCAGGCAAAT 1020

CTCCCATAAT CACATGATAG CCATTTCTTC TTAACATCGT AATAACATAT GCATATCTTC 1080 

CCGTATGTTC TAATATATTA TGAGCAATAA CAACGACGCC TTTCGCATCA TTTTCAGCTT 1140

CCCACTTCCA CATTATTATA CTGCCCCTTT TTCATTAATC TTCAATAACA TAATTATAGC 1200

AAATTCACTA TGTAGATTTC TATTTATAGT ATTATTGTTG TCCATATTAT TATATATAAA 1260 

TGAAATCAAC ATCAATAATA GTGTAATTAT ACATAATTAT TTTTGATTGT TTTTGATGAA 1320

AACGCTTTCT CGAATATTTT TTTCATGCTA AACTTATTGT AAACACAAGG GTTTGGAGGA 1380

GTAGCAATGG CACTATTAAA GAATTTTTTT ATCGGATTAT CTAATAATAG TTTTTTAAAC 1440

AACGCAGCAA AAAAAGTGGG CCCACGTTTG GGCGCCAATA AAGTCGTTGC CGGAAATACA 1500 

ATTCCAGAGT TAATTAATAC AATCGAATAC TTAAATGACA AGAATATCGC TGTTACGGTA 1560

GACAATTTAG GGGAATTTGT CGGTACAGTT GAAGAAAGTA ATCATGCTAA AGAACAAATT 1620 

TTAACAATTA TGGACGCGCT TCATCAACAT GGCGTAAAGG CACATATGTC TGTTAAATTG 1680 

AGTCAGTTAG GTGCAGAATT CGACTTAGAA TTAGCTTACC AAAATTTAAG AGAGATTTTA 1740

CTTAAAGCAA ATACTTACAA CAATATGCAT ATAAATATTG ATACTGAAAA ATATGCTAGC 1800

CTGCAACAAA TTGTTCAAGT TTTAGATCGC TTAAAAGGCG AATTTAGAAA TGTTGGTACT 1860

GTAATTCAAG CATATTTATA CGATAGCCAC GAATTAGTTG ATAAGTACCA AGATTTACGA 1920 

TTACGTTTGG TTAAAGGTGC ATATAAAGAA AACGAATGAA TTGCATTTCA ATGTAAGGAA 1980

GACGTAGATG CAAATTACAT CAAAATAATT GAACAACGTT TGTTAAAGGC ACGCAATTTC 2040 

ACTTCAATTG CAACACATGA CCATCGCATC ATTAATCATG TAAAACAATT TATGAAAGAA 2100 

AATCACATTG AAAAAGATCG TATGGAATTC CAAATGCTCT ATGGTTTTAG ATCAGAGTTA 2160

GCAGAAGAAA TCGCAAATGA AGGCTATAAT TTCACTATTT ATGTACCTTA TGGCGATGAT 2220

TGGTTTGCGT ATTTTATGAG AAGATTAGCA GAACGCCCAC AAAACCTATC TCTTGCTGTA 2280

AAAGAATTTG TGAAACCTGC TGGCTTAAAA CGTGTTGGCA TAATTGCAGC TTTAGGAGCT 2340

ACAGTTATGT TAGGTTTAAG TACAATTAAA AAATTATGCC GTAAATAGAG CAAGACATAA 2400

ACAATAATTT AGGAGTCTGG AACAATAATC AATGTTCTAG GCTCCTAAAT GTTATATTGG 2460 
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TAGATTTTAA TAAATTAGCC ATTTCAATTG CACTTACTGC TGCTTCAGCA CCTTTATTGC 2580

CAGCTTTCGT ACCTGCTCTT TCCACAGCTT GTTCAATACT TTCAGTCGTT AAAATACCAA 2640

ATATGACTGG TACATTAGTT TGATCATTCA CTTTAGAAAC ACCTTTCGCG ACTTCATTAC 2700

AAACATAATC ATAATGAGAC GTAGCACCGC GAATTACGCA TCCTAATGTA ATTACTGCAT 2760

CATAATTTCC TGATGAGGCT AATTTTTTAG CTACTAAAGG AATTTCAAAC GCACCTGGCA 2820

CAAATGCTAC ATCAATATTG TCTTCATTAA CATCATGTCG AATCAAAGTA TCTTITGCAC 2880

CTTCAAGTAA TCTTCCAGTG ATAAAATCAT TAAATCGACT AACTACGATT GCAACTTTCA 2940

AATCTTTTCC AATTAATTTA CCTTCAAAAT TCATGTTAAA ATCCTCCTAT ATTAAATGAC 3000

CCATTTTTAT TTTTTTCGTT TCCATATAAT CATGATTATG TACCGTTTCT GGTACGATAA 3060

CTTCAATTCT TTCTGCAATA TCAATGCCAT ATTGTTTTAA TCCCTCAAAT TTACTTGGAT 3120

TATTACTTAA TAAATTGATA TGTTCGATGT TAAAATATTT TAAAATCTGT GCAGCAATAT 3180

GATAATCTCG CAAATCTTCA TCAAAACCTA ATGCTAAATT TGCAGTTACT GTATCATATC 3240

CTTGCTCAAT TAATTCATAT GCGCGTAATT TGTTTAACAA TCCTATGCCA CGACCTTCTT 3300

GAGGTAGATA AATAATCATG CCACCATGTT CATTGATATA CTTCATAGAC GATTCAAGTT 3360

GAGCACCACA ATCACAACGT TGACTATGGA AAATATCGCC TGTAAGgCAC GCAGAATGTA 3420

AGCGTACATT TTCATGTTGT CGAATTGCAC CTTTTGTCAG TACAACTATC TCTTCATCTG 3480

TGTATGTCGC TTTAAAACCA TACATATCAA ATGTTCCGAA ATCTGTAGGC ATTTTCACTT 3540

TTGCCTTAAA TTCAATTTCT GGTTCTAATT TTTTACGATA TTCAATTAAA TCATCAATCG 3600

TAATCATCTT TAATTGATGT TTTTCTTTAA ACTTTTGTAA ATCTTGTCCT TTCGCCATCG 3660

TGCCGTCATC ATTCATAATC TCACAAATGA CACCAGCGGG CTTGGCACCA GTAAGTTTAG 3720

CTAAATCAAC AGCCGCTTCT GTGTGTCCAT TTCTAGCTAA TACGCCTTTA TCTTGTGCTA 3780

CTAATGGAAA TAAATGACCA GGACGATTAA AATCTTTAGC TTCACTACTA GGATCAATGA 3840

GCTITn'GGC AGTCAATGTA CGTTCATAAG CACTAATTCC TGTTGTTGTA TCTACATGAT 3900

CAATACTCAC TGTAAATTGC GTACCAAAGA TGTCGGAGTT ATCATCAACC ATTTGTACCA 3960

AATCCAAACG TTGTGCAATA TCTTTAGACZA CTGGTGCGCA TATTAATCCC CtTGCTTCTT 4020

TCGCCATAAA ATTAATGGTA X ± .cX X V— Vj X X V — r-V AAATCACCTT 4080

CATTTTCACG ATTCTCATCA TCTACTACAA TAATTGGTTC TCCA'rrri-ri' AAAGCCATTA 4140

AAGCACTGTC AATATTATCG AATTGCATGC TACCCCTCCt AAAAACCAAA TGCTCTTAAT 4200

TTATCTACAG ATAATTGGTC TTTATCTTTA TTTAAAATAT TTTCAACATA TTTAAACAAA 4260
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CTCGTTTCTG GAATAAGATG AATGTCAAAA CTGTTATCAT GCTTATCAAA TACCGTTAGA 4380

CTAACACCAT CCACAGTAAT AGACCCTTGC TTAACTAACT GATTATTAAT ATGTTGGCTA 4440

CATTGAATCG TAATAATTTT TGCATTGGCT GTTTCATTTA TTTTTGAAAC TGTTCCTAGT 4500

TCATCTACAT GACCGAGGAC AAAATGTCCA CCAAACCTAC CGTTACCACT CATGGCACGC 4560

TCTAAATTTA CTTCTGATTG TCGCTTAACA TCTGCTAAAT AGGTTTTATT TTCAGTGCCT 4620

TTAATTACTT GAACAGTAAA AGATGTCTGA TTAAAATCAA TCACTGTTAA ACATGCACCA 4680

TTAACACTGA TGGAATCACC AATATGCATA TCTGCCGTAA TCTTATGTGC TTCAATTTCA 4740

ATCGTCCTGA CTGATTGACG AATTTGAACA CTTTTAACGA CACCTATTTC TTCAACGATG 4800

CCAGTAAACA TGCATCATCA CTTCTTTCGT AAAGTTAATT TAACATTTTG ATTTAATAAC 4860

TCGGAATGAA CAATTTCAAA TTGGTTCGCA TCTGGTATCT CAATCACATC ATTTGTTTGA 4920

TAAAATTGAT AATTTCCAGA TCCGCCAATT AATTTCGGGG CATAATAGAG AATAAATTCA 4980

TCTATATAAT TAGATTGGAG AAATTCTGAA GTAGTGGTTG GACCTGCCTC GACTAGCAAA 5040 

GTTCCAACTC CTCTTTTATA TAAATTGTGA AGAATTGTTG TTAAATCGCA AGACTTCAAG 5100 

TAAATAATTT CAATATGTGT TTGATTGGTT GTTAAATTTG GATTTTCAGT ATATATCCAA 5160

ATTGGTGTTG ATTCATCTTG ATAAATTTGC TGATTAAAAT GAATATTCCC AGACTTAGAC 5220 

AATATTACTT TTATAGGGTT TTTTCCATCT TGAATACGTG TAGTATATTG TGGATCATCT 5280 

AATTCAACTG TACGTCTTCC AGTTAACACT GCGTCGTGTC GATGTCTTAA CTTATAGACA 5340

TCTTGTTTAA CCTCTTTGTT AGTAATCCAT TGACTTTGTC CATTATCATT CGCTTGTTTA 5400

CCATCTAAAC TTGCAGATAC TTTCACTGTA ATTTGTGGCA GTTGCTTTGC TTTTGCTTTA 5460

AAAAAGTCTT GGTATAATTG TGATGCCCGT TCATCATCAA CGCATTCAAC CTCAATACCG 5520 

TGAGCCCGTA ACGTGTCATC ACCATGTGTG TCTAACGAAT TGTCTTTTGT TGCGTATACT 5580

ACTTTTGCTA TCTTACAATC AATTATTTTG TTAACACAGG GTGGTGTTGA ACCAAAATGA 5640

CTACATGGCT CTAACGTAAT ATAAATCGTC GCACCTTCAG CATTTTGTTG TGCCATATCA 5700

AGTGCTTGAA CCTCCGCATG CTTGTCACCT TTTCTCAAGT GTGCACCAAT ACCAACAATC 5760 

CTACCTTCTT TAACTACAAC AGCGCCAACG GGTGGATTAA CACCTGTTTG ACCTTGTACC 5820

ATATTTGCAA GTTGAATCGC ATAATCCATA AATTGACTCA AATGATCACC TCTATAAACA 5880 

AAAATCCTCA CATCATGAAT TAAGATGCAA GGAGaAAAAT TTATCGTTAA ATAAGCCTAT 5940

TTGTACACAT TTTTACAAAT ACGCTACATT ATCTTTGTCG ATAATTAACA TTCTTTCTCC 6000

CATCCAGACT TTAACTGTCG GCTCTAGAAT CTCACTAGAT CAGCCACTAA TATGAAACAT 6060
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TTaTATATGA AATTGTTATA GATTATTTGA GTACGTAGTA TGTCAACTAC ATTTAAAATG 6180

ATACTATATG TTTTCTGAAA AAACAATTAA TGACGGTTTT AATTTAATAT AATCTGAGTA 6240

CTATAGGCAT CTCATTGATA TGATTCTTAC TAACAGACAT TAAAATCAAA CCTTCAATTC 6300

GTCTCTATAG AGCGTTCTCT TTATTATCTT CTAGTTACAA ATTATTGATT GtCACtGCGC 6360

TGTTGTTGCT CATTCGATTC TAAAGCATCA TATAATTGAG ATACTGTATG CGCAACTTGT 6420

TCTACAATCA TTTTCACACC GTTTCGTAGT TTATTAACAC CGTTTGTCAT TTGACCTATC 6480

GCAATCATAT TTGTTAATGT TCCAAACCTT GGACTAATAA CTTGATTGGT TTCCGGAATG 6540

ATTTGTATGC CTCCCATTGG GTGTGCTTGT ACAATTTGTC TATTTTCAAG ATTTCTAATT 6600

AATTGATCAT CTTGATCCAA TTCATTTAAA TGACTTTTTG CACCTGTCGC GTTAATGACA 6660

ACATTATATA TGTCTACTGA TTCTTGGTTT TTGTATGAAA AATAATACAA CTTGCCATaC 6720

ATGTTCACAT CTTCTAAATC TTTTTTCAAA ATTAAAGACT TATTTTCTAT TAATTCAATA 6780

ATTAGTTCAG CAGTTCTTGG AGGCATTGGA TTTGAATTTA ATTGAATCAT CTTTGAGTAT 6840

TTTTGATTAA ATTGATGTTG GTCTTCAATA CTTAAGCTAT TCCATATCCA ATTTAAATTC 6900

TCTTTCAAAT GTTCAATCAT ACTTTGGAAA ATGCCCaTTT CTGTTGGACG CGCTAAATCA 6960

TACTTCAAAT CTGCAATATG ATTTCCTGTA CGTCTATGTA ctaatitttt AAAATCAATG 7020

TCATATTCAG CACATTCTTT TAAAAATAAA GAAACTAAAG TATCAAGCGG TGCATTGCCG 7080

AAATGATGTT TTTTAATGTC ATTTAATTTG TCTTTAGTTA AGTACTTGAA TGTCACGTCT 7140

ATCATTGTAC CTCTTACACT TGGTAAATGA GCAGAACGAC TCGTCATAGT AATTGGTAAT 7200

TTTGGATGAT GAGCAGCAAC ATAACGGACA ACATCTAAAC TGGCAAGGCC TGTACCAATA 7260

ATCGCAATAT CGTCCAGTTC ATTTACTTCG TCTAACGTAT TATATGTTGG ATAAGGCGTA 7320

gcGATATATC CTTTTTTACC CTTTAAGTTA TATGGATCAT GGTAGGCAAA TGTACCACAT 7380

GTTAAAAATA CATAATCGTA CGCTTGCCAT GATTGTCCTG AATTTGTAGT ACATATGTAA 7440

TAAGTTAAAT TCGTTTCATC GATATTAGAA TTTGTATAAA TCTCTTGAAC TTTATTATAA 7500

TTAGTTGATA TATTTGGATA TTTTTTCGTG AACATAGATA AATAAGATTT CATATAATGT 7560

CCGAATACAA ATCTCGGTAA ATATGCAG 7588




(2) INFORMATION FOR SEQ ID NO: 143: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

nCTAGGTATT TTAAACCTAA TCTAGATAAA CTAGCTTCGT AAGCAGCTGC TACATTTTCA 60 

CGACCGAAAT CCTCAAAATA TAATTTTGAA GTAATAAATA AGTCTTCTCT AGCAATACCA 120

GTTGACTCCA ATCCGGCACG AATGCCAGCA CCTACTTGTT CTTCATTCCC ATAAACTTTT 180

GCGGTATCAA TACTACGATA TCCTTGTTCA ATGGCATACT TAACACTTTC CATGCAATTT 240

TCATCATTTT CCACACGAAA TGTCCCTAAA CCAATTTGTG GCATCGTGTT TCCATTATAA 300

AATGTTTTAA CCTCCATAAA TATCGCCTCA CCTTTTTGAT GTATTATACC CTGTTATCAT 360

AACAAATCTG AGTTGAATAC ATGAGAAAAA ACACTTAGAG CAATCAACCA CTAAAATTCT 420 

AGTAATATCT CTCAAATATT AATCAAATTG TAAAAGTAAT TCTGTTTAAT TTATGACAAA 480

CTAAAAAAGC CGAAGTAACA ACATATAGTC ATCACTTCAG CCTAACATTT AATTGAATGA 540 

TTCAATTTTA TCCATCATTT GTTGTAAGTC TTCCACGTTG TATTGAATAC GACCATGGAA 600 

TACAAATTTG TTAAAGAACT CGTCTAATTG TTCAGCACCG ACAAGCACTT TGACAGCACT 660

ATTTTGATTA TAATTTGAAA TCGTTACATC GCCTTCATTT TTAAGATTAA AGTATAAAAT 720

TGAAGTTGGT GTATATTTGG CACCTAATTC TTTTTGTAAG TCTTCAGCCA ATTGTTTAAT 780

CGCCTCAATT TGATCTGAAT AATTTACAAA TGATAATGAA CGTTTGTCAT CATTTTGATC 840

CATCACAATA GTTTGCGGTC TAGATTTATC TAAATCCAAT GTATCAAATA CTTGTTCCAT 900 

TGGTGGTAAA TCTTTAAATT GACCGCCACT AATACCATTA TAAACATGAC CTTTTAACAA 960 

TTGAGAATCA ATAATATAAA GACCAGTTCT TGTTAATACT AAATGACTAA TTCGTTCAAT 1020 

ATTATTAAAG CCATCCTTTG GTAAAAAGAT ATTTGCCATA ATGTGCATAT CTTCTGGTCG 1080

AATTCGTTTT TCTTTAACTA ATCTTTCACG AATACCAATT AATCTCATGT CCGTTACATA 1140

" TTC^ATGA 'TTTTTO^GA ACMTm 1200 - 

CATGTGATTA TAATCTTCTT GTTGTTTTGT AATTGTCTTT TTATTTTGAA TACGCTCTTT 1260 

CTCTAAAGCT TCTTCATGAG ACTTTTTAAT GTTTTGTTCT TGTTGTTCAT ACTTTTCTTC 1320 

TGTTTGTCGC TTAACTTTTT TCTTACTACC TAAGGCAACT AAAAAAAGGA CAAAAAAGAT 1380

TAATGCAATG AgCTACTGCA ATAATGAGTC CAATGACTAT CGGTGAAGAT AAATCCATCA 1440

CAACAACGCT CCTTTTTAAT ATATGAATAA CTTTAATTAT AATAGAaAAG CTAAAGATTT 1500 

TCGATACATA TTATCATTTA TATACCGAAA ATCTTTTATT TAGCTATATT CAATTCATCT 1560

TATTATTTTA CTGCGTCTTT TAATTCTTCC ACTTTGTCTA ATTTTTCCCA TGGGAATAAG 1620

ACATCTGTAC GTCCAAAATG ACCATAAGCA GCAGTTTGTT TGTAAATCGG TTGTTTCAAA 1680
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AGTTGCCCTT CAGAAACTTT ACCTGTTCCA AATGTATCAA TTGCAATTGA CACTGGTTCT 1800

GCAACACCAA TCGCATATGC CAATTGTACT TCACATTGAT CTGCTAAACC TGCTGCAACA 1860

ATATTTTTAG CCACATAACG TGCAGCGTAT GCAGCTGAAC GGTCTACTTT TGTAGGATCC 1920

TTACCACTGA AGCATCCGCC ACCATGACGT GCATAGCCAC CGTACGTATC AACAATGATT 1980

TTACGTCCTG TTAATCCTGC ATCACCTTGA GGTCCACCGA TTACAAAGCG TCCTGTAGGA 2040

TTGATGTAGA ATTTAGTTTG TTCATTAATC AAGTTTTCTG GAACAGTTGG ATAAATGACA 2100

TGTGCTTTAA TGTCTTCTTG AATTTGTTCA AGTGTCACAT CCTCAGCATG TTGTGTTGAT 2160

ACGACAATCG TATCAATACG TACTGGGTTA TCATTTTCAT CATATTCAAC AGTGACCTGA 2220

ACTTTACCGT CTGGTCGTAA ATAATTTAAC GTACCATCTT TACGCACATC TGATAAACGT 2280

TTTGCCAATT GATGTGATAA ATAAATTGCT AGAGGCATAT ACGTCTCTGT TTCATTCGTT 2340

GCGTAACCAA ACATTAAACC TTGGTCACCT GCACCTGTTG CTTCAATTTC TTCTTCGCTA 2400

TCTTTATCAC GATACTCTAA TGCTTTATCC ACGCCTTGTG CAATGTCAGG TGATTGTTCA 2460

TCAATCGCAG TTAAAATTGC CATTGTTTCA TAATCATAAC CATATTTTGC TCTTGTGTAT 2520

CCAATTTCTT TAATTGlTi'C TCTAACAACT TTCGGAATAT CAACATATGT TGTTGTAGAA 2580

ATTTCGCCGG CGATCAATGC CATACCTGTT GTAACAGTTG TTtCACAAGC TACACGTGCA 2640

TTTGGATCGT CTTTTAAAAT AGCATCTAAT ATTGCATCTG ACACTTGGTC AGCGATTTTA 2700

TCTGGGTGTC CTTCTGTAAC AGACTCTGAA GTAAATAATC GTTTGTTATT TAACATAGTT 2760

TGCTCCTTTA AATTTATATT ACGAAAATTC TCTCTCTGTG AGCTAAATAA AAAAGACCTT 2820

CTAACTATTA ATATAGAGAG AAGGCCTAAT ACGTCCATTC GCTCTTATCG TTCAGACCTA 2880

TTTGTCTGCA AAcGGTTTGG CACCTTTCTT TTATAAAAAA GAGGTTGCTG GGTTTCATTG 2940

GGTCCATGTC CCTCCACCAC TCAGGATAAG AGAATCCGTT AAAAATAATA GTACCTAATT 3000

AATGAATTAA TGTCAATTTT TCACAAATAA ATTTACAGTA AAATATTGTA GATTAATTAT 3060

GTTAATGTGT TATACTAATT AAATGTAAAG GCTTACATTT AAATTATCGC TTTGGAGGGA 3120

TTTAGGATGT CAGTAGACAC ATACACTGAA ACAACTAAAA TTGACAAATT ACTGAAAAAA 3180

CCAACGTCAC ATTTTCAACT TTCGACGACA CAACTTTATA ATAAAATCTT AGACAATAAC 3240

GAAGGGGTAT TAACAGAACT TGGTGCTGTT AATGCAAGTA CTGGAAAATA TACTGGTCGT 3300

TCGCCTAAAG ACAAATTTTT TGTCTCTGAA CCTTCATATA GAGATAACAT TGATTGGGGA 3360

GAAATTAATC AACCTATCGA TGAAGAAACT TTCTTGAAGT TATACCATAA AGTACTAGAC 3420

TATTTAGATA AAAAAGATGA ACTATACGTA TTTAAAgGcT ACGCTGGTAG CGATAAAGAT 3480
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ATGTTTATTA GACCTGAATC AAAAGAAGAIV GfTACAAAGA TTAAACCTAA CTTCACTATC 3600

GTTTCTGCAC f^ACATT"T v T'A A v*iw^» A A Ann AGCAGATCCA GAAGTTGATG wnnVJ X X >J«» a nj GTACTAAATC TGAAACCTTT 3660

GTCATTATTT CATTTAAACA CAAAGTCATT TTAATCGGCG GTACTGAATA CGCTGGTGAA 3720

ATGAAAAAAG GTATCTTCTC TGTAATGAAT TATCTCTTAC CGATGCAAGA TATTATGAGC 3780

ATGCATTGCT CAGCAAACGT TGGTGAAAAA GGCGATGTTG CATTATTCTT TGGTCTATCT 3840

GGCACTGGTA AAACAACCTT ATCGGCTGAC CCACACCGTA AACTAATCGG TGATGATGAA 3900

CACGGCTGGA ATAAAAACGG GGTCTTTAAT ATCGAAGGTG GCTGCTATGC AAAAGCAATT 3960

AATCTTTCCA AAGAAAAAGA ACCACAGATT TTTGACGCAA TCAAATATGG TGCAATTTTA 4020

GAGAACACTG TAGTTGTAGA AGATGGTTCA GTGGAC1TTG AAGACAATCG TTATACAGAA 4080

AACACGCGTG CCGCTTATCC AATTAATCAC ATTGACAATA TTGTAGTACC ATCTAAAGCA 4140

GCACATCCAA ATACAATTAT AAA V» A A A GCGGATGCAT TTGGTGTTAT TCCACCGATT 4200

TCAAAGTTAA ATAAAGACCA AGCAATGTAT CATTTCTTGA GTGGTTTCAC TTCTAAATTA 4260

GCTY3GTACA» GPGTGGTGTG ArAGAACCTG AACCATCATT CTCAACATGT TTCGGAGCAC 4320

\_VJ X X V_ X X v_ V_ V_ GTTArArrrx VJ X X r\ V V— v_ X /*Vv* X O X X A rtV^,V3 CTGATCTATT AGGTGAACTT ATCGATTTAC 4380

A TG ATfZTTY"; A ivji x ini V- 1 x f3TTA ATACTG <J 1 1 r\r*L X X w GATGGACTGG CGGAAAATAT GGTGTAGGAC 4440

GTAGAATCAG w x x A\#n x x tx\~ APJVPGTCAAA TGGTAAACCA AGCGATTTCT GGCAAATTGA 4500

AAAATGCAGA ATATACAAAA GATAGTACGT TTGGTTTAAG CATTCCTGTA GAAATTGAAG 4560

ATGTACCGAA AACAATTTTA AATCCAATTA ATGCTTGGAG CGACAAAGAG AAATATAAAG 4620

CACAAGCAGA AGATTTAATT CAACGTTTTG AAAAGAACTT CGAAAAATTT GGTGAAAAAG 4680

TTGAAGATAT TGCTGAAAAA GGTAGCTTCA ACAAATAAAT TTGAATACTA AATCaAAACC 4740



ACC<£TGTGA ACGGGTGGTT TGTTCTGCGG CTATAAGCCT TCCTTACTGG CCAGGCGTAA 4800

AAGGGCACTG ACAAGTCAGC CAACTGCACT ACTATTCCAG CAACCCTAAA GGGTTACTCT 4860 

TTTTTCTTTC TTTTTTTATT TTTCTCTCCA GTGAAAGGAT CTAAATATTC TTCCATTGAG 4920

ATTTGGTCTG CAACGATATC CTCTTGTAAT TGATTACGAA TATAATTTTC AATCACTTTT 4980

TTATTTCTAC CTACTGTATC CACATAAAAT CCTTTACACC AAAACTTTCT ATTTCCATAT 5040

CTATACTTTA AGTTAGCATG TCTATCAAAT ATCATTAAAC TACTTTTTCC TTTTAAATAG 5100 

CCAACAAATG ATGATACCCC AAGTTTGGGT GGTATACTAA CTAACATATG GATATGATCT 5160 

TTACATGCCT CTGCTTCAAT TATCTCTACA CCTTTTCTTT CACATAATTG ACGCAATATA 5220

ATCCCTATAT CTTTTTTTAT TTTTCCATAT ATCACTTGTC TTCTGTATTT AGGTGCAAAG 5280
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AAATAGCATC TCCTCGTGTT GATTATTTTG GTTGGCTGAC CAATATTTAT TCTAGCACGT 5400

AGAGATGCAT TTTTTGTGAC AATGGTAGAA CCTTTTCtGa ACCATACGCA TAGCGTATGG 5460

TTTTCTTTTT ACAATTAAAG AGCCAACCGT TGTTATAGTC TAACAATGGT TGGCTCCTCT 5520 

TATTTTATGT GCTAAAAATT TATAGGCAAT TTTATTACAA CAATGTACAT TTAAGGTGAC 5580 

CTTCATGCCA AAATCGCATC ACTCATTTAA TGGAAGCAGC ACGTCTTCAT ATAAAGTACC 5640

GATCCCTAAT TCAACGCATG TAGTACCACA TCTTCAAAGC TTGATAGTTC CCATGCGCAC 5700 

ACCACGTTTC ATACTAGCTA TGCGACTCAA CTTGGTTCAT AAACTCTTTA ATATAAGTCA 5760 

ATGTTTCAAC CATCGCTGGT GGTCTTGGCA CATGTCCTTC TGCCATTTGA TAAAATGTTT 5820 

CATGCGTGGC ACCTTTTAAC TCTAGTTGGT CCGCTAAATA ATACGCATGA TGAATACCAA 5880 

CTTGCTGGTC TTTCCCTCCA TGTACAATTA ATATTGGCGG ACTGTTTTCA TTAATGTTTG 5940

GAATCGCTTG GCGTGCCTCA TATGCCGCTC GATCTTTTTT CGGATGACCA ATCATTCTTC 6000

GTAGCATGCC TCTTAAATCG ACACGTTCTT CATACATTAA ATCAATATCT GAGACACCAC 6060 

CCCAGATTGT ATAACTTGTT ACTGGTAAGT CTTGAAATGT CAACAATCCT TGTAAACCAC 6120 

CTCGCGAAAA ACCAACCATG TGGATAAATG CATGTGGATA TTTATCATGT AGCAACCTTA 6180

ATAATTGCGT CACATCATTT AAATCGCCAC GGTAAAATTC GTCTTTGCCT TCACTCCCAT 6240

TGTTACCTCG GTAGTATGGC CCAATCACTA AAGTTTGACT ATCTGAAAAT TGCATTAATC 6300

TACCTGCGCG CACACGTCCT ACTTGACCTT TGCCACCTCG CAAATAAACT ACAATGCGAT 6360 

TTACTTCATG ATGTGGTGTC ATCATTAAAG CTTTTACTTG TAAGTCATCT GACAAATATG 6420 

TAATTTCTTC GAATTGATGC GTAAAATATT CAATTGGCAT TCGTTTACGT TTGATAAAAC 6480

CCAAGTGATT GCACCCTCTC TACGCATTTT AAAATGGTAC TATCTTGCAG TAAGAAACTC 6540 

CGTTGTGCGA GTTCAATATC ATTGATACAG TTAAACAACA CTGGCCCTGC TGTTTCTAAA 6600

TAATCGTTCT TGCTTACCAA TGATTCAACT TCGATAAAAT ATACATCTTT TACAAAATCA 6660

GTTTGATCAT GTGTTTCAAT GGTATATTGT GCTATGTAAT AAATATTTTT AACTTTGGCG 6720 

CCTGTTTCTT CATATAATTC aCGTGTAACT GCTTCAGCAC TACTTTCCCC GCGTTCCCTT 6780

TTACCACCAG GAAATTCAAT CCCCCGTAAA TTATGTTTGG TAAAAAGCAA TTGATTTTTA 6840

AACGTTGGAA TAGCTAGCAC ATGATTGCCA TCTGCTATCT CATTATCCTT TTTAAATGTC 6900

AAATTAACTT GACGATTATC TTTATCCCTA AACTTCACGC GCATCACATC CCTACATTGT 6960

ATGTTAATAT AATAGTTAAT TACTATCGTT GGAGGCATTA ATTATGAAAA AGATATTCTT 7020 

GGCGATGATT CATTTTTATC AACGTTTCAT TTCGCCACTC ACTCCACCAA CTTGTCGTTT 7080 
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CCTTTATTTA GGTATCCGTC GTATTTTAAA ATGTCATCCG CTTCATAAAG GCGGCTTTGA 7200 

CCCTGTTCCG TTAAAAAAAG ACAAGTCAGC AAGCAAGCAT TCACATAAAC ATAACCATTA 7260 

ATATGGTTGT AATTGAGTTA TATCCACTAA AGGGGGGCGA AATTCGAGTC GCCCCTCTTT 7320 

TAATATGCCT GAATGCGCCA CCACATCTTG TTCAAAATAA TAACCTGCTG GTGTAACATC 7380

TCCTGGATAA TCACCTTTAC GAGCAAGCAT CGCTGTAAAA TAGCGGCTTA AACCATATTC 7440

GTACATGCCG CCAATAACCA CTTTTGCACC ATGACTTTTC AAAGTATCAA TTGCCGTTTG 7500 

CACTTTATCA ATGCCACCTA GACGAAATGG TTTTAATACA ACAACTTTCA CATTGTATAA 7560

TTCTATCAAA TTAATTATGT CCaACAACGA TGTTGCCTTT TCATCAAGGG CTATTGGAGG 7620 

TATTGTTCCA TCCGCTACTT CATCAAGCAT GGAGATATCT TTAAATGGCT CTTCGATATA 7680

AAGAACCTGT TCACGCGCTA ATAACTGTAA CTGTGTGAAA TCTTGACGAT CCAAGGACTC 7740 

ATTTGCATCT ATAACCAATT GAAAGTGAAA GTCTAATTCC CGTAACACTC TAATTTGATG 7800 

CATGATTTGA GGCGTCCATT TTAATTTAAT TCTGGTCGGC TTTGTTGCTT TTAATGACTC 7860

TAGTTGTTTA TTTGATAAGC CGCTCGcTGT CGCTCCATAT GCTACTGAAA ATGAAGGCAG 7920 

TACATGAAAC ATTTGATACA ATGCCATGAC AATAGTTGCC CTTGCAGCAG GCGTATTTTC 7980

CAATGAATCT ACTAATTTTA GTGCTGCTTC ATACGTTTCA AATGATTTAT TTCTATTATC 8040

TTCGAACCAT TGCTCAATTA CATGTTTCAC TGAGGCAATT GTTTCATGAT CATACCAATC 8100 

TGTTTGAAAA GCGTTACATT CCCCGAAATA TGCATTTCCT TTGTCATCAA TCAATTCGAT 8160 

AAACAAACAA TCACGATGCG TTAAAGTGAC TTTCGGTGTT ACAATTTGTG ACTTAAATGG 8220

CTCACTATAT TTATAAAAAT GCAAAGCTGT CAACTTCATC AAATCATCCT CTATACAACT 8280 

TATTTCTTTG TAATTTACCT GTTGATGTAT AAGGTAAAGT ATCAACCTTT TCAAAGTGTT 8340 

TCGCTACTTT ATATTTCGCT AAATGTTGTG ATAAATATGC AATCAATTGT GCCTTTGAAA 8400

TGTCACTTTC ACTGACAAAA TATAATTTAG GCACTTGGCC CCAAGTATCA TCAGGATGCC 8460 

CTACACATAC TGCGTCACTG ATACCTGGAA ATTGctTCGC TACCGTTTCA ATTTGATATG 8520 

GATAAATATT TTCACCGCCA CTAATAATTA AATCTTTACG TCGGTCATAA ATCATGACAT 8580 

AACCTTCATG ATCTATTTCA GCAATGTCAC CCGTATTAAA ATAACCATTT TCAAACGTAC 8640

CCGTTAAATC TGTTGGATAC AAATATACAT TCATCACATT GGCGCCTTTA ATCATTAATT 8700

CTCCATGACC TTCTTTATTA GGATTTTTAA TTTTTACGTC AACATTGGCA CTTGGCATCC 8760

CTACAGTGTC AGGACGTGCA TGCAACATTT CCGGTGTTGC TGTTAAAAAT TGCGAACATG 8820

TCTCAGTCAT ACCAAATGAA TTATAAATTG GCAGGTTATA TTGTAATGCC GTCTCTATCA 8880 

AACCTTGTTG CATAAGCCAA TTTAAAGTTT GTGGCACAAG CGAAATGTGC GTGATTCGTT 9000

CATTTTTAAT CATCGTTAAA ATTTGTTCGG CATTGAATTT ATCAACAATG CGCACAGTAA 9060

AACCTTCAAT AACAGCTCTT AAAAGTACAC TGAGACCCGA AATATGATAA ATCGGCAAGA 9120

CAGATAGCCA ATTAGTGTCA CGATCAAATC CCAAGCTCTC TTTACATCCG ATTGCACTGG 9180

CATAATGATT ACGAAACGTT TGTGGCACCG CTTTTTGAGG GCCCGTTGTC CCTGATGTAA 9240

ACATAATCGA TGCAATGTCA TCTAAATTAA ATGATGTATT TAATATGTTG GACGGCGACT 9300

CTTTCGGCAC CACAGTTTCA TTCGATGTTT CATATTGGAT ACCCATTGTG TTGTCCAACA 9360

AACTGTTCGT TGTAATATCC CTTCCAGCGA ATTCAATATC ATCCAGCGAT ACAATTTGAA 9420

ACCCTCGTAA TTCCAGTGGC AAGGTACAAA AAATCAATTG TACATCGATT GACTTCATCT 9480

GATTCGTCAT CTCATTAGGT GTCAACCTTG TATTAATCAT CGCAATTTCA ATATTTGCCA 9540

ACCAACATGC ATGTATTAAA ATGATCGATT GAATCGAATT ATCTATGTAT AGCCCAACAC 9600

GAGATTGTTG ATAAGCCTTG AGTCTTTTAG CCAATAGACT CGCTTCACAG TATAAATTTT 9660

GATAAGTATA AGATTCTTGA CCGTCTGTTA TCGCAATATG ATGTCCATTT TGTTGTGCTT 9720

GTTTATATAA CCAAAAGTCC ATGCGTTATT CCTCCAAAAT CATTTACATT ATAATTATAA 9780

CGATTTTATG ACATTCTAGC AGTGGTTATG TTTAAAAATA TAAAAAAGTA GACGAATTGA 9840

TGCATTGATA TGATTGTTAT AATGCTCAAT ACATATCGTT ATATCATTCG TCTACTATTA 9900

TCAGTTATTT TTATTTAATT TTAGTGTCAT TCTGTCATTT TGATGTGGTG ATTTACCCAT 9960

TGTTGCCACA TCATCTGCAA TGTCAATTGG TATACGGTTC ATGTCTTGTA ATGCACTTAA 10020

ATGGAATACT TCATCATCTA AATTTTCAAT GAGATATACA TAATATGTTA CCTTGTCCTT 10080

TTTATATTTT AACGTTTTCC AAAAGTCCGG CTTGCAATTC AATACATTAT CCGGAATATA 10140

TTCAATAAAT AAGTAACGTT TGCTGCCTAC TTTGTCTATG AAATATTTTG CAGTGCCTTT 10200

TTCTATACCT CTTATATGTG CATAGTCTGC TGAAAAGTAA ATACTACCTA TTGTTTCATT 10260

ATGTTGTTGT ATTTCAAATC GTTGGCCTAC TATTTTATTA TTTGTGCTAC nGGGGACTTA 10320

(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1477 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

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Cj I GTGGAllG GATTTTAAAA TCACCCTCAT AAATACTGTC ATCAATATGA TAAGTTACAA 120

TTTCACCTAT TATTAAATCA GCCCCATCTA ATACATCTCC AAGCAATATC ATTTGCGmTA 180

ulll Aw\ I 1L f> to to rr^^'w 1 t> mm GAATCTCATT TTCGCATCTT TAATTCCTGG CGTCTTAATC GTTGTAGATG 240

'1 wi i * * * TV /"^TV« TV TAATTCTGTA CGACTCAACT CACTGTCACC ATATGCTAAC GGCGCTGCAG 300

TCTCAITAAT ATCTTGAACA TTATCTTCGT CTGTAATATG CACAACAAAG TCTCCAGTCC 360

GTTCTATATT TAATGCAGTA TCTTTTCTCT TACCTCCTGC ACGTTGAACT GCAATAGCAA 420

TCATTGGCGG ATGATTATTA ACAATATTAA AAAAGCTAAA TGGTGCTGCA TTTACTGATG 480

CATCTTGATT TAATGTTGTA ACAAAAGCTA TAGGTCGTGG AATAATTGAA CCAATTAATA 540

ATTTATAGTT TTCTCTAGCA GTTAATGATT GTGCATGAAA CGTATACATA ATACCTACCT 600

CTTTTCTAAG TATATCTAGG TATTTCTCCG ATTTTGGTTA ATTTAAACAT CTATTCTCCT 660

CTGAAAATCA CTTGTATTTA TTTAGCAAAT CTTTTGAAAT ATGACACATA TGCATATCTT 720

CTGGATATTT TTCTAAATGT TGCTGATGTT CTTCAGCACT TTTAATGTAG TTAGACAGCG 780

GTAAGACTTC CACTGCAATT TGATCTCTGT CTTTACGTCG TTCAATGAAC TGACGCGCTT 840

CAATTAAGTG GTCATCTACA CAACTATATA AACCCGTTCG ATACTTTTGT CCAATATCAT 900

TTCCTTGTTG ATTCACACTG TAAGGATCAA TGATTTCAAA TAAATAATTC ATAATGTCTG 960

TAATTGTTAA CATACGATCA TCGAAATGAA GTTTGACACA TTCAGCATAA CCATCATACG 1020

GACCGTCTAA TTTAGAGCTT CTTCCATTTG CTCTTCCTGC TTCTGTATGT ATAATTCCAG 1080

GTATTGTTGC AAAAAATGCT TCAACACCCC ATAAACATCC TCCTGCTACA TAAACAACTG 1140

CCATATTTAC ACCTCATCAT CCTTTTTTAT ATTTTTAACA AGGTTATACC ATTTAATACC 1200

GCCATGACAT GATTCTGATA CACCTTCATT ACGATACCCA TATTTTTCAT AAAATGAAAT 1260

TAATGATTCT CGACATGTTA ACGT^CACC" ATGTCGATGA TX^TTCTTAG CAAGAGTTTC 1320

AAAATAGTTT AGTAAGCGAC CTGCAATACC CTGACCTTGA TAATTTGGTG CTACAACAAG 1380

ACCTAACACA CTAATATAGC CACCTTCACT ATTATTTGTG GAGACATTTT TAAATAAATC 1440

ATCGCTAATG TAACGCTCTT TTATGACTGG ACCGTTG 1477


(2) INFORMATION FOR SEQ ID NO: 145: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3976 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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AGGTGATTAT CCTAAAAATG CTCATGAGGT CGCTATTAAT GATAAGTTAG CTGCAGACAA 60

CATTAGAGTC GGGGATAGAT TACATTTTAA AAATAATTCA ACTAGTTATA GAGTTTCTGG 120

TATTTTAAAC GACACAATGT ATGCGCATAG TTCCATTGTG CTATTGAACG ATAACGGATT 180

TAATGCATTG AATAAGGTTA ATACGGCATT TTATCCAGTG AAAAATTTAA CACAACAACA 240

ACGTGATGAG CTTAATAAAA TAAATGACGT TCAAGTTGTG AGTGAAAAAG ATTTAACAGG 300

TAATATTGCG AGTTATCAAG CAGAGCAAGC ACCGTTAAAT ATGATGATTG TTAGTTTGTT 360

TGCTATTACA GCAATCGTTC TAAGTGCATT TTTCTATGTT ATGACGATTC AAAAAATATC 420

ACAAATTGGC ATTTTGAAAG CAATTGGTAT TAAGACAAGA CATTTATTGA GTGCGTTAGT 480

TTTACAAATT TTAACACTAA CAATAATTGG GGTAGGTATT GCTGTGATCA TCATAGTAGG 540

ACTATCATTT ATGATGCCGG TAACGATGCC TTTTTACTTA ACAACGCAAA ATATTTTATT 600

AATGGTGGGG ATATTTATAT TAGTAGCGAT TTTAGGTGCC TCACTATCAT TTATCAAATT 660

ATTTAAAGTG GATCCTATCG AAGCAATTGG AGGTGCAGAA TAATGGCATT AGTCGTTGAA 720

GATATCGTCA AAAATTTCGG AGAAGGTTTG TCTGAAACAA AAGTTTTAAA AGGTATTAAT 780

TTTGAAGTGG AACAAGGGGA ATTTGTCATT TTAAATGGTG CCTCTGGTTC TGGGAAAACA 840

ACATTGCTAA CGATATTAGG CGGATTGTTA AGTCAAACGA GTGGTACAGT GCTTTACAAT 900

GATGCGCCAT TGTTTGATAA ACAGCATCGT CCTAGTGATT TACGATTGGA AGATATTGGT 960

TTTATTTTTC AATCTTCACA TTTAGTTCCT TATTTAAAAG TGATAGAGCA ATTGACACTC 1020

GTAGGTCAAG AAGCGGGAAT GACCAAACAA CAAAGTTCAA CAAGAGCAAT ACAACTTTTG 1080

AAAAATATTG GTTTAGAAGA TCGCTTGAAT GTATATCCGC ATCAGTTATC TGGCGGTGAA 1140

AAGCAACGTG TTGCGATTAT GAGAGCATTT ATGAATAATC CGAAAATCAT TTTAGCAGAT 1200

GAGGCCACAG CAAGTTTAGA TGCCGATAGA GCAACAAAAG TTGTTGAGAT GATACGTCAA 1260

CAAATTAAAG AACAACAAAT GATTGGTATT ATGATTACAC ACGATCGAAG ATTATTTGAA 1320

TATGCAGATC GAGTGATTGA ATTAGAAGAT GGCAAAATAA CTGATTAGTG GCTTGTAAAG 1380

ACGCTAAATG TTAATGATTT AAGACATAGT AGTATAAAAG TTAGATAACA GAATACGATT 1440

TGGGTTTACA AAAAACAGGC TGGGACATTA AGTTCTTAGG CAATGTAAAA AAGCTGATTT 1500

CTATTAATTA TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTTATAATG TACAGCTCGT 1560

TGCATTCATA TAGCTTGAAG TCACGTTTAA AACCATATCT ATCATTATGG TATGCATATC 1620

TTTTAAAACC TATTCTTTTG TTATTAGGAC ATATAAATTC ATCATTAAGT TCGTCATATT 1680

TCCAATTTTG AGTGTTAAAA ATGTCACTTT TAAACTTTCT AGTTTTATCT TTAATAAACA 1740
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CACTATCATA 


ACATGCATCA 


GCTACAATAT 


A^- 1 V^V^OV^ X AA 


ft T" ft ft /"»/-•/"• i\ ftp 


IllAl X X X y/VA 


1860 






TCATTGTTAA 


AAATGGAATT AAAGTTCTAG 




PTTTTP ft ^ ft *T* 

v> X X X 1\jAAAI 




1920 




5 


ATAAAACAAA 


TTGAGAATTT 


GTCGCTATTT 


CZT A A A TTYIT A 
VJX AAAX X \9 X A 


X\_l*XVj%*l»X 1A 


APTT/^PPPft » 
AVj 1 IVjLjCCAA 


1980 






AGTGTCTTAT 


rrn-iTAAAG 


TATTTAAAAG 


TAJiAATTUPA 


TT.TTAATAm 
XVJX XnnXnV^vi 


TArtTATTA AT 
lAul AX XAAi 


204 0 




10 


GGCGAGACTC 


CTGAGGGAGC 


AGTGCCAGTC 




IjUULLAALAL 


ft /•> ft ft -w- — ,#rv^» jv 
AtxAAJT C 1\jAC 


2100 




ATATAGTCAG 


CTTACAACAA TGTGCCGGTT 


VJUUU X UUL X \3 


A/"2 A PPJ*2 P A 


V_ 1 Avj<JAAIj<jCj 


2160 






ACCCGTCATC 


AAAAATTCTA 


TTTATAGAAT 


7TTA P ZlfST A 21 


TY^TltOr 21 P. 21 T 


f~*r*T* o ft T'ft r~*r*r* 


222 0 




15 


AAg c CATTCA ATACGAAGTA 


TTGTATAAAT 


a r; Aft a a p An p 


ArTtfiP 21T21T 
AVj 1AAUA X A X 


TTTCTAATTG 


2280 






AAAATTATTT 


TACTGCTGTT 


TTTTTTAGGG 


rtX J. nr\ X V7 X 




AGTTTATTTA 


2340 






TTTTCAATAT 


AACAATTGTC 


TAATCAAGGA 


TTIiIvrfi& ATA 
X XAAv-VjAAX A 


TTTA 21 ftp ft Tft 
X X iAAAvjAlA 


\s 1 i i CiACGCA 


2400 




20 


ATATTAGAAA 


CAACCTATAA 


TAATAGTTTG 


X X iui\JUAl X 


AALlAl 1A1A 


ft ft T* 7\ KMt ^*^<j*t 

aATAAAAGCG 


2460 






GCGTAAAGAC 


ATATAAACCA 


ACTACTTGAA 


v rtjf-V 1 A X >n-M.V_^j 


'["I'ft ft T" TV ft Pft ft 

X 1AA1AALAA 


1 LTATACTGA 


2520 






TACATTACGC 


CTAGATAATC 


TTTGATGAGC 


A P & TY2T A 2i f"2 A 
A\-A X\» X AAvxA 


21 21 ft ft /""TV^ 1 ft I* ft 
AAAAO X vjA X A 


IIjGTGTATGA 


2580 




25 


CTTCCGACAC 


CATCGATAGA 


TAAACCTAAT 


XXX X\j%»V»^-. X A 


ur X L-Vj X AALKj(J 


GCGCAATACA 


2640 






TGAAACTGAC 


TTGTtACACA 


AACAATTTTA 




r^ftTft/^ftftft 'i*i' 
VjAXAt^-AAAl X 


CjTTGATGATT 


2700 






TGTTTAGAAT 


ATAAAAAGTT 


TGTGTATGTA 


X X X A 1 AVjA I 


/"* ft /*• ft TTV /"» ft T 


TAGTATATCT 


2760 




30 


GTTTTATCAA 


CACCATGTGC 


AATCAAATAA 




^T*ft ^ ^ ^^^^^^^^^^ 

V- X AAAo(_ 1 1L 


ft ft ft ft 'HM'^^nr* 
AGAAATTGGT 


282 0 






TCGTCTGGTC 


CTTGTCCGCC 


AGATACAATG 


ft TP'I "| ™| V 1*\ m 1 Y2 
A X *w 1 X loi IVj 


L- IAjA X L»V_ X X vj 


I ICjTTGATAG 


288 0 




35 


ATATCAAGTG 


CACGATCTAA 


ACGCGCTGCA 


X X w\J X V3 


X oAUAAA 11L 


^^OTft ft ft ft ft Tft 

GG1AAAAATA 


294 0 




CCAGCACCTA 


ACACAATTAT 


GATATCAACT 


Xk7X XVj X 


A TP. A 'PPT 21 TP. 


T^Tft Tft TP ftT 1 

X 1 A X A XXjAT 


3000 






- - -AGTCTCCAAA- 


" CGAG ATAACA AATAAAGGTT 




AA AP.APATAA~ 


"TAT A^PTftTftT/"* — 

1AIAGCI AAL 


3060 




40 


CACATAGACA 


AACCTTTCAC 


AATAGGTGAC 


Tf2 A A TOfVr A P 


TT ft *p ft ft ft «T» ft f 

X lAlAAnlAu 


ft ft PTW^Tn ft T" 

AAL» 1\jC Xaj AT 


3120 






GTGTAGAGTA 


CAAATTTATA 


TGAAAAAGAT 


AATAATTT1 " 1 1 
nnl/uil X X X X 


TA ATA 21 AT A A 
InAlAH/ilAH 


Vj L.viA\_ X AvsAA 


3 leO 






GTATGAGAAA 


ATAAATATCT 


ATGTTTGAAT 


AfVATOATA A 


ft ft » k i ■ ft 


T ft T ft ft ft mPTT 

X AX AAATGTT 


TV* A 

324 O 




45 


ACAAACATAG 


ACCAAGGGAA 


AGTATAGGTC 


A Trt 21 TPPT A T 


ArtATrtftPTPft 


ft ft ft ft ft *t* ft t^/-* 
LAAAAATATC 


3300 






GATATGACAA 


CTAAGATGTA 


GCATGTTAAA 


TTTAACGTCA 


GAGTATAGTT 


GAAAATTAAC 


3360 






GGACAAATAA 


CGATAAGTAT 


AAATATTAAT 


AATAAATTCA 


ATAACATACT 


GACACCTCGC 


3420 




SO 


TTATAATAAA 


TATTAAATAT 


AAATGTAGAT 


GATTTAATTT 


ATTAAAGCAA 


GGAGAAAGCA 


3480 






GCAACATGTA 


AATCTTAATT 


TGTTATATTA 


TATATGGGTC 


AATATTTTTG 


TGTTTTTTAG 


3540 
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TATGGTAAAA CATTTACAAG ACCATATTCA ATTTTTAGAG CAGTTTATAA ATAACGTTAA 3660 

CGCATTAACT GCAAAAATGT TGAAAGATTT ACAAAATGAA TATGAAATTT CATTAGAGCA 3720 

GTCTAACGTA TTAGGTATGT TAAATAAAGA ACCTTTGACA ATTAGTGAAA TCACGCAAAG 3780 

ACAAGGTGTA AATAAGGCCG CAGTAAGCCG ACGAATTAAA AAGTTAATCG ATGCTTAATT 3840

AGTTAAGTTA GATAAACCAA ATTTAAATAT TGATCAACGT TTGAAATTCA TAACCTTAAC 3900

TGACAAAGGT AgAGCATATT TGAAAGAACG TAATGCGATT ATGACAGATA TTGCGCAAGA 3960

TATTACTAAT GATTTA 3976
(2) INFORMATION FOR SEQ ID NO: 146: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

GCTACCTAGG CATTTAAGAG ATCAAAAAAT GTATGAATAT GAACGTTATT TTTATGAGCA 60

AGAACTTAAT GGCGTTGATG aAGGGGAAAT TTTAAAGAAG TTAAAAGACC CACAAGATGT 120

TGCAGCTGAA ACAAAAGCTA GAAGTGTTAT TGATTATGCT GAATCTAAAC CAACATTTGA 180

AAATATTTCA AGAGCTGTTG CTGCTTCATT AAGTTTAGGC ATTCTATCTA TTTTTGTCAT 240

CCTTATACCA GTATCTATAG TTGGATTATT TGTATTAGCA TTATTTTTAA TATCACTTTT 300

GCTGCTGTTT TGTCCAATTA TTTTATTAGC ATCAGCAATA TCCAGAGGAA TTGTGGACTC 360

AATTAGTAAT GTATTTTTTG CCATATCATA TTCAGGATTA GGATTAGTAT TTATCATTGT 420

CATATTTAAG ATTTTAGAAT ACATTTATCG TTTAATCTTA AAATATTTAC TTTGGTATAT 480

TAAAACTGTC AAAGGAAGCG TTAGAAAATG AAGAAATTCT TTTTTATTGG GCTTTTAGTG 540

TTTGTTGTCT TTTTTACAGC AGCAACCATT ATTTGGTTCA GCTATGATAA AAACAAATAT 600

GGTACTAAAC AATATGATAA AACATTCAAA gACGATGCTT TTGACAATGT ATCTATAAAT 660

TTGGATAGTA CAGAACTTCG TATAAAACGG GGGAATCAAT TTAGAGTTAA ATATGATGGT 720

GACAATGATA TATTAATTAA TATAGTAGAT AAGACGTTGA AGATTAGTGA TAAAAGGTCT 780

AAGACAAGAG GATATGCAAT TGATATGAAT CCTTTTCATG AGAATAAGAA AACGTTAACG 840

ATTGAAATGC CTGATAAAAT GATTAAACGT TTAAATCTAT CATCTGGAGC AGGAAGTGTT 900

AGAATCAGTG ATGTTGATTT AGAGAACACA AGTATTCAAA GCATTAACGG TGAAGTAGTT 960

AGTAAAAGTA ACATTAAAAA TAGCAATATT AAAGTTGTTA TTGGTACGCT ACAAATCGAC 1080

AAGAGTCAAA TTAAACAATC [illegible] AACGATCATG GTGACATTGA ATTTAAAAAC 1140

ATGCCATCAA AAGTAGATGC AAAAGCTTCT ACTAAACAAG GAGATATTCG TTTTAAGTAT 1200

GATAGTAAAC CTGAAGACAC TATACTAAAG CTAAATCCGG GAACGGGTGA TAGCGTAGTT 1260

AAAAATAAAA CATTTACTAA TGGtAAAGTT GGGAAAAGCG ACAATGTTTT AGAATTTTAT 1320

ACGATTGATG GTAATATCAA AvjTTGAATAA ATAAAGGATG TAAGCACCGA TATTAGGAAG 1380

CATAATTTCT CTAATATCGG TGTTATTTAT TTGTTGGCAA AAGTTAAGTC GGTATCTATA 1440

TTGCCAGTAA AGTGAGTGAT ATTAAGGTCT TGACCATCTA ACCATGATTT GAAATCTATT 1500

ATTTCTGGTG GCGCATTTTC TCCCAATGTA AAATATGCAG TTAATGTTTC AGGTTGATAC 1560

ATTGATGTAT GGATGGTGCC AGACCAGCTT TTGAATAGTT TACTGTAAAT TTCATACTGA 1620

GGATTATTGA ATAACTTAAA TGCTGTAGTC ATATCTAAAT TATCATTAGT TTGTGAAATG 1680

GTACGCGCCA GTCTTTCTTT AGATTCTTTT GTATAATTAC GATTTTCATG TGTTAATATT 1740

TCAAAATGAT TTGTACATAT ATTATCATAA CGAACATCTA TTGATCTCGG TGTCACTTCA 1800

ACAATTGCAT GGTTCAATGA TTTGTCCATC AGTATGTAGC TAAATGAGCT TCTGTGTGGT 1860

ATTTCTTTCA ATAATTGGAT TGCTTCTGTT ACATTTCGGC AATTTTCAAG AATTAGACGA 1920

CCAATCATAT AACATACAAA ACCATTTGCT GGTTTCTTCC GGTGCATAAA GTTATAGCCC 1980

ATAGTTAATC CTGACTCATT CATACCATCC ATTCTTCCAG TTACCCTTGA TACAGGACCA 2040

ATTTGAGCTA AACCGCTATC TGTAGGTTGA TAAAGTAAGT AGCGACCATC ATAAGTTGCA 2100

GGGTGGTAAT CATAATTTCT AACCATGAAG TCTTTGCCTT GAAAGACCGT GCAaCCACTT 2160

TCTTTTAAAT CGGTAAAACG ATAATGTCCA AAGTTTAAAA TAATTTGGCG TGTTGGCATT 2220

TTGAGTATAC TTTGTAGTCC CATTAATTCT TCCCATATTT GAGGTGCGTA TGTTTGGAAT 2280

ATTTGATAAG TTTCATTTAC ATCTATATCG AAACGTGGGA CaCnTTTTTT CCATTCTTTT 2340

TCTCGATTTT TTAGAAGAGG TGTTTGTTGA AGCCATTTAC CAGTTTTAAC ACCTAACTCG 2400

AAATGTGAAC CTCTAAAAGT CATGATATCT GATGTCACTT GTTGCATATC ATCGGCCCCT 2460

TTCTTTTTAG TTGTAATATA TTGTAAATAA ATAGTAATCG TATGTATATT GAATGTCATG 2520

TTAAATAAAG TTATATTTTA CTAAATGAAA TATAAAATTG TTTGAGGTGA TTTCTCGGTG 2580

TATAAGACTT ATCAATCAGT TAAAACATAT TTTTATAGAT GGTGGGGATA TTGAGTTAAA 2640

AACTTAAAAT CATCTTATCA TAAATATCAA TCTTAAGTTA GCATTCACGA TAATAGTCAT 2700

TGTTAACATT AGCATATAAG GTCATGTCAC GTTGAAACAG AGGTTCCTCG GCATTTTTGA 2760

TTATTTAATG ATTATTCTAT ATATGATAGT ATAATGAAAT GTAGATAGGT ATTTAATTTA 2880

ACAGAGGTGA AATTGAGATG TGGAATTTTA TTAAATGtGT GkTTAAATTC GTATTTAGCT 2940 

TAGTTGCTAT TACAACATTA GTTGCTGGTG TTGGTGTAGT AGCATTTGCT TATATCTTTA 3000 

AAAAAGATTT TGAAGATATT GAAAGAAAAA CTAAAGAAAT TATTTCTGAT ATTGAAAGTA 3060 

AAAATAACTA ATAACATTTA GAGGCTGGGA CATAAATCCC TAAAAAACAG CAGTAAGATA 3120 

ATTTTCAATT AGAAAATATC TTACTGCTGT TCTCTATTTn ATcAmTACTt CGTATTGAAT 3180 

GGCTTCGCTT TCCTAGGGTG CCGTCTCAGC CTTGGTCTTC GACTGGCACT GCTCCCTCAG 3240 

GAGTCTCGCC ATTAATACTA CGTATTAACA TGTAATTTTA CTTTGGAAAT ACTTTTAAAA 3300 

AATAAGACAC TTTGGCCCAA CTTGGCACAT AAATGTAAAA TTCAAT 3346
(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2375 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GTTGAAGAAA GAAATATAAC AGTCAATTAT AATTATAACC TTGTTGAAAT CGACGGTGAC 60

AAAAAAGTGG CTACATTCGA ACATATCAAA GCATACGATA GAAAAACAAT AAGTTATGAT 120

ATGTTACATG TAACACCACC TATGGGTCCC TTAGATGTAG TAAAAGAAAG TACACTTTCA 180

GATAGTGAGG GTTGGGTAGA TGTTAACCCA ACCACATTAC AGCATAAAAG CTACTCTAAT 240

GTATTTGCAC TTGGTGATGC TTCAAATGTA CCTACTTCAA AAACAGGCGC ACTATTcGTA 300

AGCAAGCACC TATCGTCGCT AATAATTTAT TGCAAGTGAT GAATAATCAA ATGTTAACGC 360

ATCATTATGA TGGTTATACT TCATGCCCTA TTGTTACTGG ATATAATAGG TTAATACTTG 420

CAGAGTTTGA TTATAATAAA AATACTAAAG AAACAATGCC GTTTAATCAG GCCAAAGAAC 480

GTaGAAGTAT GTATATATTT AAGAAAGATT TATTACCTAA AATGTATTGG TACGGCATGC 540

TAAAAGGATT AATATAATAA AGTACAGAAA ACAATAAATT TTTAATGAAA AATCTTTTAC 600

TATAAAAGAT TAAGTATTTA AATGACGTGT CAGTGTTGTG TTTATATGTC GTGAATTTTT 660

AGCTCTAAAT AGTATAAGAT TGAAAAAGTT GTTACTGTTT TAAATGATCA CGATGAAGTC 720

ATTCAATAAG AATGATTATG AAAATAGAAA CAGCAGTAAG ATATTTTCTA ATTGAAAATC 780

ATCTCACTGC TGTTTTTTAA AGGTTTATAC CTCATCCTCT AAATTATTTA AAAATAATTA 840




AGATATTCAA ACCACGTGTA CTCAAAATGA TAGCTTGGTA TGTACCTCCA ATAGTAATTT 960

CAATAACTTT GTCTGTTGAA CACTAAGAGC AATTTTAATT TCATAATGTG TTGTAAACAT 1020

TTTTTTTGAT TGGAGTTTTT TTCTGAGTTA AACGATATCC TGATGTATTT TTAATTTTGC 1080

ACCATTTCCA AAAGGATAAG TGACATAAGT AAAAAGGCAT CATCGGGAGT TATCCTATCA 1140

GGAAAACCAA GATAATACCT AAGTAGAAAG TGTTCAATCC GTGTTAAATT GGGAAATATC 1200

ATCCATAAAC TTTATTACTC ATACTATAAT TCAATTTTAA CGTCTTCGTC CATTTGGGCT 1260

TCAAATTCAT CGAGTAGTGC TCGTGCTTCT GCAATTGATT GTGTGTTCAT CAATTGATGT 1320

CGAAGTTCGC TAGCGCCTCT TATGCCACGC ACATAGATTT TAAAGAATCT ACGCAArCTC 1380

TTGAATTGTC GTATTTCATC TTTyTCATAT TTGTTAAACA ATGATArATG CAATCTCAAy 1440

ArATCTAATA GTTCyTTGCT TGTGTGTTCG CGTGGTTCTT TTTCAAAAGT GAATGGATTG 1500

TGGAAAATGC CTCTACCAAT CATGATGCCA TCAATACCAT ATTTTTCTGC AAGTTCAAGT 1560

CCTGTTTTTC TATCGGGAAT ATCATCGTTA ATTGTTAACA ATGTGTTTGG TGCAATTTCG 1620

TCACGTAAAT TTTTAATAGC TTCGATTAAT TCCCAATGTG CATCTACTTT AGTCATGCGT 1680

TTGATAAAAA CTTAAATAAT ATTAATTCGG TCATCAGTGG CGTTAAATCT TTTATCATTT 1740

TTAGTTATAG TTGATAAATT TATATTTATA AGCATATATG GATATTTCAT CAAAAATTTT 1800

TATTTATATA AATCCGAACT GCATACATAT TTGTTTAAAT AAGAGGTATT ATTTTTCGGG 1860

AAATTGCTGT CTGAGTTAAA AGGATTAGTT TTATAAAATG AGTTGAACTA TAGCCAAAAA 1920

CGATTAAAAT ACTGATAATC CATTTTTGtA TTATGTTAGG GACTTTTTTA CTTAATTTTA 1980

ACCCTATTGG aGCmAATATA ATACTCCCTA TTATAAGGAA TAAGGCGTCA TATAAaGGGA 2040

TATAACCTTG AATAAGTTTG ATGACAAAAG CACCAATTGA AGATATAAAA GCAATTACTA 2100

TACTf^TTAGC GACTACAGTA TTCATTGGTA ATTTGAATAA AACCAATAAT ATAGGAATAA 2160

TAATGAAGGC ACCACCTGCA CCTACTATAC CTGAAATAAT ACCAATGAAA AGGCCAATGA 2220

TAACTAATAA ATATTTATTA AATGAAGACT TTTCGGAACT AGGTTtCACT TTAATAAACA 2280

TTAATGTTAA TGCAAGTAAA GCAATAATGA TATATACCGT ATTTACAAAT GTAGCATCAA 2340

ATAAATTTGC TAGAAATGCA CCTAACATAC TCCCT 2375

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6115 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

GAGGTTTCTA GACAAGCTTT TAATAACTTA CCAAACTCAT TAAgrTGGTT gTGtTGGACT 60

GCCtATTATC mAAGtATTAT GaGTTGTTTA ATATTAGtGC TAArACATAC GAAGAGTGGT 120

TTAAACAATT TAGTAGTAAG AAAGCACAAT TCAGTATTAA TCTCACGGAT AAATGGATAA 180

TTCAAATCGC ATATGGTAAA TTAATAATAA TGGCTAAAAA TAATGGCGAT ACATATTTTA 240

GAGTTCAAAC AATTAAAAAG CCAGGTAATT ATATTTTTAA CAAATATCGA TTAGAGATAC 300

ATTCTAATTT ACCAAAATGT TTATTTCCGC TTACAGTGAG AACACGACAA AGTGGCGATA 360

CATTTAAACT GAATGGGCGC GATGGTTATA AGAAAGTGAA TCGCCTGTTT ATAGATTGTA 420

AAGTGCCACA GTGGGTTCGG GATCAAATGC CAATCGTATT GGATAAACAA CAGCGCATTA 480

TTGCGGTAGG AGATTTATAT CAACAACAAA CAATAAAAAA ATGGATTATA ATTAGTAAAA 540

ATGGAGATGA ATAGCGTTAT GCATAATGAT TTGAAAGAAG TATTGTTAAC TGAAGAAGAT 600

ATTCAAAATA TCTGTAAGGA ATTGGGAGCA CAATTAACAA AGGATTATCA AGGTAAACCA 660

TTAGTATGCG TGGGTATCTT AAAAGGCTCA GCAATGTTTA TGTCAGATTT AATTAAACGA 720

ATTGATACCC ATTTATCAAT TGATTTCATG GATGTTTCTA GTTATCACGG AGGCACTGAG 780

TCAACTGGTG AAGTTCAAAT CATTAAAGAT TTAGGTTCTT CTATTGAAAA TAAAGACGTA 840

TTAATTATTG AAGATATCTT AGAGACTGGT ACTACACTTA AGTCAATTAC TGAATTATTA 900

CAATCTAGAA AAGTTAATTC ATTAGAAATA GTTACTTTAT TAGATAAACC AAACCGTCGT 960

AAAGCGGACA TTGAAGCTAA GTATGTAGGT AAAAAAATAC CAGATGaATT TGTTGTTGGt 1020

TACGGTTTAG ATTATCGTGA ATTATACCGA AACTTACCAT ATATCGGTAC GTTAAAACCT 1080

GAAGTGTATT CAAATTAATT TTTTAATCAA TTTCAGTTAT TATTACTATG CGTTTGAGAA 1140

ATAATAGTGT AGACTCAAAA ATATGAAAAA TGTATTTCAT ATATATTTAA TTTTAGACAA 1200

GACATATGTC TTGAAAAGTT GAAAAATATA GAGATTGATA AAACTAATAC GGGTGTGAAT 1260

GACATTGATG TTAAGCTCAA TTACTAGCTT ATAAAACATG TCATATGTTA CAATTTTTGT 1320

TAGTTTTATT ATGGGAAGTA GGAGGAAATG ACGCATGCAG AAAGCTTTTC GCAATGTGCT 1380

AGTTATCGTA ATAATAGGCG TTATTATTTT TGGTCTATTT TCATATTTAA ACGGTAATGG 1440

AAATATGCCG AAACAGCTTA CATATAATCA ATTTACTGAG AAGTTGGAAA AAGGTGACCT 1500

TAAAACTTTA GAAATCCAAC CACAACAAAA TGTCTATATG GTAAGTGGTA AAACGAAAAA 1560

TGATGAAGAC TATTCATCAA CTATTTTATA TAACAACGAA AAAGAATTAC AAAAAATTAC 1620

TGATGCTGCT AAAAAGCAAA ACGGTGTAAA ATTAACGATT AAAGAAGAAG AAAAACAAAG 1680

[illegible] [illegible] AAGGTGGCGG TAGTGGCGGT CGTATGATGA ACTTTGGTAA 1800

[illegible] [illegible] ATAATAATAA ACGTCGTGTT CGTTTCTCTG ATGTAGCAGG 1860

GGCAGATGAJX [illegible] [illegible] AATTGTTGAT TTCTTGAAAG ATAATAAAAA 1920

[illegible] [illegible] [illegible] AGGTGTCTTA CTTGTTGGAC CTCCAGGTAC 1980

TGfiTAAAArA [illegible] [illegible] AGGTGAAGCT GGCGCACCAT TCTTCTCTAT 2040

TAGTGGTTCA GACTTTGTAG [illegible] TGGTGTTGGT GCGAGCCGTG TTCGTGACTT 2100

ATTCGATAAT GCTAAGAAAA [illegible] TATCATCTTT ATCGATGAGA TTGATGCTGT 2160

TGGTCGTCAA CGTGGTGCAG (arbrTGGTGG CGGTCATGAT GAACGTGAAC AAACCCTAAA 2220

CCAATTATTA GTTGAAATGG ATGGTTTCGG TGAAAATGAA GGTATCATTA TGATAGCTGC 2280

TACAAACCGT CCTGATATCC TTGACCGAGC CTTATTACGT CCAGGTCGTT TTGATAGACA 2340

AATTCAAGTT GGTCGTCCAG ATGTGAAAGG CCGTGAAGCA ATTCTTCATG TTCATGCTAA 2400

AAACAAACCA CTTGATGAAA CGGTTGATTT AAAAGCAATT TCACAACGTA CACCTGGTTT 2460

CTCAGGTGCT GATTTAGAGA ACTTATTAAA TGAAGCATCT TTAATTGCTG TACGTGAAGG 2520

TAAAAAGAAA ATTGACATGA GAGATATCGA AGAGGGAACG GATAGAGTTA TAGCCGGACC 2580

TGCTAAGAAA TCTCGAGTTA TTTCTAAGAA AGAACGTAAT ATTGTTGCTC ATCACGAAGC 2640

TGGTCATACA ATTATCGGTA TGGTACTT^GA TGAGGCAGAA GTAGTGCATA AAGTTACTAT 2700

TCjTTCCZACGT GGACAAGCAG GTGGTTATGC AATGATGCTA CCTAAACAAG ATCGTTTCTT 2760

AATGACTGAA CAAGAGTTAT TAGATAAAAT CTGTGGTTTA CTTGGTGGAC GTGTATCAGA 2820

AGATATTAAC TTTAACGAAG TATCAACAGG [illegible] GACTTCGAAC GTGCAACACA 2880

AATCGCACGC TCAATGGTTA [illegible] TATGAGTAAA AAATTAGGAC CATTACAGTT 2940

CGGTCATAGC AATGGTCAAG TATTCTTAGG TAAAGATATG CAAGGTGAGC CTAATTATTC 3000

AAGCCAAATC GCATATGAAA TTGATAAAGA AGTTCAACGA ATCGTTAAAG AACAATACGA 3060

ACGTTGTAAA CAAATTTTAT TAGAGCACAA AGAACAATTA ATTTTAATTG CTGAAACATT 3120

ATTAACAGAA GAAACATTAG TTGCTGAACA AATTCAATCA TTATTCTACG AAGGTAAATT 3180

ACCTGAAATT GATTATGATG CAGCTAAAGT TGTTAAAGAT GAAGATTCTG AATTTAATGA 3240

TGGTAAATTC GGTAAATCTT ATGAAGAGAT TCGTAAAGAG CAATTAGAAG ATGGACAACG 3300

TGACGAAAGT GAAGATCGTA AAGAAGAAAA AGATATTGCT GAGGATAAAA AAGAAGCTGA 3360

TAAATCTGAT GAAAAAGATG AACCAGCACA TCGACAAGCC CCAAATATCG AAAAACCTTA 3420

CGATCCAAAT CACCCAGACA ATAAATAATC GATTATATTC AGTACCTCTT TCTATGATAA 3480

AATTGTTATA GCAGAAAATA ATTGTAAAAC AAGTTACTTC ATTATTTAGA ATGATGGGTG 3600

TAGAATAAGT ACAATTGTTG CATTTTATGA AGTAAAGTAA TTTTTTAAAT ATAGAGTAAT 3660

AGAGGAGATT GAAATAATGA CACACGATTA TATTGTTAAA GCATTAGCAT TTGATGGAGA 3720

GATTAGGGCT TATGCTGCTT TGACAACTGA AACTGTTCAA GAAGCACAAA CGAGACATTA 3780

TACATGGCCG ACAGCATCTG CTGCAATGGG AAGAACAATG caCAGCAACA GCTATGATGG 3840

GCGCAATGTT GAAAGGTGAT CAAAAATTAA CTGTCACTGT AGATGGCCAA GGACCTATTG 3900

GACGAATTAT TGCCGATGCA AATGCTAAAG GCGAGGTGCG TGCTTATGTA GACCATCCAC 3960

AAACTCATTT TCCATTAAAT GAGCAAGGTA AACTTGATGT AAGACGAGCG GTAGGGACAA 4020

ATGGATCTAT TATGGTTGTT AAAGACGTTG GAATGAAAGA CTATTTCtCT GGAGCAAGTC 4080

CaATTGTTTC AGGAGAACTT GGTGAAGATT TTACTTATTA TTATGCTACA AGTGAACAAA 4140

CACCTTCATC GGTAGGTCTT GGTGTATTGG TAAATCCTGA TAATACGATT AAAGCAGCAG 4200

GAGGATTTAT CATTCAAGTT ATGCCAGGTG CCAAAGATGA AACAATTTCA AAATTAGAAA 4260

AAGCAATTAG TGAAATGACA CCAGTTTCTA AATTAATTGA ACAAGGATTA ACGCCAGAAG 4320

GATTACTAAA CGAAATCTTA GGTGAAGACC ATGTGCAAAT TTTAGAGAAA ATGCCTGTTC 4380

AATTTGAATG TAATTGTAGT CATGAGAAAT TTTTAAATGC TATTAAAGGA TTGGGCGAGG 4440

CTGAGATTCA AAATATGATT AAAGAAGATC ATGGTGCTGA AGCAGTATGT CATTTCTGTG 4500

GAAATAAATA TAAATATACT GAAGAAGAAT TAAACGTGTT GCTAGAAAGT TTAGCGTAAT 4560

TTAATTTAAA TCAATACGCT AAAATGTTTA TTTTTAGCGG TTTAGTGAAA TGTAGAACTA 4620

AATAGTTGTA TAATCCTTAG TGATTTTGTT TGCTTTCTAG AATTTATTTG ATAAAATAAT 4680

TCTATATCCG ATAAATAAAC TAAGATTTCA ACAACTAACT AAAAAGGAGT GTTCTTAATG 4740

GCAGAAAAAC CAGTAGATAA TATTACTCAA ATTATTGGCG GTACACCGGT AGTCAAATTG 4800

AGAAATGTAG TAGATGACAA TGCAGCAGAT GTTTATGTAA AATTGGAATA TCAAAATCCA 4860

GGTGGTTCTG TAAAGGATAG AATTGCTTTA GCAATGATTG AAAAAGCAGA GCGAGAAGGC 4920

AAAATTAAAC CTGGCGATAC AATTGTAGAA CCAACAAGTG GTAATACAGG TATCGGTTTA 4980

GCATTTGTAT GTGCTGCTAA AGGATATAAA GCAGTATTTA CTATGCCCGA AACAATGAGC 5040

CAAGAGCGTC GTAATTTATT AAAAGCATAC GGTGCGGAAT TAGTTTTAAC GCCTGGATCA 5100

GAAGCGATGA AAGGTGCAAT TAAAAAAGCT AAAGAATTGA AAGAAGAACA TGGTTACTTC 5160

GAGCCACAAC AATTTGAAAA CCCTGCGAAC CCTGAAGTTC ATGAGTTAAC TACAGGTCCT 5220

GAGTTATTAC AACAATTTGA AGGGAAAACT ATCGATGCGT TCCTAGCTGG TGTTGGTACT 5280

GTTGCTATAG AGCCTGAGOC TTCTCCAGTA TTGAGCGGTG GTGAGCCAGG TCCACATAAA 5400

TTACAAGGTT TAGGTGCTGG ATTTATTCCA GGCACTTTGA ATACAGAAAT CTATGACAGT 5460

ATTATTAAAG TAGGAAATGA TACAGCGATG GAAATGTCTC GTCGAGTTGC TAAAGAGGAA 5520

GGTATTTTAG CAGGTATTTC ATCAGGTGCT GCGATTTATG CTGCCATTCA AAAAGCAAAA 5580

GAATTAGGAA AAGGTAAAAC AGTAGTAACA GTATTGCCGA GTAATGGTGA ACGCTACTTA 5640

TCAACACCTT TATATTCATT CGATGACTAA TTAATGTCAT TTAAAAGAGT GAGTTATCTT 5700

TTTGAGATAA CTTGCTCTTT TTTTCTACCA TGTATATTTT TAAAAATATG AGCGTTAAAT 5760

TAAACATTTT TCTGATAAAA ATATCCAGTG AATGATAAGA TAATAAACGT ACATACTAAT 5820

AACTAGTAAA TAGCAGGAGT AAATTTTATT AGAGTTAAAC AATACATAAT TAAAGGGTGG 5880

TTAACATGAC TAAAACAAAA ATTATGGGcA TATTAAACGT CACACCTGAT TcATTCTcAG 5940

ATGGTGGAAA ATTTAATAAT GTTGAATCAG CTATAAATAG aGTGAAAGCC ATGATAGATG 6000

AAGGTGCTGA CATTATAGAT GTTGGAGGTG TTTCAACGAG ACCCGGTCAT GAAATGGTTT 6060

CATTAGAAGA TGAGATGAAC AGAGTATTAC CTGTTGTTGA AGCTATTGTC GGTTT 6115
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:
TAGATACTGG GnTAAAcaTc AAAAATAtyT GCtTaTTCaC GTGTTTAcGc TCCCtCAAAC 60

GCAACGTTAA TTGCGTGTAA TCATTTAGTG TGAATTcAGA CGCTTCTTCC ATGACTATGT 120

CTGATATGCC TTTTATCGAC TTTATTTTCT CTGGGTTATC TAATCCTTTA AACAAAAAAA 180

CTGCGCCGTT TGGCAATTCA ACTTTGTTAT CAGTCTTATT CCAAAGGCAC ATGTCCCAAA 240

TACCAAAGTT TATCAAACAA TCTTTAACAT CTTCGAACAA ACTATCTTTA ATTGTTGATT 300

GTACTTTTCT AAGCCACAGT ATACGCCTAG GATATTTCCA ATCTTGCAAT GCTTTGAGTA 360

CAACTTTTTG TATAACGCCG TGAGACTTAC CGCTCGAACC TCCACCGTAA TGkACTTCAG 420

TGAAGTtATC GTAATTGGTT AGTATTTCGA ATATGTTTCT ATTGAAAACA TTAGACGGTT 480

TGTTAAAGTT TAATTTAACT TTCGTCATCG TACTCACCAA TATTAATCTC AATATTCTTC 540

TGAGTAATTT CTTTTTTATC GATATACGCA CCATGTACTT TTAGTATGTG GTCAATAGAT 600

TTTAAATGGT CATATTTCTT ACTGTAAGCC TCTTGAGGTT CTCCTCTAGC AATAGAAGCA 720

GATAACGCTA AAGCTTCTGT AATACTCATT AAACGCTCTT CTTGTATCTG TTCTAATCGT 780

TCTTTAATAT ATTCCGAAAC ATTAACATTT CTTAACAATC GACTTGCTAA AGACTCTGCT 840

GTTTTCTTAC TATAACCTGC TGTAATTGCT GCTTTTTTAC CATTACATCC ATTCATTATA 900

TATTCATCTG CGAATCTCTT TTGTTTTTCG TTCATTTCAT TTACCACCAA CTCTCGCGCT 960

ATACGCTTTT TAAAATTAAA AAAGGATTGG CTATAATCAG CCAACCCACA TAGATCCTTT 1020

ATTCCTAATT GCGATAAGGG AAACGCAGTA CGATAGTCAA TATCCTACAC TATCATAATA 1080

TCTCATTTAA GGTATCAAAA ACTGCCACTT TACTGCCAAT TTCAGTCTTC CCCTAACTCT 1140

TCCGCCAATC TAGATATGAT TTTTCTTTTG ATTCTATGAG CAGTTCTATC AGAAATGTGT 1200

ATGTCAACAC AAACTTTCAC TAATTCCTTT TTATTAAAAT AATACTCTTG AATGAATTCG 1260

CGTTCTTTCC TGCTTGATGT GTTGATTATA CGTTCAATAG CGCTCTTAAA CTCAAGGATT 1320

TTACCTCTTC GTATACTACA AAGATAATTA GTTACTGCCA TTTCTGTTTT CGATGTATTA 1380

GACGGTACAA ACTCCCCGCC TATATTTGTA TCTGTTGGAA TCCACGGTGT CATTATTTCA 1440

CTTCTTAAAT CTTCAAGTTG TTTATGATAA TTAGGATAAT CACACAACTC ATCTTCTAAC 1500

TTTCGAACTG TTGATAATTT TAATCCGTAT TTCTTTTTAG TCATGAATAC CCTCCGTACA 1560

AATATGTTTA ATCTTCAAAG TGTCTCAATC TACTTCTTAA TATCTCTATC TCTCGCTCTT 1620

TAACTTTTAC ATCACCTTTT AACTGTTCCG CTTGTAACAT CACACCAAAC AATAAGATGA 1680

CTAGTAATAT AATTGCTATG ATTAACCACA TCATCTACTC CGACACCTCC GCCCTCATCA 1740

AATCAGACTG ATCACTCAAC TTTGCGAAGT CACTTGGCGC CTCTACATCA TCATTAGCCG 1800

TCATCATAAT ATATACTTGC TCAGTTACAT ACTTACCTAA CTCATACATC GCTAGTAAGA 1860

ATAATAGTCT CAAAATTTCT TTAACCACCA CTAAACACCC CATGTTAATT TATCGATAAT 1920

TTGTATAGCT TGTTTTAATG CGTCTCTTTT TTCTTTGATA TCTCTATTAT CGCCATCTTC 1980

ATCAGCTGAC ATTAACTCAC TGTCATATTC ATATAATAGT TCTGATATTT CATTACTAGC 2040

TACTACTAAT AAGTTTTCAT CTACATCAAT CGTTACCGTT TTCTTTGGCA TCTCCATCTC 2100

TCCTTATCTT AACTTGTGCC TCGTATTTGC GCTCAGCTTC TTCTTTACTC TCTGCCTCAA 2160

CAACTGTAAA CGTCTGATTA TCTCTAGCAG TAGTAAAATG TTCATGTGGT TGTCCTGTTG 2220

AATCTTTGAA TGTTGTGACT AAGTATTGCG TCACTTCTTA TCACTCCTTT GAATGATTCT 2280

AAGTTTTTCT ACGAATAAAA GTATTAGTAC AACACTCAAT GTAGCCAACA TATTTTTTTG 2340

CTTTGCAAAA TCTACTATAA CGATTAAGAC TAATAACATT CCAATTCTGC ATGTAAATAA 2400

TACAAGTATT GGAACTAATG TAATGATGTA ACTCACTTCC CCAAAACCTC CTTGACTCGA 2520

TCTAAGATGT CTTTACACTC CGCTACTTCC GAAGCCTTTT TCTCCACGTT CTGAAACACT 2580

TTCGAATTCC TCCACTTGCT TTAGTTCAGG TGTCCATATA GGCACGATAA CCAATTGAGC 2640

TAGTTTGTCT CCTTCGTTGA TTTGATAAGT TCCGTATTGT CTTATGGCGT CACTCAAATC 2700

GATTTCTCCT TTAATATCAA AAACACCTGG TGTGATATAA CCATTCGATG CAATAGCGTC 2760

ATTCTTGATA TTAATCCCTA AATTGCCGTG ATATCCCGCG TCTATCTTGC CTGTTTCAAT 2820

CACTAAATGC GTTTTACTAC TTACACCACT ACGGCTAGTT AATAGTCCGA CATAGCCCTC 2880

TGGTATGCTT ACAGCTACAT CTGTTTTAAT CACTGCCTTT TCTTGTGGCT CAAGTACGAC 2940

AGTTTCAGCT GAGAATATGT CATAACCTGC ATCCGTCTTA TGATTTCGTT CGGGCATTCT 3000

AGCATTTTCT GATAATAGCC TTACTTGTAA TGTGTTAGTC ATTTTCCTGC TCCTCCCTAG 3060

CTGTAGCAAA CGCTATTCTC AATTTCAATC TTTCAACAAT ATGAATTAGT GCGGTATTGA 3120

GGAATATTTC AAATTCTTCA ATGTTCTCAT CTATAAAATC AAGTATTTCT TCCTCTTGTT 3180

CACTGTCAAA CTCGCTTAGT ACATCCCAAA TATTTATGTC GCTTTTGCTC GTTTCTAATA 3240

CTCTTTTGAT TATTTCTGAA TTACTTTTAT TACTCATTTT CCTTGTTCCT CCTCATATTT 3300

ATAGACAACT TGACCTGCCA TAATCCCTAC TGCTTCATCA AGTTCAATAC CTTCTTTAAC 3360

TGAATGTTGA ATAGCATTTG TCATTCCCTC AAGTATTTCA TCAAACGCTT GTGCTCTCTT 3420

ATACACGTCC TCAATCTCTT TTAGTAATCC CTCTGTGTCA TTACCGTTAT ACGCACTAGC 3480

ACTGATCACT GATTGTTCAA TTTGTTCGCG GTTATTCATC ATTTCCATCT CCTCTAAAAT 3540

AAAGTTAGTT GCTTCTGCTC CTCGTATTCC AAACCATGTT GCTTTATATA TGTTTCGAGC 3600

TCTTCCGCTG TATCAAATGT CTTTTTCACG CCTTGCCAAC CTGGCACGAT ATGCCCATGa 3660

AAGTAATAAG TGCCGTTCAC TACATGGATA TGTGCCACTC GTTCGTTATC CTGATACAGA 3720

TATCTCTTAG ATCCGAAAAA TTGGTTTAAG TATTCTTTAC ATGCGCTATC GGTTTTAGGC 3780

ATTTATGCTT CCTGCCATTT CTTAAACATT TGGTTATAAG TAGTATCAAA CCAGTACGGA 3840

TCACGTGAAT GTTTTTGAGG CACATTAAAC AAATGTGGCT TCTTCTTACG TAGTTCAGCC 3900

TCTTTACGTC GTTGCCTAGC CATTTCACGC TCTTTGCTCT CTCGCTCCAT GATTTTGGAT 3960

[illegible] CTTTATACTC AGCTAAGCGC ATACCATAAG GTGCATGTAA GGCTTCTAAC 4020

AACGCCCAGC CACCTCGTAC TCTTTTTGCA ACCATTCCTG GAGTTAAACC GTTCTTTTTT 4080

ATCAATTCAT TTTCATGTTC GGTAAATTTA TATGGTTTAC CGTTAATCTT TACGATACTC 4140

ATTTATTCCA CCTCTATACA TTTACTTTTT TTAATCCAAT CCTCTAATTT GTGCGTGTTG 4200

ACATTTAAGT TAACCATCTC AGCTTTTCCG TTTTTATATC CACTAATAGT TGATCTTGAT 4320

ACGCCAGTTT CATTGTGCAA ATCTTGGACA CTTACGTTAT CTCTAGCCAT GATTACCCTT 4380

AAATTAGTTG CGAATACTCC GTTCAACTTC ATTTATTCCA CCTCTATATA TGCATGTCTT 4440

ATTGTTATGT TGTCATACTT TAGTAATTCG TCCGGATTGT CATCTAAGCG CTTTGCCAGC 4500

GTATCTTTTT CTTTATCCAC ATCATCGTAA TGCTGATATT CAACTTCTGT AGGTATTCTT 4560

ATATCAATCG TTGCGTTTAT ATATGCTTGT TGTTGCATTA GATCACTTCA TTTCTCTTTT 4620

TCTTTTACGT CTGACTTTCA CTAAGTCCTC ATATACCATC CATTCTTGAC CTGTGTATTT 4680

AGGCGCTTTA CATATCCACG TTAAATTCAC ATCTCTATAC TGATATCTGA ATATCTTCGC 4740

TTTGATGTTG GCAACTTCAG TCGCCTTACC TTTAACGTCT ATAACTTCAA CCAGTTTCCC 4800

TTCCTTCCAC AAAGAGAAAT CGGCTATATA CGTAATCGGT CTTTGTTTCC CGAATTTAGG 4860

TTGTAATTCA AATTTCGGTT GTATTTCGAT ACGATCATAG TTAGTGCCAT TCATATTACT 4920

TTCTAAATAT TGGTAATATT CGCACTCTAC TTTGCTATCA AATACAATTC CTTTGTACTC 4980

AACTTTCTTA GCATTGTATT TACTCATTGT GCCACCTCTA AATATCAAAT ATCGTTGCTT 5040

GCAATCCTAG CTCTTGCTCA TATAGAAGCC CGTGAGCGCC TTTGAATCGT TTTAGGTCAC 5100

TATCAGTCAT AATTTTCTTT TCGTCGCTGA AATGGGCTCC TGTGAGCGAA TAAACTTCAT 5160

TTACGTTGTC TTTATACTTG ATGACCTTAA TATCTTCCGT GCCATCTTCT CGGTATAAGT 5220

AATATTTTTC TTTCGGCATT TTTTAACACT CCTTAATGTG TGTTTTCTTC GAGTTGATTT 5280

CATTCATGAT TTTCTTTTCA ACTCTGTCGT AATCATCGAA AGGCGATAAC TCGTTATTGT 5340

CCAACAATCT ATTGACCGCC CAACCAGTCT CGATATATAC ATTTGCTACA ATCGGGTCGC 5400

TTTGCTTTGT CTCTTCATAC ATCGATTTCA ATAAGCTTTT GAATTGCATT ATATTCATGT 5460

GAAAAACCTC TGAGTCTTCT TGTAATACTC AAATTCAATT ATTCCGGTTT CGCCGTCTTT 5520

GTTTTTGGCT ATGTTACATT CAACAATAGA TTTGCCAGTG ATACTGTCAT CTTCGTCACG 5580

GTTATAATAA TCATCACGGT AAAGTAGCAT CGCTAAACTC GCATCTGCTT CTATTCCGCC 5640

TGATTCTTTC ATGTCCGATA GCATTGGTCT TTTATCCTGT CTAGACTCGA CACCACGATT 5700

CAGTTGTGAA AGTAGTACGA TGATTGCGCC TGTCTCGTTA GCGATTATCT TTAAGTCACG 5760

TGATATCTTT TCTACTGCTA CACGTCTATC AACTTTCGCA TCAGTATCCA TCAGTTGAAG 5820

ATAATCTATA AAAATAACTT GTTGCCTGTC TGAATGCCTC ATTGtTGCGC TCGCACATCT 5880

TGCGGTGTGA TATTACTTTT ATCAGAAATA TCGATGCCTA ATTTCATGAT TTTATCCATC 5940 

GCATTCGTTA ACTTTGTTAA GTCATCCGGC GTTAAGTTCC TGATTTCTTT TATCTTTGTT 6000

[illegible] AAGATGTTTT GTATCCATTT TGTGCTATGT TCAGCATCAT GTTTAATGCA 6120

[illegible] TACCCACTGA GGGACGCGCT GCGATGACGA TTAATTGTGA TGGTTCTAAT 6180

[illegible] TGTAATCCAT TAGCTTGTAA CCCGTCTTAA TTTGCTTCTT AGGGCTATCG 6240

[illegible] CTTCGACAAA CTCCTCAACA AACTTCTTGG TTCCATCTTC TTTTTTGTTA 6300

[illegible] TTAAATCCTT GAGTTCATCA ATCAAGTTGT TAAAGTTTTG GTTCGTAGGT 6360

[illegible] ACTCAGTTAC CAATTCGTTA GCTTTGTTGA GCTGATAACT TTCCAATAAT 6420

[illegible] AACGTTCAAA GAAGCCATAT CCAATGAAAT CGGAGTTGTA AAGTTTAGTT 6480

[illegible] CATCTAAAAA TTCTTTATCT TTAGTTGCTT TTAAATAGAT TTCTTGATGA 6540

TCTATCTTTC CGACGTCCAT TACATAATTG AAAAAGGTTT TAAACTTTTC GTTCGTAAAC 6600

ATGTAATCTT TAACTCTTAT CTTTTCTAAT ACGTCCGGTT GTTTAAGTAG CGTAGCGATT 6660

ATTGTACTTT CAATTTCGAA TTGTCCGTAA TTCATTCGTT TTCGCCCCCA AATTCTGCCA 6720

ACTTATTCAT GAACTTATCT AGCGCTATTT TTCTTTGTCT GACATATTCG GGGTCATTCT 6780

GCATTTTCCA TTGGTGTGTA GCGGTTTCGT TATCTACTGG CTCGATAGAT ACTTTTTTAG 6840

GTTCCTTACG CATGATTGCT GGTAAGTTAG GCGGGTACGG GTTGTTACTG TTGATATAAA 6900

CATCTACCGC TTTTACAGTT GGTTGATAAT CTCCATTTTG ACTTAATACA TCAATCCACA 6960

TTTCTAACTT CGGTTTATCA AAATCAATGT TGTATACGTA CCTAACTTTT TTAATAATTT 7020

CTAATGCTTG TGTTTTGCTC ATCGGCATTA GTCATCACTC AATTCTTTTT CCATTTGTGC 7080

[illegible] TCAGTAGTAT TTTTTCTAGG TGCTATTTTA TTTTCTGCAT CTTCTTTTGT 7140

[illegible] TCTTTAGCCC AGTTGTTTAA AACTTTAATT AAATAGCCAC CATGCGCACT 7200

[illegible] GTGTACTCAA CACCTACTTT TACAACTTCA AAAGCGTTTG TACCTATATC 7260

[illegible] AACCCTAATT GTTGGATTTG ATTAGGTGTT AACTTATGAT CCAAATTTGC 7320

[illegible] TTTATTGAAG ATGAGAAGAC GGCTTCTCTT TCTTCTTCTT TATTCTTATA 7380

TTCTTCTTCT TTTTCTTCTT CTCTTTCTTC TTCTTCTTCT GTATCGTTAC GTAACGTTAC 7440

[illegible] CGTTTTGCTT CTAGTAACTT TTTCTGTTTC TCACGATAGC GTTGTTGTCG 7500

CAATTTATTT TTTTCTTTAT GCTTAGCTTT GCTATCTAAG CT1TGATGCT TCTCCCAGTT 7560

TGTCACTTTT ATGACACCAT taactttttc AATCATGCCC AATGTCTCAA AAGTTTGAAT 7620

TGCTAACCTT ATTGAGTTAA TAGGTCTATT AAATTCATTT GCTAACATTT CTTCGTTGTA 7680

CGGCAAGTTT TCGGATAGCA TAATATAACC TTGTTCATTG TACTTTCCTG ATAAAGTTAG 7740

TAACTTAACC CAAATAGTTA TGATCGTATC TCTTTCGGGT AAAGCTTCGA TATATTTGAT 7800

CTCCTTTCAG CATTTTGTTG AGCCTCTCAT CAACTTTTAT CCACGAGTCA TGCAAGTGAT 7920

ATTTATCATC AAACGACTTA ACGCCAATTG CGTGCTGTTC ATTATGATGT TGTCTACACA 7 980 

GTGCTAACAC ATGTTTGTCG TAGTGATTCA TTTTGTTTCT GTTCATGCCT CTGCCGACTG 804 0 

CTTCATAATG TGCCAGGTCT GCGTGAGGCT TTCCGCATAT TACACAGTTG CGGTTGATTG 8100 

TAGCCCAATA TAATAACGCT TTATCTTCGC TTAACAACTT ACTCGTTTCT ACACTCATAG 8160 

GTATTTGATG ATGAAACATA AACGCTATAA TCAGTTCTAT TAACTCCCTT GCAACTTTCA 8220 

TAGAACAGTC GCGCAGACTG ATTTCTTCAT AACCTTTCAT AATTTCCAAT TCTGTTTGTA 8280 

ATAATTTTCT AGTTGATTCT ACTGGTTCGC CCCAGTGAAG TTCTATATCT CTACACATTG 834 0 

CGAATATTTT TTTGCGTTGT TCTATAGATA GTTTTTTATT GTCCGGAACC TCTACTTCTG 84 00 

CTTTTAGTGG ATATCCGTTT TCTAGTAAGT CAATGTGACT TTGTTCAAGT TCAACAC CAG 84 6 0 

TAGCAACGAC GGAATAAGTA CCGTCATTGT CTTTCTGGTA TCTTGTAATG TATTGCATTT 8520 

AAACCACGTC CTAGAACGGT AAATCATCAT CATTGATTTC TATTGGACCA TT AG CATT AG 85 8 0 

CGAATGGGTT TGATTGTTGA CTCATTGGCG TCTGTTTCCC ATTTGCTTGC TGTTCTTTTT 864 0 

GTTTCATCTC ATCAGTTTTA GGTTCTGGTT TATTAACTAC TTCATCGTCT TTATTCCAAA 8700 

CTTTTACATA TGAGAGT CTT ACAAAATACT TGCCTTGTTC CTCGTTAAAT TTATTTTTAA 8 76 0 

GTACAATAGT TCCGATTTTG TTAATTAATT GATCTGTGTC AAAAGTTAAA TCTGGTAAGT 8820 

TCAATTTAAT TCCTAATCTA CTAAGTAACT CGATATATTG TTTTTCTTGA TAATCTTGTT 88 80 

GGAATGGTGG GACGAATTGG TTGTGTTTGT ATTGTTTACC TTCGTTGTTT TCAAAAACAA 8 940 

TCGTGAAGTA TCTGTTTTCT CTGTCGTTAA ACTCGACATT TGCAACTTTT ACTGTAAATT 9000 

CTCCAGCTCC TAAAAAGTCC CCACCTTTCA TGAATGCCTC TTGATTAGTT TCTTGAATGT 9060 

ATTGTGTTCT ACCAGTGATT TTCATAATTT TTATACCGTC CTTTTAATTA ATTTTTAATT 9120 

ACCATTTCTA ATTGCTTGTA CAACATCGTT AATACTTGGA TTAATGAAAC GTTTGTTGTT 9180 

AATTTTGATG TTGCTTGAGT GTCTTATCTT TGTCTCGAAT AAATTTGATG GTTCAGCGTT 924 0 

AAGTACATAT TGATAAGTTT TTTCGCCGTC TTGCTCATGT TCTTCTATTG TCATTCTTGC 93 00 

TAACACGTCA GATTGACTGA TGACTGCTTT TTTTATTTGG TCTTGTGCCT CTATCGTGAT 9360 

TGTTGGATTG ATAGTACTTC CCTCATCATC TTTGTCTTTG TTAATGCCCT CGTGTCCGCT 94 2 0 

TATAGCAAGA TGAAATTGAT AATGTTCTTG TAATTTAGAA ATATAACGAT AAATACTTAC 94 80 

50 AATGCGTGTA GCACACTCGC C CCAAT CATT AAATGTCGGT TTCTTTGATT TACCGTCCAT 954 0 

GATGTCGTCC ATAGTGATAT CACGTAACTT TTGGATTGTT TCAATCACTA CAACATCAAT 9600 
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AAAATGCTTA TAATTCTTAA TCTGCACAAC TGCCCCATCT TCTGTTACCG TTGTTCCGTC 
CTCATTTATA TCTAGTACTA AGGCATTGTT ATCTTTTGTT AAAAACGTAG TTTTACCAGT 
ACCGAACTTG CCGTATATCG CAAATTTATA AAACTTGTTT GCATTTTGTT TGCTGATGTC 
TTTTACACCT AGTTGCGTTA AAATATCGAC ATCTTGATTA GTTTTTTCAG TCATCTATTC 
TCCCACCTTT ACCGTGTATG ACGTTGGTTT CTCCACAATG CTAGCACCCT CTAAAACTTC 
GCCGTTTGCG TCAATCAATG TGCCGTTTTC AGTTACATTG AAATCTTTCT TAATGTCTGA 
TTGGCTAAGT TTTTTAGTTA CTTTTACATA GTTGTCAAAA CCTCGTTGCT CAAGTTGTnT 
AATGACTTCT TGCTCATTGC TAACTTGAAT GACTTTTGAA CCTTTTCTGG CTGTCACTTT 
TCCGTAAGtG TATTCAACTT GAATTTGCTA TCTTGTTCTT TTTGTATTCT GTAATATTCA 
ATTACAAGGC TTTGTAAATA TTCTTTGCCA CTCTGTAATT TTTCTACTTC TTTATCTTTC 
CATTCGTTTA TGCGTTCAAT TTCTTTATTT GCTAAATCGT TGATTTCATT CTCTTTAGTT 
GTGATTGCAT CCAGTTTCTn AAAAACCCAG TTAGCACTGT CTAGATCAGT nACTTTGAAT 
CGGTCGTCTT GTTCGAATGT n 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
TTTCTCTCTA TTATTCTCGA TGCGTAGATA ATTGTTTAAA TTTAAGTTTA TAGTAATGTT 
GAGOTTATAA-TTTCATATAT CTAAAAACAG GTGTTGTATA^ TATAATCATT CATCTAGTTA - 
TACTTACTTT AAAAATAATA TAATTTCATG CGATGCAATT CATTGATGGA TGTTTTTAAT 
CTTAATCAAA TCCAaATAAA G CAT AT ATTT TTAAATTCAC TTTCTTTCGA ATCGATTTTT 
ATCTCTTGnA TTAAACTTTT CCATTGTTTC ATTAAAGCTC TCTGTCATAT CTATTCCCAT 
TGAATTCGCT AAACATAACA ACACAAATAA ATTATCACCT AATTCTGCTT TAATCGTATT 
TGCTTCCTCT GAATCTTTCT TCTTTTTTTC AC CAT AGGT A TGATTTATTT CACGTGCAAG 
TTCGCCCACT TCTTCAGTCA ATCTAGCTAA GTTAGCTAAT GGTGAAAAAT ATCCTGTTTT 
AAATTGTCCA ATATATTCAT CAACTTCACG TTGCATTTCT ACCATTGATT TCATTTCTAC 
GTTCTCCTTA TATTGCATTT CTAATATAGT ATATATCAAT TTGAAGTCTC ATGCATGTTT 
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AATTCAGTTT ATATAAATGT AATGCATTCC TAACTAAATT AAATCAATTG AAATTGGGAT 720 

TATAACTTTA TGATACGTAC CACTACAATA AAATAATATA GTGAATAATC TACCATTAGA 780 

AAAATAAGCA CAAAAAAACT AGCAACCACA CAAAAATGTG ATTAGCTAGT TAATAAGTGT 840 

CTAATTTAAG TTAATTGTTA ATCTATAAGA TTAATCACTT GAACGCGCAA TCAAAATAAT 900 

ACGTACAAGC TCTGCTACAG CGACTGCAGT TGCTGCAACA TAAGTCATTG CTGCTGCAGA 960 

TAATACTTTA CGCGCATGCT TGTATTCTTT TTCATTTACA ATGTTCAATG CCGTAATTTG 1020 

TTTCATCGCT CTTGAACTCG CATCAAACTC AACTGGTAAC GTAACAATTG AGAATAATAC 1080 

CGCTAATGAC ATTAAACCAG CACCAATCCA TAAAG CAGTT GAACCaAATG CACTACCTAT 1140 

CGCTGTTAAG ATAATACCTA ACATGATGAT CATATAACTT AATGAACTCC CTAGGTTTGC 1200 

AACAGGTACT AATGCTGCTC TGAATCTTAA GAACCAATAT CCTTGGTGAT CTTGAATGGC 1260 

2 0 ATGACCAACT TCGTGGG CTG CAATTGCAGT TCCAGCAACT GATGGTCTGT CATAGTTTGC 1320 

AGGAGATAGT GAAACAACTT TCTTTTTAGG ATCGTAATGA TCTGTTAAGA ATCCTTCACC 1380 

TTTAACAACT TCG ACAT CAT AAATACCGTT TGCATGTAAA ATTTCTAATG CAACTTCACG 1440 

2$ ACCCQTTTTA CCACTAGTTG ATCTAACTTG TGAATATTTC TCATAGTTAG ATTTAACTTT 1500 

GTGTTGTGCC CATAAAGGAA GCACCATTAA TATTACGAAA TAAATTATCA TAGTAAAAAT 1560 

TGAAGACAAT AAACTCACTC TCCTTTATAA ATATTTTACT GTCATTTGCC GTTTTTATCA 1620 

AATCATTTAC ACTTTAATAA TTTGTTTAAT TCAATATAAA GCAAAAGTCC AAAAACACTT 16 80 

AGACAACATG ATAATACACC AATTTGCCAC ACATGTGTAG TTATAAAATC ATAATATGGA 1740 

AATTGAAGGT GAAAATAGTC AATATAATCA TTCAAAAACA CCCAAATCAT yGCTACACTG 18 00 

ATTCCAATCA TAGAACGTTT AAACCTAGGA TAGAAGTAAA TTGCCTGAAC AGCCATTATA 1860 

CTGTGGGAAA ACATTAATAC CAAACCATTT ACTGTAATAT CACCTTGTTC AATAATAAAT 1920 

AATATATTCA TT AT AACTG C CCAAATCCCA TATTTGAATA ATGTTACAAA TGCCAGTGCA 1980 

TCGATAATAC TATTTTGTTT TTGAATTAAT ATCAATGAGA TAGAAATAAC TAAGTATAAT 204 0 

ATTGCAGTTG GGCTATCTGG AACAAAAATC TTAAAATGCC AGGGCGTATG ACTTAATTGT 2100 

45 TCACCATACC ATATATAACC ATAAATCATC CCTAATATAT TACAAATGAG TAGCATCATT 2160 

AACCAAGAAC GTTGATAAAG TGTATATTGC CAAAATGCTT TAATTGT CAT CTGCTAAGTC 222 0 

CTCAAATTGA TTATGTTTAT TT ACT AG CTT GAGTGTATTT AAAATTTGCG TTAGTTGATA 228 0 

50 AAAACGTTGC TTTTCATTCA TCTGTAAACT TAAATCAATA TTGTGTAACA AGTAATCTAT 234 0 

TAATAACGCA TGTTTATGCC GATCTATAGC CATACTATTT AAGTCATGAA GATAAGTTTG 2400 
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TGACACGTTT GCGAAGTGAA TTTGAATATC AAAAGCACAG TTATGATTAG CGATATAATC 2520 

AAATATTTCA TTTGTATTCA TTAACTTTAT ATTACGCTTA GTAAATTGAA TTGCAGAAGC 2580 

GTGACTTCCC ACTTCTGCAA TTTCTAATGT TTCATGATGA TTAATTTTTG TATCTACAAA 2640 

ATGAATGTTT GCCAATTTCG CCTCATTCAC TTTTATATAG TTAAGCACCC AAACTGCAAT 2 700 

ACGCGACTTA AATCGATATT GAAAAAGTAA ATATTCAATA AAACTTTCTT TAATTTGATT 2 760 

GAGTGTCTCT GACATCAAAT ACCCCATTTT AAGATTGCAA TCTTGaTAAT TCGTCATGCC 2 820 

AATTTTCGTT ACTTGGcTCT AGTTC CAACA ATTGATTTAA AATAGTAATT GCTTGTTCCT 2 880 

TTTGACCAAT TTCAATTAAA TAGAAATAAT AATCACTCAT AAAATCAATA TTTGTTTTCA 2 940 

TCGTTGGATA TGCTAATTCA AAGAAATGTT GAGCTTCTTT ATCTCGCTC 2 989 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CATCAACTCC TTAATTACAC TGTAAATGAT ATGCGTCTTT TTGACAACTA TATTTGTCAA 60 

ATCTACACCA AAAAATATGA TTATCCACCT ATGTATGACA TTTTGAAACA AACACCTCAA 120 

CGCCTACAAG TCATAATTGT TTACTTTCGT TACACCTTCC TGCATAATTA ACAGCATTCT 18 0 

AATTTTAGTA TGATGCACGC ATTTTCACTA AATCAAACCA TTCAAAGGAG ACTATTATGG 24 0 

CATTTACATT ATCTGCAATT CAACAAGCAC ATCAACAATT TACTGGTGTT GACTTTCCAA 300 

-AACIATTCAA "AGCTTTTAAA GATATGGGGA: TGACTTACAA - TATCGTCAAC ~ATTCAAGATG~ 3 6 0 

GCACTGCAAC ATACGTACAT CAATCAGAAG ATGATATCGT TACGTCATCT GTAAAAAGTA 420 

ATCATCCTGT TGCTCAAAAA TCAAACAAAA CAATAGTTCA AGACGTCTTA ACTAGACATC 4 80 

AACAAGGGCA AACAGATTTT GAAACATTTT GTGATGAAAT GGCTGAAGCT GGCATTTATA 54 0 

45 AATGGCATAT CGATATTCmA GCGGGCACTT GTACTTATAT CGACTTGCAA GACCAAGCTG 600 

TTATTTCAGA ATTAATCCCT CAATAAACTA TATTTATAGC AACATTTTAA TTATTTCATA 66 0 

AAATTTTATT GATAATCATT ATCGTTCGGT ATAAAGTAAA TACTATATAC TACTTATGAG 72 0 

50 TGAGGTTGAT TAT CATGAT A ACTAACACTT TTATTTTAGG CATCACAGGC CCAACAAGTC 7 80 

TTGTCGTCAT TAGCATTATC GCTTTAATTA TTTTTGGTCC GAAAAAATTA CCACAATTTG 84 0 
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AGTCTCACGA TACACCCAGT AAGGAATCGA AACAACAGCG AGAGCAATAG CACTGACCAC 960 

ACCTTACTGG TTCACTTTAG CGAACTACGC CATCGGTTAG TAAAAATTTT ATTGTCGTTC 1020 

GTCATTACGG TCATCGTCGT ATATGTyTCA TCATTTTGGT GGATGACACC ATTCATAACG 1080 

T AT ATyACC C GgCACATGTG TcCTTACATG CATTTcATTC ACAGAAATGA TACAAATAAC 114 0 

GTG 1143 




(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CAACGCCTGA ACGTAAACCA TATCGTTTCG CGATTTCCTC ATCTTGACTA TTTACTAAAA 60 

ACTCTCTCAT GGCGATTAAT GTTTCTT TT T CTTCTTTAGT TAATGGTAAT TCTAACTCAG 120 

CTGCTTTTTG ACGCAAAGTT GGATGACCAT CTCTAATGAT GTCTTTCATT GTTAACATAT 180 

ATTGCACCTT CCTTATTTTA ATTTGTTTTA GTTGAATGAC AGTAAAAAGG TTGTTAAGAT 24 0 

ACTCATACAT TTTTATGTGT AAATATCTAC AAAGTTAACC AACTACTGCC AATGTTTATT 300 


TTAGATAGTA TATGTAAATT TTCAaGAtAT GCgTAATTGC gTTAAAAAAT GaTTAAAGTG 360 

TTGGTTTCAA GCAATGaTAC TTTAGAAATT TATTTATCAT CTTGACTTTA AAAATTATAT 420 

TATAAATGAC GTAACTGTCA ACAGATATAC TTAGTArTGA AGATGTGTAA TGTAATTGTT 4 80 


TAAAATTGAT TTCCAAGCAG ATTTTATTTA TCATTTAATT TAAATAGCAA GTGGAGGTAC 540 

AAGTAATGAA ATTTGGAAAA ACAATCGCAG TAGTATTAGC ATCTAGTGTC TTGCTTGCAG 600 

GATGTACTAC GGATAAAAAA GAAATTAAGG CATATTTAAA GCAAGTGGAT AAAATTAAAG 660 


ATGATGAAGA ACCAATTAAA ACTGTTGGTA AGAAAATTGC TGAATTAGAT GAGAAAAAGA 720 

AAAAATTAAC TGAAGATGTC AATAGTAAAG ATACAGCAGT TCGCGGTAAA GCAGTAAAGG 780 

ATTTAATTAA AAATGCCGAT GATCGTCTAA AGGAATTTGA AAAAGAAGAA GACGCAATTA 840 

AGAAGTCTGA ACAAGACTTT AAGAAAGCAA AAAGTCACGT TGATAACATT GATAATGATG 900 

TTAAACGTAA AGAAGTAAAA CAATTAGATG ATGTATTAAA AGAAAAATAT AAGTTACACA 960 

GTGATTACGC GAAAGCATaT AAAAAGGCTG TAAACTCAGA GAAAACATTA TTTAAATATT 1020 

TAAATCAAAA TGACGCGACA CAACAAGGTG TTAACGAAAA ATCAwAAGCA ATAGAACAGA 1080 


AAGAAAAGCA AGACGTTGAT CAATTTAAAT AATTAATATA ATACAGATGG TAGGAAACAA 1200

CTAATACAGT TCCTATTATC TGTATCTT1T TTTATTAAAA CAGAACTTTT TCAAATGGTT 1260

TAACACTCCC ATTTATTTGT GGTACAATTA GTAAGGATAA AATGAATTTC TATACAATTA 1320

TGGGAAAGGT ATTGTGAATT GAATGGCTCC TAAGTTACAA GCCCAATTCG ATGCAGTAAA 1380

AGTTTTAAAT GATACTCAAT CGAAATTTGA AATGGTTCAA ATTTTGGATG AGAATGGTAA 1440

CGTCGTAAAT GAAGACTTAG TACCTGATCT TACGGATGAA CAATTAGTGG AATTAATGGA 1500

AAGAATGGTA TGGACTCGTA TCCTTGATCA ACGTTCTATC TCATTAAACA GACAAGGACG 1560

TTTAGGTTTC TATGCACCAA CTGCTGGTCA AGAAGCATCA CAATTAGCGT CACAATACGC 1620

TTTAGAAAAA GAAGATTACA TTTTACCGGG ATACAGAGAT GTTCCTCAAA TTATTTGGCA 1680

TGGTTTACCA TTAACTGAAG CTTTCTTATT CTCAAGAGGT CACTTCAAAG GAAATCAATT 1740

CCCTGAAGGC GTTAATGCAT TAAGCCCACA AATTATTATC GGTGCACAAT ACATTCAAGC 1800

TGCTGGTGTT GCATTTGCAC TTAAAAAACG TGGTAAAAAT GCAGTTGCAA TCACTTACAC 1860

TGGTGACGGT GGTTCTTCAC AAGGTGATTT CTACGAaGGT ATTAACTTTG CAGCAGCTTA 1920

TAAAGCACCT GCAATTTTCG TTATTCAAAA CAATAACTAT GCAATTTCAA CACCAAGAAG 1980

CAAGCAAACT GCTGCTGAAA CATTAGCTCA AAAAGCAATT GCTGTAGGTA TTCCTGGTAT 2040

CCAAGTTGAT GGTATGGATG CGTTAgcTGT nATATCAAGC AACTAAAGAA GCACGTGACC 2100

GCGCAgTTGC AGGTGAAGGT CCAACATTAA TTGAAACTAT GACATATCGT TATGGTCCTC 2160

ATACAATGGC TGGTGACGAT CCAACTCGTT ACAGAACTTC AGACGAAGAT GCTGAATGGG 2220

AGAAAAAAGA CCCATTAGTA CGTTTCCGTA AATTCCTTGA AAACAAAGGT TTATGGAATG 2280

AAGACAAAGA AAATGAAGTT ATTGAACGTG CAAAAGCTGA TATTAAAGCA GCAATTAAAG 2340

AGGCTGATAA CACTGAAAAA CAAACTGTTA GTTGTCTAAT GGAAATTATG TATGAAGATA 2400

TGCCTCAAAA CTTAGCAGAA CAATATGAAA TTTACAAAGA GAAGGAGTCG AAGTAAGCCA 2460

TGGCACAAAT GACAATGGTT CAAGCGATTA ATGATGCGCT TAAAACTGAA CTTAAAAATG 2520

ACCAAGATGT TTTAATTTTT GGTGAAGACG TTGGTGTTAA CGGCGGTGTT TTCCGTGTTA 2580

CTGAAGGACT ACAAAAAGAA TTTGGTGAAG ATAGAGTATT CGATACACCT TTAGCTGAAT 2640

CAGGTATTGG TGGTTTAGCG ATGGGTCTTG CAGTTGAAGG ATTCCGTCCG GTTATGGAAG 2700

TACAATTCTT AGGTTTCGTA TTCGAAGTAT TTGATGCGAT TGCTGGACAA ATTGCACGTA 2760

CTCGTTTCCG TTCAGGCGGT ACTAAAACTG CACCTGTAAC AATTCGTAGC CCATTTGGTG 2820

GTGGCGTACA CACACCAGAA TTACACGCAG ATAACTTAGA AGGTATTTTA GCTCAATCTC 2880
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CTATTAGAAG 


TAATGACCCA GTCGTATACT TAGAGCATAT GAAATTGTAT CGTTCATTCC 3000

GTGAAGAAGT ACCTGAAGAA GAATATACAA TTGACATTGG TAAGGCTAAT GTGAAAAAAG 3060

AAGGTAATGA CATTTCAATC ATCACATACG GTGCAATGGT TCAAGAATCA ATGAAAGCTG 3120

CAGAAGAACT TGAAAAAGAT GGTTATTCTG TTGAAGTAAT TGACTTACGT ACTGTTCAAC 3180

CAATCGATGT TGACACAATT GTAGCTTCAG TTGAAAAAAC TGGTCGTGCA GTTGTAGTTC 3240

AAGAAGCACA ACGTCAAGCT GGTGTTGGTG CAGCAGTTGT AGCTGAATTA AGTGAACGTG 3300

CAATCCTTTC ATTAGAAGCA CCTATTGGAA GAGTTGCAGC AGCAGATACA ATTTATCCAT 3360

TCACTCAAGC TGAAAATGTT TGGTTACCAA ACAAAAATGA CATCATCGAA AAAGCAAAAG 3420

AAACTTTAGA ATTTTAATAC ATTTTAAAAG TTAACGAAGT TAGCGTATTT TAGTCTCATT 3480

GATTAAAATG AAATGTTTAA TTTACGAAAT CTTAGGAGGG CAAAAACGTG GCATTTGAAT 3540

TTAGATTACC CGATATCGGG GAAGGTATCC ACGAAGGTGA AATTGTAAAA TGGTTTGTTA 3600

AAGCTGGAGA TACTATTGAA GAAGACGATG TTTTAGCTGA GGTACAAAAC GATAAATCAG 3660

TAGTAGAAAT CCCATCACCA GCATCTGGTA CTGTAGAAGA AGTTATGGTA GAAGAAGGTA 3720

CAGTAGCTGT AGTTGGTGAC GTTATTGTTA AAATCGATGC ACCTGATGCA GAAGATATGC 3780

AATTTAAAGG TCATGATGAT GATTCATCAT CTAAAGAAGA ACCTGCGAAA GAGGAAGCGC 3840

CAgcAGaGCA AGCACCTGTA GCTACTCAAA CTGAAGAAGT AGATGAAAAC AGAACTGTTA 3900

AAGCAATGCC TTCAGTACGT AAATACGCAC GTGAAAAAGG TGTTAACATT AAAGCAGTTT 3960

CTGGATCTGG TAAAAATGGT CGTATTACAA AAGAAGATGT AGATGCATAC TTAAATGGTG 4020

GTGCACCAAC AGCTTCAAAT GAATCAGCTG CTTCAGCTAC AAGTGAAGAA GTTGCTGAAA 4080

CTCCTGCAGC ACCTGCAGCA GTAACATTAG AAGGCGACTT CCCAGAAACA ACTGAAAAAA 4140

TCCCTGCTAT GCGTAGAGCA ATTGCGAAAG CAATGGTTAA CTCTAAGCAT ACTGCACCTC 4200

ATGTAACATT AATGGATGAA ATTGATGTTC AAGCATTATG GGATCACCGT AAGAAATTTA 4260

AAGAAATGGC AGCTGAACAA GGTACTAAGT TAACATTCTT ACCTTATGTT GTTAAAGCAC 4320

TTGTTTCTGC ATTGAAAAAA TACCCAGCAC TTAACACTTC ATTCAATGAA GAAGCTGGTG 4380

AAATCGTTCA TAAACATTAC TGGAATATCG GTATTGCAGC AGACACTGAT AGAGGATTAT 4440

TAGTACCTGT TGTTAAACAT GCTGATCGTA AGTCTATTTT CCAAATTTCA GATGAAATTA 4500

ATGAATTAGC TGTTAAAGCA CGTGATGGTA AATTAACAGC CGATGAAATG AAAGGTGCTA 4560

CATGCACAAT CAGTAATATC GGTTCAGCTG GTGGACAATG GTTCACTCCA GTTATCAATC 4620

ACCCAGAAGT AGCAATCTTA GGAATTGGCC GTATTGCTCA AAAACCTATC GTTAAAGATG 4680
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ATGGTGCAAC TGGCCAAAAT GCAATGAATC ACATTAAACG TTTATTAAAT AATCCAGAAT 4 800 

TATTATTAAT GGAGGGGTAA AACATGGTAG TTGGAGATTT CCCAATTGAA ACAGATACTA 4 860 

TAGTAATCGG AGCAGGTCCT GGTGGATACG TTGCAGCAAT TCGTGCAGCT CAATTAGGAC 4 920 

AAAAAGTAAC AATCGTTGAG AAAGGTAATC TTGGTGGTGT TTGCTTAAAC GTAGGATGTA 4980 

TTCCTTCAAA AG CATT ACT A CATGCTTCTC ACCGTTTTGT TGAAGCACAA CATTCTGAAA 504 0 

ACTTAGGTGT TATTGCTGAA AGTGTTTCTT TAAACTTCCA AAAAGTTCAA GAATTCAAAT 5100 

CATCAGTTGT TAATAAATTA ACTGGTGGTG TTGAAAGCTT ACTTAAAGGT AACAAAGTTA 5160 

ACATCGTTAA AGGTGAAGCA TATTTCGTAG ATAACAATAG CTTACGTGTT ATGGACGAAA 5220 

AGAGCGCACA AACATACAAC TTTAAAAATG CAATCATTGC AACAGGTTCA AGACCAATTG 5280 

AAATTCCTAA TTTCAAATTC GGTAAACGTG TTATCGACTC AACAGGTGCT TTAAACTTAC 534 0 

20 AAGAAGTACC a GGT AAATTA GTTGTAGTTG GTGGAGGATA CATTGGATCA GAATTAGGTA 5400 

CAGCATTTGC TAACTTTGGT TCAGAAGTAA CCAT CCTTG A AGGTGCTAAA GATATCTTAG 5460 

GTGGCTTCGA AAAACAAATG ACACAACCTG TTAAAAAAGG TATGAAAGAA AAAGGTGTTG 5520 

25 AAATCGTTAC TGAAGCTATG GCTAAATCAG CTGAAGAAAC AGATAACGGA GTTAAAGTTA 5580 

CTTATGAAGC TAAAGGCGAA GAGAAAACAA TCGAAGCTGA TTATGTATTA GTAACTGTAG 564 0 

GTCGTCGTCC AAACACAGAC GAATTAGGCC TAGAAGAATT AGGTGTTAAA TTCGCTGACC 5700 

GTGGATTATT AGAAGTTGAT AAACAAAGCC GTACGTCTAT CAGCAATATC TATGCAATTG 576 0 

GTGATATCGT TCCAGGTTTA CCACTTGCTC ACAAAGCTAG CTATGAAGCT AAAGTTGCTG 5820 

CTGAAGCAAT TGATGGTCAA GCTGCTGAAG TTGATTACAT TGGTATGCCA GCAGTATGCT 5 880 

TTACTGAACC AGAATTAGCT ACAGTTGGTT ATTCAGAAGC GCAAGCTAAA GAAGAAGGTT 594 0 

- TAGGAATTAA -AGCTTGT AAA - TTCCCATATG- CAGCAAATGG- TCGTGGATTA- TGATTAGATG 6000- 

ATACTAACGG ATTTGTTAAA CTTATTACAC TTAAAGAAGA TGATACTTTA ATCGGTGCTC 6060 

AAGTAGTTGG TACTGGTGCA TCAGATATTA TCTCTGAATT AGGTTTAGCA ATTGAAGCTG 6120 

GTATGAATGC TGAAGATATC GCATTAACAA TCCATGCACA TCCAACATTA GGTGAGATGA 6180 

45 CTATGGAAGC AGCAGAAAAA GCTATCGGAT ACCCAATCCA TACAATGTAA TAACTGATTA 6240 

TCTATAAAGA TTCAGTCATT AAAAGCTGTA GCATATGCTA CGGCTTTTTT GTTTTAGGTA 6300 

AAGTAATGTA AGGAAATTGA TTTGAGATAT CGTTAACATG TGACATGCAT GTTATACTAG 6 36 0 

50 CGATGCTAAT AAAAGAATTG AAATGGAGGG TTCAACAATG GAATATGAGT ATCCAATTGA 6420 

TTTAGACTGG AGTAATGAAG AGATGATTTC AGTGATAAAT TTCTTTAATC ATGTAGAGAA 64 80 
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20 






AATTGTGCCT GCTAAAGCAG AGGAAAAACA AATTTTTAAT ACTTTCGAAA AAAGTAGTGG 6600 

CTATAATAGT TACAAAGCAG TTCAAGATGT AAAAACTCAC TCTGAAGAAC AAAGAGTAAC 6660 

AGCTAAAnAA TAATTCGTTC GAAATTAACA CAATTTAATA GGAATTTTTC TTTAAAACTA 6720 

TTGCTAATAA AG CTATATTT TGATACCTTT ATCAAGTGTT AAACAAAATG TTTGATAAAA 6780 

GTAAACTTAA TATAGCTTTT TTAGGTGGAA AAATAAATGA ACATAGGTAA TAAAATTAAA 684 0 

AATCTTAGAA GAATTAAAAA TTTAACGCAA GAAGAACTTG CTGAACGTAC AGACTTATCG 6900 

AAAGGCTACA TTTCACAAAT AGAAAGTGAA CATGCCTCAC CAAGTATGGA AACTTTCTTA 6960 

AATATTATAG AGGTGTTAGG AACGACGCCA AGTGAATTTT TTAAAGACAG TGAAAATGAA 702 0 

AAAGTATTAT ACAAGAAGGA AGAACAAGTT ATTTATGATG AGTATGATGA AGGTTATATA 7080 

TTAAATTGGT TAGTTTCAAA GTCAAATGAA TATGATATGG AGC CATTAAT ATTAACTTTA 714 0 

AAGCCTGGAG CATCATATAA AAATTTTAAT CCATCAGAGT CTGATACGTT TATTTATTGT 7200 

ATGTCAGGTC AGATAACACT TAATTTAGGC AAAGAGATAT ATCAAGCACA AGAAGAAGAC 7260 

GTTTTGTATT TTAAAGCACG AGATAATCAT CGTTTGTCAA ACGAATCAAA CAATGAAACA 73 2 0 

CGAATACTTA TTGTAGCGAC AGCTTCATAT TTATAGGGGG GATCTTATTT GGAACCGTTA 73 80 

TTATCATTAA AATCAGTTAG TAAAAGCTAT GATGATCTTA ATATCTTAGA TGACATAGAT 744 0 

ATTGATATTG AATCAGGATA CTTTTATACA TTATTAGGTC CTTCAGGTTG TGGTAAAACA 7500 

ACAATTTTAA AATTAATTGC AGGGTTTGAA TAT CCTGACA GTGGTGAAGT GATTTATCAA 7560 

AACAAACCAA TTGGTAATTT ACCACCAAAT AAACGTAAAG TGAATACAGT CTTTCAAGAT 7620 

TATGCATTAT TTCCACACTT AAACGTCTAT GATAATATCG CTTTTGGTTT GAAATTAAAA 7680 

AAATTATCAA AAACCGAAAT TGATCAAAAA GTAACTGAGG CATTAAAATT AGTAAAACTT 774 0 

TCAGGTTATG AAAAAAGAAA TATTAATGAA ATGAGTGGCG GACAAAAGCA ACGTGTTGCA 7800 

ATTGCACGTG CTATCGTAAA TGAACCAGAA ATATTATTGT TAGATGAATC TTTATCCGCA 7860 

TTAGATTTGA AATTGCGTAC TGAAATGCAA TATGAATTAC GAGAATTGCa ATCTAGATTA 7920 

GGtATTACAT TTATATTTGT aACACATGAT CCA 7953 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GGCGTGATCA TACGACCGTC ATTCATGCTC ATGAAAAAAT ATCTAAAGAT TTAAAAGAAG 60 

ATCCTATTTT TAAACAAGAA GTAGAGAATC TTGAAAAAGA AATAAGAAAT GTATAAGTAG 120 

GAAACTTTGG GAAATGTAAT CTGTTATATA ACAGCACTAA TGATnACAAT CATTTTTTAC 180 

ATTTCTATAT GCTAATGTGG CAAGATGAGC AAAACTCATT TTGTGGATaA TGTTTaAAAG 24 0 

TCATACACAC CATACACAAG TTATCAACAT GTGTATAAyT cGcCAAATCT ATGTTTTTAA 300 

GACTTATCCA CCAATCCACA GCACCTACTA CTATTACTAA GAACTTAAAA CCTATATAAT 360 

TATATATAAA CGACTGGAAG GAGTTTTAAT TAATGATGGA ATTcACTATT AAAAGAGATT 420 

ATTTTATTAC ACAATTaAAT GACACATTAA AAGCTATTTC ACCAAGaACA ACATTACCTA 4 80 

TATTAACTGG TATCAAAATC GATGCGAAAG AACATGAAGT TATATTaACT GGTTCAGACT 540 

CTGAAATTTC AATAGAAATC ACTATTCCTA AAACTGTAGA TGGCGAAGAT ATTGTCAATA 6 00 

20 TTTCAGAAAC AGGCTCAGTA GTACTTC CTG GACGATTCTT TGTTGATATT ATAAAAAAAT 660 

TACCTGGTAA AGATGTTAAA TTATCTACAA ATGAACAATT CCAGACATTA ATTACATCAG 720 

GTCATTCTGA ATTTAATTTA AGTGG CTTAG ATCCAGATCA ATATCCTTTA TTACCTCAAG 780 

25 TTTCTAGAGA TGACGCAATT CAATTGTCGG TAAAAGTGCT TAAAAACGTG ATTGCACAAA 84 0 

CAAATTTTGC AGTGTCCAcC TCAGAAACAC GCCCAGTACT AACTGGTGTG AACTGG CTTA 900 

TACAAGAAAA TGAATTAATA TGCACAGCGA CTGACTCACA CCGCTTGGCT GTAAGAAAGT 960 

TGCAGTTAGA AGATGTTTCT GAAAACAAAA ATGTCAT CAT TCCAGGTAAG GCTTTAGCTG 1020 

AATTAAATAA AATTATGTCT GACAATGAAG AAGACATTGA TAT CTTCTTT GCTTCAAACC 1080 

AAGTTTTATT TAAAGTTGGA AATGTGAACT TTATTTCTCG ATTATTAGAA GGACATTATC 1140 

CTGATACAAC ACGTTTATTC CCTGAAAACT ATGAAATTAA ATTAAGTATA GACAATGGGG 1200 

AGT7TTATCA-TGCGATTGAT-CGTGCCTCTT ~TATTAGCGCG~TGAAGGTGGT~AATAACGTTA~ 1260 

TTAAATTAAG TACAGGTGAT GACGTTGTTG AATTGTCTTC TACATCACCA GAAATTGGTA 1320 

CTGTAAAAGA AGAAGTTGAT GCAAACGATG TTGAAGGTGG TAG CCTG AAA ATTTCATTCA 13 80 

ACTCTAAATA TATGATGGAT GCTTTAAAAG CAATCGATAA TGATGAGGTT GAAGTTGAAT 144 0 

45 TCTTCGGTAC AATGAAACCA TTTATTCTAA AACCAAAAGG TGACGACTCG GTAACGCAAT 1500 

TAATTTTACC AATCAGAACT TACTAAAAAT AAATATAAAT AAAGGATGAC GTGATTAATT 1560 

AAAACGTCAT CCTTTATTTT TTGGCAAAAA TAATTCTAGG TGCGTATGTA AAATAAATTT 1620 

50 GGCAGCATTT TAAACAGCAA ATAAAAGACG CCAATTAAAT TTATGACAAA TGTATCCAAA 1680 

ATTTAATAAG TGTGCTTATA TGCCCTTTAA ATTTAAAATT TTAATAGTCA ATAACAAGTT 1740 
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AAAAATAAGA ATTAATTATT TATATGTAAA CGGTTTCTAC CTCTATTTTA AATGAAATTT 1860 

GTGACAAAAA AAGGTATAAT ATATTAATGA CATACAAAGA AATGGAGTGA TTATTTTGGT 1920 


TCAAGAAGTT GTAGTAGAAG GAGACATTAA TTTAGGTCAA TTTCTAAAAA CAGAAGGGAT 1980 

TATTGAATCT GGTGGTCAAG CAAAATGGTT CTTGCAAGAC GTTGAAGTAT TAATTAATGG 204 0 

AGTGCGTGAA ACACGTCGCG GTAAAAAGTT AGAACATCAA GATCGTATAG ATATCCCAGA 2100 


ATTACCTGAA GATGCTGGTT CTTTCTTAAT CATTCATCAA GGTGAACAAT GAAGTTAAAT 2160 

ACACTCCAAT TAGAAAATTA TCGTAACTAT GATGAGGTTA CGTTGAAATG TCATCCTGAC 2220 

1S GTGAATATCC TCATTGGAGA AAATGCACAA GGGAAAGACA AATTTACTTG GAATCAATTT 2280 

ATACCTTAGC TTTAGCAAAA AGTCATAGAA CGAGTAATGG ATAAGGGACT CCATACCGTT 234 0 

TTAATGC 2347 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACAAGACGTn TCTATAACTT ATCTGAAATC GCTCGTCAAG ATAAAGATTA TGCAACTATC 60 

TCATTCTTAA ACTGGTTCTT AGATGAACAA GTCGAAGAAG AATCAATGTT TGAAACTCAC 120 

ATCAATTATT TAACTCGTAT CGGCGATGAC AGCAATG CAT TATATCTTTA CGAAAAAGAA 180 


CTTGGCGCTC GTACATTCGA CGAAGAATAA TTAAACATCA CTACAATAGA CAGATAAATA 240 

TCATACGACA TGATAGGCAT TTGGGTCACT TACAATAACC CAATGTCTAT ATTATTTTGC 300 

TTTACGGAGA TCACTAGATT CATTTTCTGA ATCATTG AT C TGCGTTTTTT CATTTTCAAG 3 60 


GCTAATTATT GTATTTTTAG TCATTTATTT TTTAAACTAC TAATGTTAAT AACTCTAAAT 420 

TTGATGTTGA ATTAATTTGA CGATTTTAAA GCATATCATC ATTTACTTTT TAATCAGAGT 480 

45 TACATCCAAA TGATAGATTT CACGTTATAC CTTCACGTAT AATATTATGT ATCGTTTGTA 54 0 

AGCAAATGAC TAAAAGTCTA TTAATATATA CATTTAATTA ATTGAAAGGA TTGACTACAT 600 

GATACAAGAT GCGTTTGTTG CACTTGATTT TGAAACAGCA AATGGTAAAC GTACAAGTAT 660 

SO TTGTTCTGTC GGAATGGTTA AAGTCATTGA TAGTCAAATA ACAGAAACAT TT CAT ACT CT 720 

TGTGAATCCG CAAGACTATT TTTCACAACA AAATATTAAA ATTCATGGCA TACAACCAGA 78 0 
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aGATTTACCT GTTGTCGCAC ATAACGCGGC ATTTGATATG AACGTCTTAC ATCAAAGCAT 9 0 0 

TCAAAATATT GGTTTACCAA CTCCAAATTT AACTTACTTT TGTAGTTATC AACTTGCTAA 960 

AAGAACCGTT GATTCGTATC GATACGGTTT AAAACATATG ATGGAGTTTT ATCAATTAGA 1020 

TTTTCATGGT CATCATGATG CATTGAATGA TGCCAAAGCA TGCGCAATGA TTACTTTTAG 1080 

GCTACTGAAA AATTATGAAA ATTTAACATA TGTAACTAAT ATTTATGGTA AAAATCTAAA 1140 

AGATAAAGGC TAGGACTAAA TAAAATACTC CCTTCAAAAG TAAG CATTGT AAAAATGTAA 1200 

ACTTTGCAGG GAGCTTTATT TTATATAAAG TCATATATCG TCATATTTTT ATAAGTTGAT 1260 

TGTTCTAAAT TACCTACAGT GACACCAATA AGTCGAATTG GTACATCAGG GTCTTTTAAA 1320 

TCGTTATAAA GTAAATATGC AATATTATAA ATATCTTCTT CAGAACTAAC CGAATCTCTT 1380 

AAACTCATCT GTTTAGATAG CGTTTCAAAT TGATAAGTTT TAATTTTAAC CGTTACAGTT 1440 

20 . TTAGCTGACT TCTGTAATTT ATTTAGACGT TCAGCTGTTT TACCTGnACA ATTCCCATAC 1500 

TTTTCTTAAA ATCTCTTCAT CATCATTCAC GTCTGTTGCA AATGTGCGTT CAGTCCCTAC 1560 

TGATTTTCTT ACTCTTGATG ATTTCACTTC ACTATGGTCA ATACCGCGTG CCTTGTTATA 1620 

25 TAAACCCCGA CCTCTTTTTC CAAACAAACG TATTAATTCA AATTCCGTTT TCTCATATAA 1680 

ATCTCTACCG TTAAAAATAC CATTATCATG CATTACTTTT TTGGAAGCTT TACCTACGCC 1740 

TGGaAAATCT CCAATATCCA ATGTCATCAA AATATCATGG aCATTTTGAT AATCAATCAC 1800 

AGTCATACCA TCAGGTTTAT T CAT AC CACT CGCTAATTTA GCTAAAAATT TGTTATAAGA 1860 

AACACCTGCA GATGCTGTTA AATGTGTCTG CTCTAGAATA TCTTTTCTAA TAT ACTGAG C 1920 

AATTTTCGAA GCAGGAAGGT CTGGTCTCAC TAATTCTGTA ATATCTAAAT ACGCTTCATC 1980 

CAATGACATC GGTTCTACCT TATCTGTATA ACTTCGGAAA ATAGACATAA TCTGCGCAGA 2040 

-TGTTTCTCGG TAAGCACCAA AATTAGTTGT -GACAAAGTAT CCATTTGGAC" ATAATTTATG 2100 

CGCTTGTGAC ATAGGCATTG CTGAATGGAC GCCGTATTTT CGTGCTTCAT AGGATGCCGT 2160 

AGAGACAACA CCCCTACTGC TTGCTTTACC ACCAACAATG ACTGGTTTCC CTTTCAATTT 2220 

GGGGTTATCT CTCATTTCGA CTTGTG CAAA AAAATAGTCC ATATCTATAT GAATAATTCG 2280 

45 TCTCTCAGTC AAGTGCTCAC CTCCCTACTA ATTTTTACTT TTATAACGCA CAAAAATATC 2340 

TCAACATAAT TATACG CTGT GTACGATTTT TTTACATAAA TCTTGCACTT AGCGATAACT 24 00 

ATATTGaGAT AACTACAAGT TGTTATaAAA TCAATTGCTA TTTAAGCATG ATGATGAAGA 2460 

SO CGATTGAGTA AGAAAACATA GGTAATCTGA AATAATTCAA GCAAATTCAT TTTGTTGGTA 2520 

TCATCATATT AAAATTTATT ATTGAGTCGG CTTTTGATGA TACAAATAAA TACTATCTTC 2580 
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AAAGCAATAA GCGGTATGCA TACTAAACAT AAAAATAAGT GATGAATAAC CAAATACCTT 2700 

AATTAAAATA AG CAAGCCAG TACTTAATAG GATTAGTGGT GACAGCATAA TAATTGAGAA 2760 

TTGCCATTTG TTGAAGCAAG CATCTGCTGT TTGGAATAAG ATTCTGTCTT TTTTTATATT 2820 

AAACATAGGT TTGCTATCTT TTTTAAATAA AAGAAATAAT GCTCTATGGA TAAGTTCATG 28 80 

TAAAATCAAT AAAATAATGA ATCCAGCAAA CCCATATACA AGATTGATGA TGATATTTTG 2 94 0 

ATCGACAACC GCTGTGACAC CTAACGCCCA CTTATACGTA AATAAAATCA CGAATAACGC 3 0 00 

AATAACAAGT TGCAAGATAA TAAACCTTCG CATTTGAAAA TTATTTGTCG TTAAATCAAT 3 0 60 

TTTATGCATT ACCAACCCTC CCGATCATGA CATTCTTATT CTTCTTTAAA TATAGTATAC 3120 

AATGTCACAT TTAATTTAAA AAGTTCATAT CAAGAAAGTA AATTGGCTGT AATAAAATTT 3180 

TAATATACGA CVTCTTTCT T CACTTATTAA GGCGAAATTT TATCtCAAAT CATGTGCGCT 3240 

20 ATTTCAAATT GAATAATGCC ACTGTCTCAA CATGTGTTGT TTGTGGAAAC ATATCTACCG 33 00 

GTGTTACCTC TTCAAGTTGA TATTTTTCAG CTAATAATAA TGCATCACGT TGCTGTGTTG 33 60 

CGGGATTACA TGAAATATAG ACAATACGCT TAGGTTCTAA TGTAAGCAAA GTCTGAATAA 34 20 

25 ACGTTTCGTC ACAGCCCTTT CTTGGCGGAT CAACCATTAC AACATCTGGT TTAATCCCTT 34 80 

GTGCTTTCCA TTGTAAAATA ACTTCTTCAG CTTTCCCACA GACAAAAGTT GTATTATTGC 354 0 

ATTGGTTTAT AGTCGCATTT TGTTGTGCGT CTTCAATTGC AGAAGGTACT ACTTCAACAC 3 600 

CGTATACATG TTTTGCAAGT GGTGCCATAT ATAGCCCTAT TGTTCCAATA CCACAATAGG 3660 

TATCTAATAC AACTTCATTA CCTGTCAATT GCGCATACTC AATTGCTTTA TTATATAATT 372 0 

TCTCTGTTTG TTCAGAATTA ATTTGGTAGA ATGACTGATC ACTTATTTTA AATGTACTAT 3780 

CTGTTAATTG ATCAATAATT GTATCTTTAC CATATAGCGT TATAGATTGA CGTCCGATAA 384 0 

TAACATTAGA GTGGCTATCA TTAATGTTTT GTTTAATGCT TGTCACATTA GGAAATGCAT 3900 

CTAATATCTT CTCAACAACA GCATTTTTTT GTGGCCACTT TTTACCATTA GTTACAAAAA 3960 

TAATCATCAT TTCGTCTGTA TGATATCCTG TTCTTACAAC CAAATGTCTC ATTAAACCTT 402 0 

TTTTCAATTG TTCTTGATAA ATACTTACAT TTAAATCTTT TAAAATAGAT TTAACTTCAT 4080 

45 TCATCACTTC TTGATGTTGT GAATCTTGTA TTAAACAACT TTC CATGTCA ATAATGTCAT 414 0 

GGCTTCTTTG ACGATAAAAG CCCATAATAA CTTCATTCTG TTCATTCTTA CCAACTGGAA 420 0 

TCTGGGACTT GTTTCGATAT CTCCAAGGAT CTGTCATGCC AACTGTATCG TTAATCTTAG 4 26 0 

50 AATTATCAAA ATGCGCTTTT CGCTGAAACA AATTAATCAC TTGTTCCTTT TTCATTTCAA 43 2 0 

GTTGTGCTTC GTATGATAAG TGTTGAAGTT GGCACCCACC ACAACGTTCA TAATATATAC 4380 
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AGTTCTTTTT TACTTTGATA ATTTTATATT CAATTTGTTC ATTAATTAAA GCTTGTGGTA 4 500 

TGAAAATAGG AAAGCGATCT ATTTTTACGA CACCATGGCC TTCATGCGTT AAATCAACAA 4 560 

5 CTGTTCCCGT TTTTATGTCA TTTTTAGCTA TTGCTTGCAA AATTTTACCT CCAAAATGAA 4 620 

CAGGTTAGGA ACAAAATTAT GCGCTTCCTA ACCTGCCATT ATATATTTCA CTATTTCTGT 4680 

TTATTCTTCG ATTAAATTGT CATCAACATG ATCATTATTT ATTAACTCTT CATTTACAAT 4 740 

ATCATTAGGT GCAAAGACAT CTATATGACG TTCTAGATTT AAGAAATTCG CTGGTAATTT 4800 

ACCACCATAT TCTCCATCTA CATTTAGTTG TAAGTCTGTG AATGATGAAA TATTAATTGC 4 860 

CTTTGCTTTT TCATAAATAA CTTTAGGATG CTTAGTATGT TCTCCTCTTG AAGCTAAAGT 4 920 


CATAATATGA CCAAGTTCTG CAAGGTTTGA TTTTTCAACT ATAATTAACG TAAAATAGCC 4 980 

GTCATCTAAC TTAGCGTCCG GCACTAATTT TTCAAATCCT GCCATTGAAT TTGTTAAACC 5040 

2Q TAAAAAGAAT AATAATGCTT CTCCTTGGAA AACATTACCA TCATATTCAA TTCTTAAATC 5100 

TACAGCTTTC ATTTGAGGTA ACATTTCGAA ACCTTTGaTG TAATAAGCAA ATGGACCAAC 5160 

AATAGATTTC AATTTACTCG GTGTTTCATA AGAGACTTGC GTCAATTGTC CGCCTGCAGC 5220 

25 TAAATTAATA AAGTATCGAT TATTCATTTT ACCAATATCT ACTTTAGTAG AATGACCTTC 5280 

AATGATGACA TCAAGTGCCC CCATGATGTC ATTAGGTATA TGCAATGCAC GTCCAAAGTC 534 0 

ATTAACAGTA CCCATAGGAA TG AC AC CT AG CTTAGGACGA TTAGGCTTTT CTGCGATACC 54 00 

30 ATTAACTACT TCATTTAATG TTCCATCACC ACCTGCAGCG ATTAATACAT CATAATTTTC 54 60 

ATGCATAGCT CTTTCTGCTT CAAGTGTGGC ATCACCTATT TTCTCGGTTG CATATGCACT 5520 

CGTTTCATAT CCCGCTTTTT CTAATTTTAT TAAGGCATCA GGTAATTCTC TTTTAAATAG 5580 


CTCTTTACCT GATGTCGGGT TATAAATGAT TCTAGCACGT TTCCTCATAT CTTATCCCTC 5640 

' TACTTAAAAT TCATATATTT TAACTTCATC~TTTGTTTCGT CTAATAGGGA GTGGGACAGA ~ " "5700 

AATAATATTT AACAAAATTT ATTTCGTTCT ACCCCAACTT GCATTGTCTG TAGAATTTCC 5760 


TTTCGAAATT CTCTATGTTG GGGCCCCACC CCAACTTGCA CATTATTGt A AG cTGACAGA 5820 

AAGTCAGCTT CTTTGTTTGG GGGCCCCGCC AACTTGCACA TTATTGTAAG CTGACAGAAA 5880 

45 ATCAGCTTCT ATGTTGGGGC CCCACTAGAA TTGAAAAAAG CTTGTTACAA GCGTATTTTC 5940 

TTTCAGTCAA CTACAGCCAA TATAACATTG TAGTGCCTAG GACATTGAAT TTATGACCCA 6000 

GGCTCAGTCT TATTTCATCA TTCTTAATAT CGTTAAAGAC CAACTTGTAT CTTAAACAAA 6060
TACTATCTCA ATATGTACAA AGCTTGTTAT TTATTCAGCA TTTTTTGCCG TTCTTCATTA 6120
TAtAGcTTCG TCAGTTATGC TATTTTACCT TTAAAATGAT GTTGTAAATA TAATGTTGTC 6180
AACGCATTAA TAAAATTAAT ATTTTTACCA TTAACATGTA CAATGAATAA AGTTAAAAGT 6300
AATTTGACTT CTATAGATAT AAATAAACCC TCGATTGCAT CTAAGTCAGC AATCAAGGGT 6360
TTATTTTTTA AATCTTCATA GTTTGATGAT TTAAATTATC TTTTATCTAA TTCTTGTTTT 6420
AATAGTTGAT TTACTAATTG TGGATTAGCT TGACCTTTAG ACGCTTTCAT AATTTGACCA 6480
ACTAAGAAGC CCATAGCTTT GCCTTTACCA TTTTTGTAAT CTTCAACTGA TTGTTCGTTA 6540
TTGTCTAATG CTTCATTTAC AAATTTTAGA AGTGTTGCTT CATCAGAAAT TTGAACTAAG 6600
CCATTATCTT CCATAATCTG TTTAGCATTA CCACCTTTAG CTGCTAACTC TGGGAAGACT 6660
TTCTTCGCAA TTTTACTGCT CATTGTTCCG TCTTCGATAA GTTTAATCAT ACCTGCTAAA 6720
TTTTCTGGTG TTAATTTAGT ATCTAATAAT TCTACTTGAT TTTTATTTAA ATATTCGTTT 6780
ACGCCACCCA TTAACCAGTT AGATGTTAAT TTAACATCTG CACCGTGTTC AATTGTTGAT 6840
TCAAAGAAAT CTGACATTTC TTTAGTCAAT GTTAATACGT GTGCATCGTA TGCAGGTAAA 6900
CCTAATTCAT TTACATACTT AGCTTTACGT TCATCTGGTA ATTCAGGAAT TGTCTGACGA 6960
ACACGCTCTT TCCAAGCATC ATCAATATAT AAAGGTACAA TGTCAGGCTC TGGGAAGTAA 7020
CGGTAATCAT CAGAACCTTC TTTAACACGC ATTAAAATTG TTTTACCTGT AGATTCATCA 7080
AATCGACGTG TTTCTTGTCC GATTTCTCCA CCATTTAACA ATTCTTCTTC TTGGCGTTTT 7140
TCTTCATATT CTAAACCTTT ACGTACATAG TTAAATGAGT TTAAGTTTTT CAATTCGGCT 7200
TTAGTACCAA ATTTTTCTTG ACCATATGGA CGTAAAGAGA TGTTAGCATC ACAACGTAAA 7260
GATCCCTCTT CCATCTTAAC GTCTGATACA CCAGTGTATT GAATAATTGA ACGCAATTTT 7320
TCTAAATATG CATATGCTTC TTTAGGTGAA CGAATATCTG GTTCAGATAC GATTTCAATT 7380
AGCGGTGTAC CTTGACGGTT CAAGTCAACT AATGAATACT CACCTTTATG TGTTGACTTA 7440
CCAGCATCTT CTTCCATGTG AAGACGAGTA ATACCGATTC GTTTTGTTTC ACCGTCGACT 7500
TCGATATCGA TATATCCATT TTCACCAATT GGTTGATCAA ATTGAGAAAT TTGATATGCT 7560
TTTGGATTAT CTGGATAGAA ATAGTTCTTA CGGTCAAACT TAGATTCTGT TGCGATTTCC 7620
ATATTTAGTG CCATTGCAGC ACGCATTGCC CAGTCTACTG CACGCTTATT AACAACTGGT 7680
AAGACACCTG GATATGCTAA GTCGATAACA TTTGTATTTG AGTTAGGTTC TGCTCCAAAA 7740
TGCGCTGGTG ATGGAGAAAA CATTTTTGAG TCCGTTTTTA ACTCTACGTG AACTTCAAGT 7800
CCTATAACTG TTTCAAAATG CATGATTTCC ACTCCTTATA ATTTTTCATA AACGTCATGT 7860
AAATTGTATT GTGTTTCATA TTGATAAGCG ACACGATATA ACGTTTTTTC ATCGAATGGT 7920
TTACCAATGA ACTGTAAACC GATTGGTCGG CCATTTGATT GTCCACAAGG AACAGAAATA 7980
GGATCATCAA TTTCTTCACC TAAATTAAAC GGaGTcrTnAG GCGCTGTTGG ACCAACTACT 8100
ACATCATAAT TTTCGAATAC TTTATCAAAG TCATTTTTAA TCAATGTTCT AACTTTTTGA 8160
GATTTTTTAT AGTAAGCATC ATAGTAACCT GAACTTAATG CAAATGTACC TAAGAAAATA 8220
CGACGTTTTA CTTCTTTACC GAAACCTTCA GATCTTGACA TTTTATATAA TTCTTCTAAT 8280
GAATGAGCTT CTTTAGAATG ATAACCATAA CGAATTCCGT CAAAACGAGA AAGGTTTGAC 8340
GAAGCTTCTG ATGATGCAAT CACGTAATAT GATGGAATAC CAAATTTAGT ATTTGGCAAT 8400
GATACTTCCT CAACGACAGC ACCTAAAGAT TTTAAAGTTT CTACAGCGTT TTGAACTGCT 8460
TCTTTTACGT CATCAGCTAC ACCTTCACCT AAGTATTCTT TAGGTAATGC AACTTTTAAT 8520
CCTTTAATAT CTTTACCAAT TTCAGATGTA AAGTCTACAT CATCAACTGG TGCACTTGTA 8580
GAGTCATTAA CATCTGCACC AGAAATAGCT TCTAATACGA TTGCATTATC / f J f w i w PAf**A'T w T r r 8640
CGAGTCAATG GACCAATTTG GTCTAATGAA GATGCAAAAG CAACTAATCC AAATCGAGAT 8700
ACACGACCGT ATGTTGGTTT CATACCGACA ACGCCACAAT ATGCAGCCGG TTGTCTAATT 8760
GAACCACCTG TGTCTGAACC TAAGCTAAAT GGTACTAAGC CAGCTGCAAc TGCTGCTGCA 8820
GATCCACCTG ATGAACCACC TGGCACTGCT TTATGGTCAA ATGGGTTAAC            8880
AAATAAGATG TTTCTGTTGA ACCACCCATT GCAAACTCAT COAT ATTT A A TTTAf^CGATT 8940
AAAACGGCAT TTTCATTATG TAGTTTTTCC ATTACAGTAG ATTCGTAAAT TGGCACAAAA 9000
CCTTCTAACA TTTTACTTGC ACATGTTGTT TCTAATCCGT TTGTAATAAT GTTATCTTTT 9060
ATACCCATTG GAATACCAAA TAATTTGCCA TCCATTTGAT CTTTTGCTTG TAATTCATCC 9120
AATTCTTGCG CTTTTTTGAT TGCATTTTCT TTATCCAGCG CTAGAAAAGA CTTAATTGTT 9180
GGATCAGTCT CTTCAATTGC ATCATATATA TCTTTAACAA CATCAGATGG tttgattttt 9240
nxiTcrriTA TTAAAGTTAA TAAATTCTCA ACCGATTCGT AGCGAATGCT CATCTTACGC 9300
GTCCTCCTCA TTCATGATTG TAGGCACTTT AAATTGTCCA TCTTCTGTTT CTTTGGCATT 9360
TTTCAAAGCT AATTCTTGTG GAATACCTTT AATTGCTTTA TCTTCACGTA AAACGTTTTG 9420
TAAATCTAAA ACGTGATATG TAGGTTCAAC GCCTTCTGTA TCAGCGCTAT CATTTTGTTT 9480
TGCAAAATCT AAAATGCTTT CTAATGTGTT GGCCATTTCT TCCGTTTCTT CAGGAGAAAT 9540
TTGAAGTCTT GCAAGATTCG CGATATGCTC AACTTCTTCA CGTGTTACTT TTGTCATTAA 9600
TAAAAGCCTC CTTTAAGTCA TTCATCACTA AATTGTATCA AATTTCCAAT TAAAAATCTA 9660
AGTATTTATG AGGTGCTACT TTAATTTCAT ATAAACTGTA TAAACATTAT CATTCGTTTA 9720


TCAAATCATT TTTTATGAAA ACAACACTCT TTTAATATTA GACAACCCAA TTCAATATTA 9780
TATATTGGTA TGCAAGTATT TCAAAAAGAA TAAATTTAAT TTTCCTACTT TTCTAAACAT 9900
TTATCTTTAT GTATAATGTT TTCAAGTAAC TAAATTATAA ATTAAATAAA GGGAGTGTTT 9960
ATCATGCTTA CAATGGGGAC AGCATTAAGT CAACAAGTAG ATGCCAATTG GCAAACTTAT 10020
ATTATGATTG CCGTCTACTT CTTGATACTA ATCGTTATTG GCTTTTACGG TTACAAGCAA 10080
GCAACTGGTA ACCTAAGCGA GTACATGTTA GGTGGACGTA tATTGGACCG TATATTACTG 10140
CATTATCAGC TGGAGCTTCA GATATGAGTG GATGGATGAT TATGGGGCTA CCTGGTTCTG 10200
TCTATAGCAC TGGTCTATCA GCTATGTGGA TTACAATCGG TTTAACATTA GGTGCTTATA 10260
TAAATTACTT TGTTGTTGCT CCTAGACTTC GTGTTTATAC CGAATTAGCT GGAGATGCAA 10320
TTACATTACC AGATTTCTTT AAAAATCGTT TAAACGATAA AAATAATGTG TTAAAGATTA 10380
TTTCTGGATT GATTATCGTA GTATTCTTTA CATTATATAC ACATTCTGGT TTCGTATCTG 10440
GTGGTAAACT ATTTGAAAGT GCTTTTGGAT TAGATTATCA TTTCGGTTTA ATATTAGTTG 10500
CTTTCATTGT CATTTTCTAT ACTTTCTTTG GTGGATATTT AGCTGTATCA ATTACAGATT 10560
TCTTCCAAGG TGTCATTATG TTAATTGCGA TGGTTATGGT CCCTATTGTT GCTATGATGA 10620
ATTTAAACGG CTGGGGAACG TTTCATGATG TAGCAGCTAT GAAACCTACA AATTTAAATT 10680
TATTTAAAGG GTTATCATTT ATAGGAATTA TCTCTCTATT TTCATGGGGA TTAGGTTATT 10740
TCGGTCAACC TCATATCATT GTAAGGTTTA TGTCTATTAA ATCACACAAG ATGCTACCTA 10800
AAGCTAGACG TTTAGGTATT AGCTGGATGG CTGTTGGTTT ATTAGGCGCT GTGGCTGTTG 10860
GTTTAACAGG TATTGCATTC GTACCTGCTT ATCATATTAA ACTAGAAGAT CCTGAGACAT 10920
TATTCATCGT GATGAGTCAA GTACTCTTCC ATCCTCTTGT AGGTGGTTTC TTACTTGCTG 10980
CGATTCTAGC TGCAATTATG AGCACGATTT CTTCACAATT ACTTGTAACA TCTAGTTCAC 11040
TAACGGAAGA CTTTTATAAA TTAATTCGTG GTGAAGAAAA AGCTAAAACG CACCAAAAAG 11100
AATTTGTTAT GATTGGAAGA TTATCTGTAT TAGTTGTAGC AATTGTTGCC ATCGCGATTG 11160
CATGGAATCC AAACGACACA ATTCTAAACT TAGTAGGTAA CGCTTGGGCC GGATTTGGTG 11220
CATCGTTCAG TCCACTTGTG CTATTTGCAC TTTACTGGAA AGGTTTGACA CGTGCCGGTG 11280
CTGTAAGTGG AATGGTTTCA GGTGCCTTAG TCGTTATCGT TTGGATTGCA TGGATTAAAC 11340
CATTGGCACA TATCAACGAA ATATTCGGCT TATATGAAAT TATTCCTGGA TTTATTGTAA 11400
GTGTAATCGT TACATATGTT GTAAGTAAAC TTACTAAAAA ACCTGGTGCA TTTGTTGAAA 11460
CTGACTTAAA CAAAGTTCGT GACATCGTTA GAGAAAAATA ATTCATAAGT CTTAACAAAT 11520
TAAAAAGGTA CTAATGTTAA TCAAAATTAT GACTAACATT GGTACCTTTT TATTATCTTT 11580
AATTAAAGCA CGTGGTTGGT T* TV O/"* K '1'/ wiwiwr AATACGAATT TCATAGTTAT CXjATTTTATC 11700
GAAATATTTA TTCGCTTGTT L-AG 1 AACGTA CTGTGTAATA CCAAU"i"Gi-l-i CAGCTTGTCC 11760
ATAGTAATCG ATTGGTAAAT V- 1 A v. X AC 1 AA TCGTTGTGGC l iittATCAA CAAATTTAAC 11820
TTTCCCTACT GCTTGTGTGA A/%1 inunnAn ATATGATTGC AAA l I AT CAT TAAATTGCTT 11880
GAAATTATTA TTTAAATTTT CATCATAATC TGCTGCTGTT GAAGAAGGTA ATAAAGCTGA 11940
TTTTTCATTG ATATTATGCC ■r\ 1 ll_/\J. lAAu CTTTGTTTGA CTCTTTTCTG CAGTCGCTTG 12000
AGTGATAAAT TCACCTGGTG            TTCACTTGAT TGCTTATAAA TTGCAAAATG 12060
AATTGGTATA TCTT1TAAAT CATCATTTTC ACGTAACCTT GATAATATCT CACTAGCCAT 12120
TTGTTTACCT TGCTTTTTAA TCTAuTrrrr TACTAAAAGT CGATCCATCT 12180
•rn-ixrrxTirr TATAGTAATA MnLAL 1A1 I L. atagctaaac CAATCGTCAT ACCTTTAATA 12240
TTCTTACCTT TTGTATCTCC AL uACCATAA AAATCTTGCT CTAAAATGTT AGATAAATAG 12300
GCTGGTGATT TTTCTGCAAT CiTri'CAGGA TCTGTTTCAC CTtCGTGTGA TGGATTAAGT 12360
CCTAAATTTT CATTCGCTTT CTTGTCTTTT TTATCTTTTT CAGACATTTT ATCGATTTCA 12420
CGTTTTGTAT ACTTAGGATT I AAAT AGG CA TTAATTGTTT TCTTGTCCAA AAATTGACCA 12480
TCTTGATACA AATATTTATC ibl IXSG AAAT ACTTCTTTAC TTAAGTTCAA TAAACCATCT 12540
TCAAAGTCGC CGCCATTATA ACTATTTGCC ATGTTATCTT GTAAAAGTCC TCTTGCCTGG 12600
CTTTCTTTAA ATGGTAACAA 1 G rACGATAG TTATCACCTT GTACATTTTT ATCCGTTGCA 12660
ATTTCTTTTA CTTGATTTGA ACTATTGTTA TG lTiTl'GAT TATltixz CC AGCCTGGTCA 12720
TCCTTATGGT TACCACAAGC AGCGAGTATA AAGATAGCTG TAATCAATAA TACTAATGTA 12780
CGCTTCATCG ACATACCCCT CTAACTATTT AATTCATTTT GCTTATCTAC AAATTGTTGC 12840
TCTGTCCAAA TTTCAATACa TAAACTTTGT GCTTTTGTTA ATTTTGAACC TGCATGTTCA 12900
CCAGCAATAA CGACATCTGT Al I I TIAvi I A ACGCTACTTG TAACTTTAGC ACCTTGTGAT 12960
GCAAGCCATT TAGATGCTTC Al 1 oCG TGTv_ ATTTGATGTA GCTTACGAGT CAGTACTATC 13020
GTTTTACCAC TAAATTCAGG A J/G I X 1 L-A ATATCTGATG TTTTGATACC TTTATAAATC 13080
i AA'rrrnGA ATTaAAGcac GAATATCTTC ATTTTCTAAA 13140
TAAGTAACTA CAGATTGTGC TACTTTATCA CCTATATCAT GAATTTCTAC TAATTCCGCT 13200
TCAGTTACCG TTAGTAATCG ATCTATCGTT TCATATTTTT CTGCTAACAC TTGGCTCGCT 13260
TTAACACCTA AATGCCTAAT ACCTAGACCA AATAATAAAT TTTCTAAAGA GTTGTCCTTA 13320
GCTTGTTGAA TGGCAGCTAA TAAATTATCA ACTTTTTTCT GCCCCATTCT GTCTAAAGGT 13380
TAAAGCTGTT GAATAATTTT AGTGCCTAAA CCATCAATAT TcATGGCTTG TCTTGaTACA 13500
AAGTGnATCa ATCCtTcAAC AAGTTGTGCT TGGTCATTTT GG 13542
(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1893 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

CAGTAAACAC CTCTGATTAC GAATATTTAT ACATTTATTT TAACACATGC ACTGATTTAC 60
GACTACTAAA CACCTTTACG TAAAAAGGGT AAACATGGTT TATCTATCTT GGTTATCTAT 120
TTATAAATAT TTtlTCATATT ACGCATAACA ATTGCTTAAA ATATGTATAA AAATGAATAT 180
ATGTGTAATA AACTTGCTAA TTATTAGATT TAATAAGCGT CAATTGTTTG AACATATTtA 240
ATTAAAATCA CATTGATATC ACAGATACGA ATATTGTCGT ATAGAAATTG AAAATTCTAT 300
TTTTTAAATG AAAGTCTTCA ACATAATTTT AAGTTTCAAC ATGAGAAAAA TCGATTAACA 360
AACAACGTCA GTTGAATATG CCTTTTGAGA CATTTCAAAC TTTACAATTG TTGCTAATCG 420
ATATATTTGC TTTTAGTGAT CCCTGCTATA AAATAAATCA ACGATTTCTA ATAAGTGTTT 480
TGTATTGAAT TGTTCATCAA TTTGCGTTAG TTCATCCACT GCTGCGTCTC TATGATAAGT 540
CAATTTATCT TCTGCGCCAT CTTTCCCTAA TAAACTCACG TACGTACTTT TATTATTTTC 600
AAGATCGCTG CCCACTTTTT TACCTAACTT TGCTTCATCA CCATAGCAGT CTAATAAATC 660
ATCTTTAATC TGGAACATCA TACCTAAATG ATAACTATAA CTTTCTAAAT GTTCTTTAGT 720
TGTSTCATCG ACATTAGCGA TATCTGCTGC ACTCATAACC GCAAAAGTTA ATAATGCTCC 780
TGTTTTTGTT TTGTGTATCA TTTCCAAAGT TTCAAGATCA ATTGGTTGGC CTTCGCTTTG 840
CATATCTAAC ATTTGACCGC CGACCATTCC AACATGACCA CTTGCTATTG ACAGCCGTTG 900
TAGAACTTTT ATTTTTACTT CATCAGTTAA TCTATCATCA CTTGAAATAA GTTCAAATGC 960
TTTAGTTAAT AAAGCATCAC CTGCTAATAT CGCAGTCCAC TCACCATATA CTTTATGATT 1020
TGTTAATTTT CCTCGTCGAT AATCATCATT ATCCATCGCT GGTAGGTCAT CATGAATAAG 1080
TGAATATGTA TGAATCATTT CTAGTGCAAT TGCGCTCTTC ATACCTAACT CATACTCGGT 1140
ATTTAGTGAA TCTAAAGTGA GTAATAACAG AACTGGTCGG ATGCGTTTAC CTCCAGCATT 1200
TAATGAATAC AACATACTTT CTTCTAGCTG AGTATCCATT ACTGATTTAT TTATCGCAAC 1260
CATCCTCAGC TTCTTCTTTT ATTAAGTCAT TCACCTTTTT TTCGGCATTT TTTAAAGTTG 1380
TGTCACAAGC TGCTGATAGT TTCATACCAC GTTGATATAA ATCTAATGAT TCCTCTAAAG 1440
ATACTGTTTC ATTATCTAAT TTTTGAACAA TTTGCTCTAA TTCTTGCATC ATTTCTTCAA 1500
AACTTTGCGT TTCTTTAGTC ATTATTACAC CTTACTTTCG TAACTTTTGC ATCTACTAAG 1560
CCATCTTTCA TTGTTAACGT CAATTGATCA TTTTCTGTTA AATCTTTAGT ACTCGTAATG 1620
ACTTCGTCTT TTTTATTAAC AATTGCATAT CCACGCAACA TTGTATTAGT TGGACTTAAA 1680
TTGTTTAAGT TTTCTACTTT ATTTTTCAAA TCATTTTTAT AACTTAATAT CTTAGAATTC 1740
AATAATTTAA CAAGTTGGTT TGTCAATTGA AGATTATnTT GTTGTTCTTG ATTAACACTA 1800
CTTAGTAATG CTTTTAAATn ATAACGTTGG TGCAACAGCA TTAAATCGAG GCCCCGGTGG 1860
TCCAAAGTTG CCCGAATTnG TGGTTTCAGG CCC 1893
(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

AAAATATATT CCTTCACTTA ATATTCAATT AGAGAAAAAC ATGGTGATTG TAATATGTTG 60
TGCAATATTT CTGGGTGTTT TAATACTTTT TTTATTTCTG AATCGTAAGC TAAGGTTGGA 120
AATTTATAAT AATAACTCTA GTAAAGGGAA AATAATTTTA TTTCCTTCAT TAAAAAACTT 180
TTGTTTCACA ATATTTTATT ATTTTTTATT TGGCGGTCTT TCAATAATGG CTCTAAGTAT 240
GTTATTAACT TTAAATCCTC AAAATATAAT AGGCTTTATT GGTTGGTTGG TAATGACTGC 300
AGGTTTCTTT CTGTTAAACA TGTCATCGAT TATTGACAAA AAAATTTATG TATTATCTAA 360
AACTAACACG GTGGAAAAAT GATGGTTTAG CTGGATTTAC TGCAGGTTCT ATTTCGGCAA 420
TACTTGTATA TTGGACCAAT CAAAAAAATG AATTTGGAAT AAAAGATAAA AACGATTGGA 480
TAGGACATAA ACTAGACGTT GGTATAGATG CTGTAGAAAA ATCTGCAGAA AAAACAGTAG 540
ATGGTGTTGA AAATGTCATG GTGAAGCTTC AAAAAGTATT TCTAATCATA TAAGCCCTAA 600
GAAATGGAGC TGGTAAATGT TGCTATGCGA ATCTAAAATC ATCAATAAAA ACCCAAAATA 660
TAGAATTATT AAATATAATG ATGAATACTT AATGGTCGAT ATAATAAGCA CTTGGATTAG 720
TTTATTTT1T CCTTTTATTA ATTGGTTCAT CCCaAAAGaA TACGTCAAAA TTAGTAGAGA 780






(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2343 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:





ARATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT ACAATACTTC 60
flTlV T*TY3 IV ATfS K3 1 f\ X lunnlu OL1 J. \»<JV> X X X CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG ACTGGCACTG 120
nu X v_ x v_vj\_ v^r\ TTAATACTAC GTATTAACAT GTAATTTTAC TTTGAAATAC 180
>]w|m | »pft TA 71 7A 7A 71 X X X Xn/vvvVrv TAAnACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA ATAAAATGAA 240
*T w T w T ir P ^*TV^TY*1 V X X 1 lUlVJiVJi X X V—^V— X X CTTATAATTT AATAAATACC ACTAAACTAA ATTAACGAGG 300
TGCCTTATGT ATAAAAATTA TAACATGCCC CAACTACACT ACCAATAGAA ACTTCTGTTA 360
GAATCCCTCA AAATGATATT TCACGATATG TTAATGAAAT TGTTGAAACr ATACCTGATA 420
GCGAATTCGA TGAATTCAGA CATCATCGTG GCGCAACATC CTATCATCCA AAAATGATGT 480
TAAAAATCAT CTTATATGCA TATACTCAAT CTGTTTAATT ATGTTCAAAG CATTAAGGTA 540
ACAAGACAAT ATCTAAGATA TCAAAGATAG aaattttttg ACGTTGTTGC TGATTGTAAA 600
CATAACCATC AATTTCATAA TTAATAGCAT CAATACGATA AATGGTTAAG CGTACTGAAT 660
CTACAAAGCC ATTATTATAA AATTTAACTT CTACAGGTTG GGCATATTGT AGCGCCTCGT 720
GTAGCCGAAT GTTTAGCTCA GCCAATTGAT CATCTGATAA TACAGGACGT GTAATTTTGT 780
TTTGGTCGAT AATGTATTGT TGAATCGTTT CGAATTGTTC GGGTAATGTT GwVvviOtAfw 840
CCO?TTTAAT CATGCCTCTT CCCATAGGTA TATTGTTATC TAGTAATTCT CTTGGAACGT 900
TACGATAATC AGTTTCTTCT TCATAACTTG TCATCCTTAA TTCACCCCAA TCTGATAATT 960
ACATTATACG AACATGTGTT CTATTTTGCA ACAAAAATTT TGTGGaAGCA TAAACGCGTT 1020
AATAATTAAT GCTCGTGtAA GTAAAAAAGA GGGATTAATT AAAATCGAAT AATGaCATAT 1080
CACaGCAAAT AGTTCTTTTA AAGTAGTTAA ATAGTTTTAG CTTTAAGGAA aTGATAAaTG 1140
ATTGTwAATT CTAGCTAAAA TTTAATAAAA TGAAAATAAG ACTAACATGG AGGGGTAAAA 1200
GTAATGACAA ATGGATATAT TGGTTCTTAC ACTAAAAAGA ATGGTAAAGG GATTTATCGT 1260
TTTGAATTAA ACGAAAATCA GTCACGTATT GATTTATTAG AAACAGGATT TGAATTAGAA 1320
GCGTCTACAT ATTTGGTGCG TAATAATGAA GTTTTATATG GAATCAACAA AGAAGGAGAA 1380
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TGTTTGTCTT CAAAAGCTGG TACAGGTTGT TATGTATCGA TTTCAGAAGA TAAACGATAT 
TTATTTGAAG CGGTATATGG TGCTGGCATC ATACGTATGT ATGAATTAAA TACGCACACA 
GGTGAAATTA TACGTCTAAT TCAAGAACTT GCACATGATT TTCCAACAGG TACACATGAA 
AGACAAGATC ATCCACACGC ACATTATATT AATCAAACTC CAGATGGTAA GTACGTTGCA 
GTAACAGATT TAGGTGCTGA TCGTATCGTT ACTTATAAAT TTGATGACAA CGGGTTTGAA 
TTTTATAAAG AATCTTTATT TAAAGATAGT GATGGGACAA GACATATTGA ATTTCATGAT 
AATGGAAAAT TTGCTTATGT CGTACACGAA TTATCAAATA CTGTGAGTGT TGCAGAATAT 
AATGACGGTA AATTTGAAGA GCTCGAGCGT CATTTAACAA TTCCTGAAAA CTTTGATGGA 
GATACTAAAC TTGcAGCAGT GCGTTTATCT CATGaTCAAC AATTCTTATA TGTATCTAAT 
AGAGGGCATG ATAGCATTGC AATTTTTAAA GTTCTTGATA ATGGTCAACA CTTAGAACTA 
GTAACAaTTA CTGAAaGTGG TGGTCAATTC CCAAGAGATT TTAATATTGC CTCATCAGAT
GACCyTTTAG TTTgTGCTCA kGaGCaAGGA GATTCAGTTG TAACTGTTTT CGAAAGAAAT 
AAAGAAACAG GTAAAATTAC GCTATGTGAT AACACTCGTG TAGCATCTGA AGGTGTATGT 
GTCAXATTTT AATCTTTAAT TAATCATGAT AAAAAGAAAA CCATGTTTCC AAAAAATTTG 
TGTATACCTT GAAATTTATT GnTTTCCAGn ACATCAATTA TGGGAAGCAT GGnTTATTTT 
TGT 

(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4837 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
AAATTGCCAG TTGGTATCGC TTCTGGTGCA GTAGTCGAAG GTTTCTTCCA AGGTATCATT 
CCGATTGGCT ATATCGTTAT GATGGCAGTA TTGTTATACA AAATTACTGT TGAATCTGGA 
CAATTTTTAA CAATTCAAGA TAGTATTACA AATATTTCAC AAGACCAACG TATTCAAGTT
TTACTTATTG GATTTGCATT CAACGCATTT TTAGAAGGTG CAGCAGGATT TGGTGTACCA
ATTGCAATTT GTGCACTTTT ATTAACACAA TTAGGATTTA ATCCATTAAA AGCTGCGATG
TTATGTTTAG TCGCAAATGC AGCGTCTGGT GCTTTTGGTG CGATTGGTAT CCCTGTAGGT
GTTGTAGAAA CGTTGAAATT ACCTGGAGAT GTTTCAGTAT TAGGTGTTTC TCAATCAGCA
GGTTTTAGAG GTGTTAAAGA AACATTACCA GCAATTTTAG TAGTTTCAAT CACTTATACA 540
CTTACTCAAG GATTATTAAC TGTATTCAGT GGACCTGAAT TAGCAGATAT TATTCCACCG 600
TTATTAACAA TGTTAGCATT AGCAGTATTT TCTAAAAAAT TCCAACCAAA ACACATTTAT 660
CGTGTTAATA AAGATGAAGA AATTGAACCT GCAAAAGCAC ATTCTGCAAA AGCAGTATTA 720
CATGCATGGA GCCCATTCAT TGTATTAACA GTCATTGTAA TGATTTGGAG TGCGCCATTC 780
TTTAAAAACT TATTCTTACC AAATGGTGCT TTATCATCAT TAGTATTTAA ATTCAACTTA 840
CCTGGaACAA TCAGCGAAGT TACGCATAAA CCATTAGTAT TGACTTTAAA TATTATTGGA 900
CAAACAGGTA CAGCTATTTT ATTAACTATT ATTATTACAA TTTTAATGTC TAAAAAGGTT 960
AACTTTAAAG ATGCAGGTAG ATTATTCGGC GTTACATTTA AAGAGTTGTG GTTACCAGTT 1020
CTTACAATTT GTTTCATCTT AGCAATTTCT AAAATCACAA CTTATGGTGG TTTAAGTGCA 1080
GCAATGGGTC AAGGTATTGC TAAAGCAGGT AATGTCTTCC CAGTTCTATC ACCAATTTTA 1140
GGTTGGATAG GTGTGTTTAT GACAGGATCA GTTGTAAATA ACAACTCATT ATTTGCACCA 1200
ATTCAAGCTT CTGTTGCACA ACAAATTGGA ACAAGTGGTT CACTTCTTGT ATCTGCTAAT 1260
ACAGTTGGTG GTGTAGCGGC AAAATTGATT TCACCACAAT CAATTGCAAT TGCAACTGCA 1320
GCAGTAAAAC AAGTTGGTAA GGAATCAGAA TTATTAAAAA TGACATTGAA ATACAGTGTA 1380
TGTTTACTAA TATTCATCTG TATTTGGACT TTCATCTTGT CATTATTATA AAAAAACGTA 1440
TTTCAAAATA TAAATATACA GAAGGTGAGA TGTTTTCTAA CATCTCATCT TTTTTTTATG 1500
GATCATTAAT GAAAGAAGTT TGACATTATA ATAATGGTAG CGCTTTATGT TAAAATGAAT 1560
AGTGAGTAAT CAGCAATCAA ATTAAATTGG TTGATAGCTG TTAAGGTTTG TGGTTTTGTC 1620
TTTGTGCTAT CGCnCATAAA GTATATAATT AAAGTAGTTT CGTTATTATA AAATATTAAT 1680
ATACATAGTA GATAGTAATA GAGCATCACC ATGGGAACCT ATTGAGACAC TTATTGATTT 1740
AAAGTGGTAT TAATATGTCG TATTTCTCGA ACGTTCCATT ATTCATTTTA AAAAGGGGGA 1800
CTGTATTTGT TATGACAACA CAACATAGCA AAACAGATGT CATCTTAATT GGTGGCGGTA 1860
TTATGAGTGC aCATTAGGAA CATTACTTAA AGAATTATCA CCTGAGAAAA ATATTAAAGT 1920
GTTTGAAAAA TTAGCACAAC CTGGCGAAGA GAGTTCAAAT GTATGGAATA ATGCCGGTAC 1980
AGGGCATTCA GCACTTTGCG AGTTGAACTA TACAAAAGAA GGTAAGGATG GCACAGTTGA 2040
TTGTAGTAAA GCAATTAAGA TAAATGAGCA GTACCAAATT TCAAAACAGT TTTGGGCATA 2100
TTTAGTTAAA ACAGGACAAT TAGATAACCC AGATCGCTTT ATTCAAGCGG TGCCACACAT 2160
GAGTTTTGTC ATTGGCGAAG ATAATGTAGC TTTTATAAAA AGTCGTGTTG CAACGTTAAA 2220
GGTACCGTTA ATGATTGAAG GTCGTAAGTC TGATGAACCA ATTGCTTTAA CTTATGATGA 2340

AACTGGTACa gATGTTAACT TTGGTGCGTT AACTGCAAAG TTATTTGATA ATTTAGAGCA 2400 
ACGTGGTGTG GGAATTCAAT ATAAGCAGAA TGTATTAGAC ATCAAGAAAC AGAAATCTGG 2460
GGTATGGCTA GTTAAAGTTA AAGATTTAGA AACTAATGAA ACGACAACAT ATGAATCTGA 2520
TTTTGTATTT ATTGGTGCTG GCGGTGCGAG TTTACCATTA CTCCAAAAGA CTGGGATTAA 2580
ACAATCAAAA CATATTGGTG GTTTCCCGGT AAGTGGATTA TTCCTGCGCT GTACAAATCA 2640
AGAAGTGATT GATCGTCATC ATGCTAAAGT GTACGGAAAA GCAGCAGTGG GTGCGCCACC 2700
AATGTCAGTG CCGCACTTAG ATACACGTTT TGTAGACGGC AAGCGTTCAT TGTTATTTGG 2760
TCCATTTGCA GGTTTCTCAC CTAAATTTTT AAAAACAGGT TCACATATGG ATTTAATTAA 2820
ATCGGTTAAA CCAAATAATA TCGTGACGAT GTTATCTGCA GGTATCAAAG AAATGAGTCT 2880
TACGAAGTAT TTAGTGTCAC AATTGATGTT ATCTAATGAT GAGCGTATGG ATGATTTAAG 2940
AGTCTTTTTC CCAAATGCTA AAAATGAAGA TTGGGAAGTG ATTACAGCAG GGCAACGTGT 3000
CCAAGTAATC AAGGATACTG AGGATTCTAA AGGTAACTTA CAATTTGGTA CTGAAGTTAT 3060
TACGTCAGAT GATGGCACAT TAGCTGCATT ACTTGGTGCA TCACCTGGTG CGTCAACAGC 3120
TGTAGATATT ATGTTTGATG TTTTACAGAG ATGCTATCGT GATGAATTCA AAGGATGGGA 3180
ACCAAAGATT AAAGAAATGG TGCCGTCATT TGGTTATCGC tTAACAGATC ATGAGGATTT 3240
ATATCATAAA ATTAATGAAG AAGTAACTAA GTATTTACAA GTTAAATAAT AAACGAAACG 3300
GTAATGTCTT TTTTAATGTG ATAGACATTA CCGTTTTTTA GTGGTTAATA AAAATCATTT 3360
TAATTGTTTC AGTTGCTTGT TAATAGTGTC TACGTAGTTC TTGTTTTTAA AGAATTGAAT 3420
TATCCAAATT AATACATAAA CCACAATGAA GATAATTGTG AATATGATTA GATAATGCAC 3480
TGTTAGTGGA AACCAACCGG CAAGCATTGC TAAAGGCAAG AATCCGACAT ACGTTGTTAT 3540
GAAATGCATT ATAGTTGCTT TAGTAATGCT CCAATCTGTG TATTTAAAGA TAAAATCTCC 3600
AAGGAAAAAG ACGACGCCTA TGAGTAACCA TAAAATGATA GAAATCAACA TTACGGTAGT 3660
TTCTGTGAAA TGCGTATAAT ACAATATGCC AATAGTTGAT TGTGGGTTCA GTGGATAATA 3720
TTTGCCGTCT GCAAATAACA TACTAAAGAA CAGTGAAAGG GACAAACCAA TGATTAAGCT 3780
AATAAATAAT GAGTTTTTCA AATTTTTCAT ATTGATAAGC GCTCCTTTAT AGATTTTAAA 3840
TAACGTCTAG AAGAATAGGT GTAGTGTGCA TCTTTAAGAT ACATACGTAT AAGTCCATTT 3900
GGCTCTAATA ATAATTTTTC AATGTAATAC TTGTTGACGA TTTCTGATTT GGAAATGCGA 3960
ATGAAATGTT GTGGTAACTG TTTTTCTAGT TCATAAAGTC GTAATTTTAG TTTGAATTTT 4020
ACATTAATGA TATGGATTTC TTTGTCTATG TATCCGACTA ATGTATGTGA TTTGTCTAAA 4140
TCATTGACTG CATTAATAAT ACTTTGAACG TTATCATTCA TTTTAGGTGC ATGTATATCA 4200
ATATAAGATT CCGTCTCATT TGCATTGATA AATAAATTGA GTTTCATCAT AGGTTAATGC 4260
CTCCTTCAAA ATTATTAAAC CATAAATGAC CATCGATATA TTTAAATTTT GTTGAATGGT 4320
AGAAATTAAA TGTTAAGTGG CTAGAAAGCG CTAATCAATA TAAAAGATAC CTCCTGAAAT 4380
AAAAACAGAA ATGTTTTTTC AGGAGGTAGA GATTAAAGTG AATTATTTGG CAGTGTAATA 4440
GTAAAGGTGG TTACATACTC GTTACTTTGT GTGAATTGGA TTGTACCATG ATGCAATTCA 4500
ATGATGGATT TTGTAATTGC AAGACCTAAA CCATTGCTAT TATCATGTTT GCTCACTTTA 4560
TAAAAACGTT CAAATAAACG TGCTTCAGCT TGTGGACTAA TTGGTGAACC ATCATTACTT 4620
ATTGTGAAAA TGATATTGTT GTGACTATGT TGCAAAGCGA TGTCAATGGC ACCACCAACA 4680
TCTGTATACT TAATAGCATT TATTAATAAA TTACTCAATG CTTGATGTAA CAAACGTTGA 4740
TTTCCTAGGA AATTGATGAT TCTAGGTCAG CTAAnATGAT TAACGACTTT TCATCAGCAG 4800
CAnATTGTTC ATGTCGAATG ATATCnTTAA TGAGCTG 4837

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

ACAATTATTG GATTATTATC AAGCAACGTT AATGGATGAC TTCCACTTAC AACAGAAATG 60
CCCATAGATT CTAAATCTtT TGCATGAGCA TCTTGTGATA AGTCTTTTCC ATCATTGACA 120
GTTACATTCG CACCTAATTT ACTTAATAAT TTAGCTGCTT CATAACCACT TTTTGCCAAA 180
CCGACAACTA ATACATTTTT ATTTTCTAAC CCTGTATAAT TAAGCATCTT AATGCACTCC 240
AATCCATAAA CCGATTAAAC CTGAAATCAG ACCAACAGCC CAAAATACTG TAACTACTTT 300
CCATTCGCTC CATCCTATCA ATTCAAAATG ATGATGAATC GGACTCATTT TAAATATACG 360
CTTTCCAGTC AATTTAAAGC TAGCGACTTG TAACATAACA GATAATGTTT CAATTACGAA 420
TACTAAACCT ATAAAAATTA ATGATAATTC CTGATTAAGC ATGATTGAAA TGGTAGCAAA 480
TATACCACCT AAAGCTAAGC TACCTGTATC TCCCATAAAC ACTTTAGCAG GGTTAATGTT 540
ATATGGTAAA AATCCTAAAA GTGCAAACAA CATAATGATA CAGAAAATAC CAATTGCCGT 600
TGCTAATCCA TCTAAACCAT CTGTTAAATT TACTGCATTA GAAAAACCTA CTTGCCAAAA 720
AACAATGAAA ATAACATATG CAAATGATAG TGGGATTGCT ACATTCGTAA ATGGAATATG 780
TATGCTCGTA GAAAAATTCA CCAAATGAAA CACATTACTT AAAACAAAGA ATATAATCGC 840
AATACCAATT TGCGCCAAAA ACTTCTGTTT ACTTGTTAAA CCTTGGTTAT TCTTTTTAAC 900
AACAATAATA TAATCATCTA TAAAACCAAT TAACCCAAAA CCAATCGTCA CAAATAATAA 960
CAGTATGATT GGATTAGCTT GATCTACAAA TATAATAGCC ACCAAAGACG TTATCACAAT 1020
ACTTAATAGA AATGTTAGTC CACCCATCGT TGGTGTACCA GTCTTCTTCA TATGGCTTTG 1080
TGGACCTTCT TCTCGAATAC TTTGACCAAA TTTCATCCTT TTTAATGTAG GTATTAAAAC 1140
AGGTACCAAA ACAAATGTAA TCACTAGCGC TAATAACGCA TATACAAAAA TCATAACTAT 1200
CTCCTCTTCT TAATCCAGAC TTTTTTAACC ACTAATATAT TATCAATTTT TCAATTAAAT 1260
AAACAAAGTT GTAATCAAAA TTTATAATTT TTCTTTTTTA CGGCATAAGA GGCCAGTATA 1320
AAAAGTTTGC CTATAACAAA CAAGTTAATC TGACCTCGTC TACCTTAAAA TTCTCTATCA 1380
ACACTTATTT ATAAAGATTA AATGAAGATG TTGTTTTCTA TCACAGCATT ACTTTAGTAA 1440
AAACAAATAG TGACAATACA TCCTAATTTA ATGTAGCCAT TCTTGTTAGT CCGACTTATC 1500
CTTGTCAGTT TTACTGTCAG ATTTCnTCTT ATCATCTGAA TTTGAATCAG AATTATTCGT 1560
CGAATTGCTG TCTACATTCT CTGGATGGAA AATTCTACGT 1600



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

ATTGCCTTTG TTTTAATTTT AAATCAAAAT mGCCTATGAA AGATTTAAAT CAATTAATTT 60
CTATAATATT ATCATTTTTA AAGCATATCA TTGTTTAGTT TTTTTATAAT TGGATAAATA 120
CTAATAGTTA CTTTATAAAA CATTACATAG AGAAAGGTTA AGGAGTGCAC ATGTCGAAAA 180
AGGATCACTC TTCTTCAAAA TACCTTAATT CTGTTAAGGA AGCGCAAGAG GAGTCAAAAA 240
AGAAAAATAA AAGTAATCCC AAAATTGATG TTGATCGTAC ATATATTGAA CCTCAACAAT 300
TCCAATCTAA GAAACCTAAA AAAGATGATC AGGTTTTCTT CTTATCAAGA TTAAATAAAC 360
CTGCAAAATA TAAGAAAGAC TCTAATTTCT TATCATATCT CATCTATCGC ATAGGAAAAG 420
TGTTGCTTTT CCTATTAACA TTATTACCAT TTTTCAATAT TAAGCAGAGT CAAATTACTA 540
ATATGTTAAG CAATGCACCC GCTGAAACAT CTACTCTAAT TAAGAGTGTA ATTGGTGATA 600
TAACTCAAAA CTCCAGTGGT GGCTTATTAT CTATCGGTTT GATTTTAGCA ATTTGGTCAG 660
CTTCAAATGG AATGACTGCA ATTATGAATT CTTTCAATGT TGCTTACGAT GTAGAAGATA 720
GCCGTAATGG AATCGTATTA AAACTACTAA GTGTTGTCTT CACTGTAGTT ATGGGCGTTG 780
TGTTTGTAGT TGCTCTAGCA TTACCAACGC TTGGTTCTGT AATTAGTCAT TTCCTATTCG 840
GTCCACTTGG aTTTGACGAA CAAGTGAAAT GGATTTTTAA CCTTATTAGA ATTGTGTTAC 900
CAATCATTAT TATATTTATC ATATTTATCG TGTTATATTC GGTTGCACCT AACGTTAAAA 960
CGAAGCTTAA GTCAGTATTA CCAGGTGCAG TATTTACTTC AATTATTTGG TTAGCTGGTT 1020
CATTTGGTTT TGGTTGGTAT ATTTCAAATT TTGGTAACTA TTCTAAAACA TATGGCAGTA 1080
TCGCGGGTAT CATCATTTTG TTACTATGGT TATATATCAC AAGTTTTATT ATAATTGTCG 1140
GnGCTGAAAT CAATGCAATC ATTCATCAGC GTAGTGTAAT TAAAGG 1186

(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7872 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

TCTTGAGCCA TCTTTTGAGC TAACTGACTA GATTGATACC CAAAAATCAT AGTTACCAAC 60
ATAAACTTTA ATTTTACCGA AGTCTAAATC AGCGATATGA GTACATACAT TATTTAAGAA 120
ATGACGGTCA TGCGATACTA CGATAACAGT ATTATCAAAG TTAATTAAGA AATCTTCTAA 180
CCAACTGATT GCTGGAATAT CGAGACCGTT AGTAGGCTCA TCCAGTAATA GTACGTCTGG 240
TTCACCGAAT AAACTTTGCG CTAATAATAC TTTAATTTTT TGGTTGTTTT CTAATTCAGC 300
CATTTTTTTA TCGTGTAAAG TTGGATCGAT ACCTAAACCA GATAAAAGGT TAGCAGCATC 360
AGCTTCAGCA TTCCAACCAT TCATTTCTGC AAATTCACCT TCAAGTTCAG CAGCACGGAT 420
ACCATCTTCA TCACTGAAAT CTGGCTTCAT ATAGATTTCA TCTTTTTCTT TCATAACCTC 480
ATAAAGACGT TCGTGACCTT TAATTACAAC ATCAAGCACG CGTTCATCTT CATAAGCATA 540
GTGGTCCTGT TTTAAAACAG CTAGACGTTC ATTTTTCCCT AATGAAACAT GTCCTGTTTG 600
AGAATCTAAT TCACCAGATA ATATTTTTAA GAATGTTGAT TTACCTGCAC CATTCGCACC 660
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ATCTCCAAAA CGTAAACTCA CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT 780 

TCGATATTCT AACGGAAGAA TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT 840
TGTAATGATA GAATGACTCA CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA 900
ATAACATGGC ATTAGACAAA GATATAGTAG GTTCTATAGA ATTCCTTGAA GTAGTAGGGT 960
TACAAGGTTC AACTTACCTT TTAAAAGGAC CAAACGGTGA AAACGTAAAG TTAAACCAAT 1020
CAGAAATGAA CGATGATGAT GAATTAGAAG TAGGTGAAGA ATATAGTTTC TTCATTTATC 1080
CAAACCGTTC AGGTGAATTA TTTGCAACTC AAAATATGCC TGATATTACG AAAGATAAAT 1140
ATGACTTTGC TAAAGTACTT AAAACGGATC GCGATGGGGC ACGTATAGAT GTTGGATTAC 1200
CCCGTGAAGT GTTAGTACCA TGGGAAGATT TACCAAAAGT GAAATCACTA TGGCCACAAC 1260
CTGGTGATTA TTTGCTAGTT ACATTACGAA TTGACCGTGA GAATCATATG TATGGACGTT 1320
TAGCGAGTGA ATCTGTTGTA GAAAATATGT TTACACCTGT ACACGACGAT AATTTAAAAA 1380
ACGAaGTCAT TGAAGCCAAA CCTTACCGCG TATTACGAAT TGGTAGCTTT TTATTAAGCG 1440
AATCAGGTTA CAAAATTTTC GTACATGAAT CAGAACGTAA AGCTGAACCA AGATTAGGTG 1500
AATCTGTTCA AGTTAGAATT ATCGGGCATA ATGATAAAGG TGAGTTAAAT GGTTCATTTT 1560
TACCACTTGC ACATGAACGT TTAGACGATG ACGGCCAAGT CATCTTTGAT TTACTAGTTG 1620
AATATGATGG TGAATTACCA TTCTGGGACA AATCAAGCCC TGAAGCGATT AAAGAAGTAT 1680
TCAATATGAG TAAAGGTTCA TTCAAACGTG CAATCGGTCA CTTATATAAA CAGAAGATTA 1740
TTAATATAGA AACAGGTAAA ATCGCTTTAA CTAAAAAAGG TTGGAGTCGA ATGGACTCAA 1800
AAGAATAATC ATTTTTACAC GTGTCGTAGG ATGCGTGTTT TTTTTATTCA ATATTAAATC 1860
GGACAGATGA AGTAGTTTTT TAAACATTCC TTTCAAAGTA AAAAATTAAA TAATTCAAAC 1920
GAATAGGGTG GGaCATTAAG TTCTTAGGCA ATGTAAAAAA GGTGATTTCT ATTAATTATT 1980
TGATGGAAAT CAGCTTTTTT GATATGTATT TTATAATGTA CAGCTCGTTG AGCTGCTATT 2040
TTCCTTATAT TAAGTGCCAT TAATACAAAA CCTAGCTCTC GTTTAACTTT ATTTATTCCT 2100
CGAACTGACA TTCGAGTGAA aCCCAAAATA GCCTTCATAA ATCCAAAAAC AGGCTCTACA 2160
TAAATTTTTC TATGACTATA GATTTTTTTC GTTTCTGGTT CAGAAAGCTT TTGaTTAATT 2220
TGGGCTTTAA TGTATTTCAA AGTAAAATTA CATGTTAATA CGTAGTATTA ATGGCGAGAC 2280
TCCTGAGGGA GCAGTGCCAG TCGAAGACAG GGGCCCCAAC ACAGAAGcTG ACATATAGTC 2340
AGCTTACAAC AATGTGCCGG TTGGGGTGGC TGAGACGGCA CCCTAGGAAG GGACCCGTCA 2400
TCAAAAATTC TATTTATAGA ATTTTACAGT AATGTGACAG ACGGGCAAAG CGAAgCCATT 2460
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CTTACTGCTG TTTTTTTAGG GATTTATGTC CCAGCCATTT TTGTATTCAT ATTTAAATTT 2580 

CGATAATTTT TCAGGAAGCA TTTTAATTTT ACTAATGAAG CAATATTTTT TAGATTAACA 264 0 

AAAATTAATA TTTACATTTT CTTAACAATT TTTTATGTAA CATTTACAGT TTCTAAAAAT 2700 

GAGGTTAATA ATTCAAGGTT AAGATAAAGA TGTAATCAAT ACAAATACTA TTTGTTGTTC 2760 

ATACAGGGAG GATATTTCAA TGAAAAAATG GCAATTTGTT GGTACTACAG CTTTAGGTGC 2820 

AACACTATTA TTAGGTGCTT GTGGTGGCGG TAATGGTGGC AGTGGTAATA GTGATTTAAA 2 8 80 

AGGGGAAGCT AAAGGTGATG GCTCATCAAC AGTAGCACCA ATTGTGGAGA AATTAAATGA 294 0 

AAAATGGGCT CAAGATCACT CGGATGCTAA AATCTCAGCA GGACAAGCTG GTACAGGTGC 3 000 

TGGTTTCCAA AAATTCATTG CAGGAGATAT CGACTTCGCT GATGCTTCTA GACCAATTAA 3060 

AGATGAAGAG AAGCAAAAAT TACAAGATAA GAATATCAAA TACAAAGAAT TCAAAATTGC 3120 

20 GCAAGATGGT GTAACGGTTG CTGTAAATAA AGAAAATGAT TTTGTAGATG AATTAGACAA 3180 

ACAGCAATTA AAAGCAATTT ATTCTGGAAA AGCTAAAACA TGGAAAGATG TTAATAGTAA 324 0 

ATGGCCAGAT AAAAAAATAA ATGCTGTATC ACCAAACTCA AGTCATGGTA CTTATGACTT 3 3 00 

25 CTTTGAAAAT GAAGTAATGA ATAAAGAAGA T ATT AAAG CA GAAAAAAATG CTGATACAAA 3360 

TGCTATCGTT TCTTCTGTAA CGAAAAACAA AGAGGGAATC GGATACTTTG GATATAACTT 342 0 

CTACGTACAA AATAAAGATA AATTAAAAGA AGTTAAAATC AAAGATGAAA ATGGTAAAGC 34 8 0 

AACAGAGCCT ACGAAAAAAA CAATTcAAGA TAACTCTTAT GCATTAAGTA GACCATTATT 354 0 

CATTTATGTA AATGAAAAAG CATTGAAAGA TAATAAAGTA ATGTCAGAAT TTATCAAATT 3 600 

CGTCTTAGAA GATAAAGGTA AAGCAGCTGA AGAAGCTGGA TATGTAGCAG CACCAGAGAA 366 0 

AACATACAAA TCACAATTAG ATGATTTAAA AG CATTT ATT GATAAAAATC AAAAATCAGA 3720 

CGACAAGAAA TCTGATGATA AAAAGTCTGA AGACAAAAAA TAATAAGACG CAATTTCAAA 3780 

TGTGTCTTGA AACATGATTT TGATGGTGAA TCATTATTTA GAGTACAAAG CTTGATTTAT 3840

CGAGACGCTG ATTTTGACAT TCAGTTAGTC TAcAAGCTTA TCAACTTAAA ATAGTGGTTC 3900 

ATCATTATTT TACAAATCTA ATTATTTTGG GAGTAATAGA AAGAGGTTTG ATTATGACTT 3960 

CATCTACTAA TGTTAAAGCT TTAATCGAAA AAAATAATAA TAAAAAAGGA AAGCATAATG 4020

ACAAAATTAT ACCAGTTATT TTAGCCGCAA TTTCAGCGAT TTCCATTTTA ACAACACTAG 4080

GTATATTAAT CACATTGCTT TTAGAAACCA TCACTTTTTT CACCAGAATT CCAATAACTG 4140

AATTTCTATT TTCTACTACT TGGAATCCTA CCGGTTCAGA CCCTAAGTTT GGTATCTGGG 4200

CATTGATAAT AGGGACTTTA AAAATCACAG TTATTGCGAC TATATTTGCA GTTCCAGTCG 4260 
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AACCGATATT 


AGAAATTTTA 


GCAGGAATAC 


CAACAATTGT 


GTTTGGTTTC 


TTTGCATTAA 


4380 




CCTTTGTTAC ACCAGTATTA AGATCTTTCA TACCAGGTCT TGGAGAGTTT AATGCTATAA 4440

GTCCCGGCTT AGTTGTCGGT ATTATGATTG TCCCTCTCAT CACAAGTTTG AGTGAGGaTG 4500

CAATGGCATC TGTACCAAAT AAAATTCGAG AAGGTGCCTA TGGACTTGGA GCAACTAAAT 4560

TAGAAGTAGC AACTAAAGTC GTACTTCCCG CAGCAACATC AGGTATTGTA GCTTCAATCG 4620

TTCTCGCGAT TTCAAGAGCA ATTGGAGAAA CGATGATTGT ATCATTAGCG GCAGGTAGTT 4680

CGCCAACAGC TTCATTAAGT TTAACAAGTT CGATTCAAAC AATGACTGGA TATATTGTTG 4740

AGATAGCGAC AGGTGATGCA ACATTTGGAT CAAATATTTA TTACAGTATT TATGCTGTAG 4800




GGTTCACACT 


ATTTATCTTT 


ACCTTAATCA 


TGAATTTACT 


TTCTCAGTGG 


ATTTCTAAGC 


4860 




GTTTTAGGGA 


GGAGTATTAA 


TATGGAAACG 


ACAGATAATA 


ATAGACAATC 


ACTCGTCGAT 


4920 


20 


CAACAACTTG 


TCCAAAAACA 


TTTATCATCC 


AGAACGGTTA 


AAAATAAAGT 


GTTCAAACTC 


4980 




ATATTTTTAG 


CATGTACATT 


ATTAGGACTT 


GTCGTACTTA 


TTGCGTTGTT 


AACTCAAACA 


5040 




TTGATTAAAG 


GGGTAAGTCA 


TTTAAATTTA 


CAGTTTTTCA 


CTAATTTTTC 


TTCTTCAACA 


5100 


25 


CCATCTATGG 


CTGGOGTTAA 


AGGCG CGTTA 


ATCGGTTCAC 


TTTGGTTAAT 


GTTAAGTATC 


5160 




ATTCCATTAT 


CAATCATCCT 


AGGAATAGGT 


ACAGCTATAT 


ACTTAGAAGA 


AT ATG CG AAA 


5220 




AACAACAAAT 


TTACTCAGTT 


TGTTAAAATC 


AGTATTTCCA 


ATTTAGCTGG 


TGTACCATCA 


5280 




GTTGTATTTG 


GGTTATTAGG 


TTATACTTTG 


TTCGTTGGTG 


GTGCAGGGAT 


TGAAGCCTTG 


5340 




AAAATGGGTA 


ACAGTATATT 


GGCAGCAGCG 


CTAACAATGA 


CCTTACTGAT 


ATTACCAATT 


5400 


35 


ATTATTGTTT 


CAAGTCAGGA 


AGCAATTAGA 


GCTGTACCTA 


ACTCAGTACG 


CGAACTTcTT 


5460 


ACGGCTTAGG 


TGCTAATAAA 


TGGCAAACGA 


TAAGACGTGT 


TGTCTTACCA 


GCAGCGTTAC 


5520 




CTGGTATTTT 


AACTGGATTC 


ATTTTGTCTC 


TTTCAAGAGCT 


ACTGGGAGAA ACAGCGCCAC 


~5580~ 


40 


TTGTGCTAAT 


CGGTATACCG 


ACTATATTAT 


TGGCAACACC 


TAGAAGTATA 


TTGGATCAAT 


5640 




TTTCAGCATT 


ACCTATCCAA 


ATATTTACTT 


GGGCGAAAAT 


GCCTCAAGAA GAATTCCAGA 


5700 




ATGTTGCATC 


GGCAGGCATT 


ATCGTTTTAC 


TAGTTATCTT 


AATCTTAATG 


AATGGCGTTG 


5760 


45 


CGATTATTTT 


ACGTAACAAA 


TTTAGTAAAA 


AATTCTAATT 


TAAACAATCA 


ATCTCATTTA 


5820 




TCTATTAAAA 


AGGGAGTTTT 


AAATATGGCG 


CAAACACTTG 


CACAAACTAA 


ACAAATATCT 


5880 




CAAAGTCATA 


CGTTTGATGT 


CTCACAAAGT 


CATCATAAAA 


CACCAGATGA 


TACAAACTCA 


5940 


SO 


CATTCTGTTA 


TATATTCAAC 


ACAAAATTTA 


GACTTATGGT 


ATGGCGAAAA 


TCATG CATT A 


6000 




CAAAATATTA 


ATTTAGATAT 


TTATGAAAAC 


CAAATTACTG 


CCATTATAGG 


TCCATCTGGT 


6060 






EP0 786 519 A2 



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AAAACAGCTG 


GTAAAATATT 


ATATCGAGAT 


CAAGACATTT 


TTGATCAAAA 


AT ATT CT AAA 


6180 


/»» TV TV /"» fk *v ^**T* TV /""» 

GAACAATTAC 


GTACAAATGT 


GGGCATGGTC 


TTTCAACAAC 


CTAATCCATT 


1 tLAftAAlUi 


6240 


ATATACGATA 


ATATTACTTA 


CGGTCCAAAG 


ATTCACGGTA 


TTAAAAATAA 


AAAAb 1 ILi 1 


63 00 


GATGAAATCG 


TTGAGAAATC 


ATTACGTGGC 


GCTGCAATTT 


GGGATGAATT 


AAAGG A 1 AGO 


6360 


TtGCACACAA 


ATGCATATAG 


TTTATCCGGT 


GGGCAACAAC 


AACGTGTTTG 


TATCGCG CGT 


6420 


TGTTTAGCAA 


TTGAACCTGA 


AG TCATTTTA 


ATGGATGAAC 


CGACATCAGC 


ATT AG AT C CA 


6480 


ATCT CAA CAT 


TAAGAGTAGA 


AGAGTTGGTT 


CAAGAACTAA 


AAGAAAAGTA 


TACAATTATT 


6 540 


ATGGTtACAC 


AT AATATG CA 


ACAAGCAGCT 


CGTGTATCAG 


ATAAAACTGC 


ATTTTTCTTA 


6600 


AATGGTTATG 


TCAATGAATA 


TGATGATACT 


GATAAAATTT 


TCTCTAACCC 


ATCAAACAAG 


6 660 


AAAACAGAAG 


ATTATATTTC 


AGGAAGGTTT 


GGTTGATATA 


TAATGGCAAT 


tv iv #**w*iv ^ iv ^*tv tv 

AATTAG A CAA 


6720 


CGATATCAGG 


AGCAACTTGA 


TGATTTAATA 


AAAGAATTAC 


GTCGGTTAGG 


TGCaAATGT C 


6780 


TATGTGAGTA 


TTGaAAATGG 


TATAAAAtCA 


TTAAGTATTG 


a CGATAGAGG 


c TTTGCACG A 


6840 


CAAACAGTTA 


AAAACGATAA 


ACATATCAAT 


CAATTAAATT 


ATGATATTAA 


TG AG CG AGTT 


6900 


ATCATGTTAA 


TTACAAAGCA 


ACAGCCCATT 


GCGAGTGATT 


TGCGTATGAT 


GATTTCTTCA 


6960 


TTAAAAATCG 


CCTCCGATTT 


AGAAAGAATA 


GGAGATAATG 


CCTCGAGTAT 


TGCCAATATT 


7020 


CGATTG CGTA 


CAAAGATTAC 


AGATGATTAT 


GTGTTAACCC 


GTTTAAAGAC 


AATGGGTAAA 


7080 


TTAGCTATGT 


T AATG TT AAA 


GGACTTAGAT 


CAAGCATTTA 


AAAAGAAAGA 


TACCGTATTA 


7140 


ATAAGAGAAA 


TAATTGAGCG 


TGATGAAGAT 


ATCGATGACT 


TATATAGTCA 


T ATT ATT AA C 


7200 


GCAACGTATC 


TTATTGATAA 


CGtCCATTTG 


TCGCTGCACA 


AGCT CATTT A 


/"^ f*% IV TV TV TV 

G CAG CAAG AC 


72 60 


ATTTAGAACG 


TATTGGTGAT 


CATATTATTA 


ACATCGCTGA 


AAGTGTTTAT 


TTTTATTTAA 


7320 


CAGGTACACA 


TTACGAACAA 


TAACTTAAAG 


TTATTACTAT 


AAAATCCCTT 


TV 7V «T*TV TV TA *T*7A 

ACGATAAATA 


73 80 


TATATTTCTA 


TTATTCATAA 


ACCCTCAAAA 


AAACCAAGAT 


TCTCACAATT 


Tv tv tv nv^nv*' t\ 

AGTAATGTGA 


744 0 


AAATCTTGGT 


TTATATTGTT 


CTACTATAAA 


TTGTCTCGCA 


TCTTAGTTAT 


TTGCTTGCTC 


vbOO 


AATTTCATCT 


GTTAATTTTT 


CAACTTCATC 


G ACT AAAT CA 


GAAATATATT 


GAATTGTAGA 


/SOU 


TTTAAGTGGC 


TGTTCTGTAG 


TAATGTCTAC 


ACCTGCAATG 


TTTG CAAGTT 


CGACAGGTGA 


Vfa^U 


TACACTACCA 


ccttttttca 


ATGTTTCTAA 


CCAAGCATCA 


ACAGCTGGTT 


GGCCTTCATT 


7680 


TTTAATCTTT 


TGAGAAACGA 


CAGTTCCGAT 


TGTTAAGCCA 


GCAGAATACG 


TATACGAATA 


7740 


TAATCCCATA 


TAGTAATGAG 


GTTGACGCAT 


CCATGTTAAT 


TCAGCACCCT 


CAGTCATGTC 


7800 


TACTGCATCT 


CCAAAAAATT 


GTTTATAAAC 


ATTTAGCATT 


ATTTCATTTA 


ATGTnCGGCG 


7860 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

TTTTTTCTTT TCTTCATTTG AAAATTGATC ATTCAGCAAT ATAAGCGTAT TTGTTAATGA 60

TTTAGGTGTT CCAATTTCAT AATCCCACCA ATTTAAGTTG GTATTCTTGC CAGTTGTTTT 120 

AGTAAAATTC TCACTTAATT CTTTTACTTT TTTATCTGGT TCTTTTCCAT ATGCATTTTT 180 

ATGCAGCCAC TCAAGGGCAT CTTTCACTTT CTTCTTATTT TCGTCAGTAT TTAAAGTGGT 240 

TTTAGGATTC CTCATCGCTT CTGCGATTTT CTCAATATTA CGATAGGTAC GAGTCATATG 300

AGAAGAATTA GTTTCAAGGG TTTCCGCTCC TGACCACAAG TATTTCCTAC CACTTTCAGT 360

TTTCATTTCC TTGAGTAAAT TCGTCGCCTC TTTCTCTGTA GCATCAAACT TCTTCTTCAT 420

ATCTGGATTA TTCTCATCAT ACTTATCATA ACCATAGTTA ACGTCCAGCC ATGTGTTCCT 480

CAATTTTTCA TAATCTGGCG TTTGAACATT CGTATCAGCC ACAGCGATTT GATGTTTATC 540

AACACTTCTG AATTCACCAC CATTCAAAGT AATCACACCA GCCATTAATA ACGTAATGGT 600 

GGATAATTTT TGCCATTTCT TTATTCTATA TGTCATTGaC ATGTCTCCTT TTTGTGTTGC 660 

GCGTGCGCAA TGAATATTAT GATTAAATAA TGATTCAATT TTTCAAAATT CGTTAACGTA 720 

TACAAATGAC TGTCTACTGT CAAACAATCC ACAAAGAATG TTGATGtCAT ATaAACAATC 780 

GATCACCCAA ATTTTCCG 798
(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

TACAGGTTTT ACTATAATGG ATGGTATTTT GGCTAAACGA CATTGGTTTA GTCTTCTTTT 60

TTTnACTTCC TAnATTTACA ATGGTATAAA TAATAATGCT ATATTTAGAA TGATGAGTAT 120 

ACTTACTGAA ACTAAATTAA AAGTGTCTGG TTCTTTACTA AAGATAGCTG CTATCCTTGC 180 
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AATACAAGTT CCAATGAGCG CAATTAAAAG TACTAACCCA ACGATGAAAC TCTGTTTGTC 3 00 

ACTTAACTCA AAGAAACTAT AGATAGGATA TTTTTTAATA ATCAAGCCAC CTAAAATCAT 360 

CCATAAAAAT ACGATAATTC CATAAGTCAC ATTTATAACA TACGTTATTT TTTGGTCACC 420 

AAATCGGACT AATGTATTTC GTAGAATCAG CATACCAATG ACAACACCTA AAATAACGAT 480

ACTAGCTATA TAAAGTAAAA ATGCAATTGT CACATCAAAT GTACCCAAAT CTAAAAACCT 540

AGGAATTAyA AyGACTGCTA AAATAAAAGC GAAGyACAAA GTAATATAkT TATACAAACC 600

GGTAGTAAGA CTTATCTCAG GTGATAATTG ATCAGCCATT GACTTAATCG GTGTATTAAT 660 

AATTGAACTT GTATCTTCGT TATTTTTTTC AGCCATAGTT AAATGATCTT CGAGCTCTTC 720 

CAATAACTCT TCTACTTCTG CTTCAGTCTT ACCTCTAAAT AACAATTCAA CACGTAATTT 780 

TTCTAAAAAA TCTTGAGATT GTTTACTTAA CATCGTTTTC CCCTCCAAAC AAGTTAATCA 840 

TCCCTTTATT CAAAACTTGC CATTTCGATT TAAATACTTT TAGTTCCTTT AAACCTGAAT 900 

CGGTAATCGT ATAGTATTTC CGCCTCGGGC CGCCATTACT AGATTTTTTT ATTGTCGTAT 960 

CAACGTATCC TTTTTTGTTT AAACGCATTA AAACTGGATA AATACTACCC TCACTTATCT 1020

CTGGAAACTC TTGATTCTTA AGTTTCGTCA TAATTTCATA TCCATACGTT TCGCCTTGGG 1080

CAATGAGACC TAATATCGCC CCATCTAAGA GACCTTTCAT AATCTGATCT GACACTGACA 1140

TTTTAATCAC CTACTATCTT ACATAATAAG ATAGTACATT GAGAACTTTT CGTCAACTAT 1200

CTTTTATTGT AAGGTAGTTG TTGTACACAT TCCTTAAATG ACTAACAACT TTGTTAATAG 1260

GGTAATACTT ACGGAAGTAT ATTTTATTTA TGGGGGAGGA ATTAATAATG ACTACAAAAA 1320 

CAGTATTTGA TGTCATTGAT ATGGGGTTAG GATATTTAGT AAATGTGTAT GATGCTTGGA 1380 

AAGTTGAAAA GGTACTTGAT GATTATCATA AGCCTTTTTC TAATACCATT CATTGGCAAT 1440 

TTGGtCATGT ATTAACAATT TTTGAATCGG CCTTAGCTGT TGCTGGTAAA GAGAATATTG 1500

ATTTAAATAT CTATAGACCT TTATTCGGAA ATGGTTCGTC TCCAGATGAA TGGAAGGATG 1560

AAGTACCGAG TATTGAAAGG ATTTTAGAAG GTCTCCAAAC TTTACCTGAA CGTGCACGAA 1620

ATCTAACTGA AGATGATTTA GCAATTGAAT TGAAACAGCC AATTGTCGGT TGTAATAACT 1680

TAGAAGAGTT ATTAGTATTA AATGCCATTC ACATCCCACT TCATGCTGGT AAAATTGAAG 1740

AGATGTCTCG TATATTAAAA AATTTAAAAT AAATATGTGC TTATTAACCG TTAACAACAC 1800 

GTTAACGGgT TTTTTATTTG TTTAAAAGGT CACTTTTTTG AATTTAATAA ACACCATCTA 1860 

TACCAGTTCT TCACCGATTC TCGAAAAATA ATTATATTAA TGATTTCGTT AATTTAATTT 1920 

TATATTTAAT TATTACTGTA CATCTTTTGT AGTTAGCTTT ATTCTTAAAT TGAAATATGT 1980 
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X X VrVV* X A 1 


\_L»x XTj'I'AGGT 


CTCCTTATTT 


GGGCACTTAC 


ACCTTTTAAA 


CCGGATGCTG 


2100 






ALtLZATGGTAT 


ATGTTCGCAA 


TATTCGTCGC 


GACAATCATT 


GCTTGTATTA 


2160 


5 


CACAACQ3AT 


uLLAAl luGL? 


G CCGTCTCTA 


TAATTGGATT 


TACAATCATG 


GTACTCGTTG 


2220 




lox X vj/"V 


*-A X uAAAALvj 


G CTGTCGCTG 


GTTTTGGTAA 


TAATAGCATT 


TGGTTAATTG 


2260 


10 




x X ILAI X X L.VJ 


AGAGGATTTG 


TGAAAACAGG 


TCTTGGTAGA 


CGTATCGCAC 


2340 


x x v-~r\ x x x v_\j x 


CAAATTATTT 


f' /"""P 7V TV TV TV TV TV ?\ 

TAAAAAAA 


CATTAGGATT 


AGCATATTCT 


ATCGTCGGTG 


2400 




x -f\vjuri XXX nn x 


IV- X AvjrL.ljL.L- X 


G CTACACCAA 


GTAATACCGC 


GCGTGCTGGT 


GGAATCATGT 


2460 


15 


X w\_L^Vr\X 1A1 




TCTGAATCAT 


TTGGTTCGAA 


ACCGAAAGAC 


GGATCAG CAC 


2520 






TGCATTTCTT 


GTTTTCACAG 


AATTCCAAGG 


TAATTTAATT 


ACTGCGGCTA 


2580 




1«1 XXI X AAL 


TGCAATGGCC 


GGTAACCCCC 


TTGCACAAAA 


TTTAGCATCT 


AG CACATCTA 


2640 


20 


A1\j X X L-AL-A X 


1 ACATGGATG 


AATTGGTTTC 


TAGCTGCTTT 


AGTTCCTGGA 


CTTGTTTCCT 


2700 




XAnX IXJl 1V7 J. 


ALL 111 1A1 x 


ATTTATAAAA 


TTTATCCACC 


AACTGTTAAA 


GAAACACCAA 


2760 




jt\ 1\i L* X /v\Ls/\Lj 


1 xXKKSCTGAA 


AATGAATTAG 


CGACTATGGG 


TAAAATCGCT 


TTAGCTGAAA 


2820 


25 


AATTTTATGAT 


TGGTATTTTT 


GTCGTTGCGT 


TAACACTATG 


GATTGTCGGA 


AGTTTCATTC 


2880 






AACTTTAA CG 


GCCTTTATTG 


CGCTAgcATT 


gTTATTATTG ACAGGCGTCT 


2940 






AGACATTTTA 


AACGAAACAG 


GTGCTTGGAA 


CACATTAGTA 


TGGTTCTCAG 


3000 


30 


TftTTRPTT'TT 


AATGGCCGAC 


CAATTAAACA 


AG CTTGG ATT 


TATTC CTTGG 


TTAAGTAAAT 


3060 




LLAl ±\»L-1AC 


AAGTCTTGGT 


GGCTTAAGCT 


GGCCTATAGT 


CCTGGTCATT 


TTAATATTGT 


3120 


35 


X v- inLl lvlA 


TTCACATTAC 


TTATTTGCAA 


GTTCTACAGC 


ACATATCAGT 


GCGATGTATG 


3180 




AGgCGTTGCC ATCGCAGCCG GTGCACCACC ATTATTCAGT GCATTAATGT 


3240 : 




X AkjKjrX 1 XJ- X_l_ 


CGGTAACCTA 


_TTAGCTTCAA 


.CAACACACTA -TAGTAGTGGT- 


CCAGCGCCGA 


3300 


40 


XXL. X AX 1L1L 


TTCAGGTTAC 


GTGACTCAAA 


AACGTTGGTG 


GACAATGAAC 


TTAATATTAG 


3360 




Ul 1 1 LO ILIA 


CTTTATTATC 


TGGATTGGTT 


TAGGATCACT 


TTGGATGAAA 


GTAATTGGTA 


3420 




XAX X J. XAAAA 


TATTTAAATT 


AGCGCTCGAA 


TCTCATTGAT 


TTGGGCGCTT 


TTTAATTTGT 


3480 


45 


AX X XAAAAiL 


AACCTTTGCT 


AAATCAAGAC 


tcclttttta 


AAATACGTTT 


ATCCTTTAAA 


3540 




TCATTGCGTG 


CTTCACTGAA 


AATTTGTATA 


AAGATTTAAG 


TCATTACGTA 


ACATCACATA 


3600 




AAATACATTT 


CTATACTATT 


CCGCTTCATT 


GATTAACATT 


ACGTATGCCC 


TCATAAATCA 


3660 


50 


TCATACAAAA 


AACACCTTCG 


TTTAAATTCA 


TTTTAATTGC 


GAATTCAACG 


AAAGTGCCTT 


3720 




A 11 1 CAT ATT 


TAATGTTTCA 


AATTTATACG 


TCTGTCACTG 


TTACTGCACA 


CATACCTCAG 


3780 
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TTATAGGGTT TTTGCGACCG GATGTTTCTT CAATTTAATG TATTGAGAAA GACTATATAA 3900 

CACAATACCT GTCCAAATAA ATATAAACGT AATTAATTGA TCTATACTAA AAGGCTCTTT 3960 

GAAAACAAAT ATGCCGAGTA CAAACATTAT TGTTGGTCCA ACGTATTGAA TAAATCCTAT 4020 

TAGCGAAAGT GGAATACGTT TTGCCCCGGC TGAGAATAGG ATTAGTGGTA TTGCCGTAAT 4080 

AGCACCAGAA AATAACAACC AAAATGATGA CATGTTCAAT CCAAATGACA TCTGATGTTG 4140

CTGCCATAAA TAAATAACGT ATATTAGTCC AGCAGGTGCG GTAACAATAC ATTCAATCGT 4200

AATACTGCTG ATGGCATCAA TATGTACTAC TTTTTTCAAT AATCCGTATG TACCAAAGGA 4260

TAACGCTAAT ATAATAGAGA CGATTGGGAA TTCTCCAATC TTGAGCGTCA TATATAATAC 4320

ACCGATGAAT GCGAATAAAA TGGCTAGCCA TTCAAATTTA TTGAATCTTT CTTTTAAAAA 4380

GATAAGTGCG AGCAAAATGC TAACAAGTGG ATTTATATAA TAACCTAAAC TTGTTTGTAG 4440

GACGTGACCG TTCGTTACAG CCCAAATAAA TGTACCCCAA TTTAATGTAA TGACATAGCC 4500

TGCTACGACA ATCGCTAATA GCTGAATGGG CTTGCCTAAC AATTGATTCA TATCTCGTTG 4560

AAATGCATTG CGTTGTTTTT GTCCAACCGC GAGTATGAAA ATCATGAATA TTGCTGAAAA 4620

TATAATACGA AAGGCTAAAA TTTCAAATGC GCCTATTGCA TCAACGAACT GCCAATATAT 4680

AGGTAGTATT CCCCACAGAA TGTATGCACT GAGTGCTAAA AATATGCCTT TTTTATACTC 4740

TGAATTCACC TTCAAACCTC CTTACTTTCC TAATTTTTAA TTTACTGCAT ACGCTCACTT 4800

GGTTATGCTA ATATAACGAT TTTACTAATA ATATTTCGAT AAAGATATCA TTTTGTTTAT 4860

ATTTCCCACA TTTATTCACC AACCACTAAA CAATATTAAT TTTATAAATA ATTCTGTACA 4920

AATCAGGGTA TATTGCCAGA AAGACTACCA TACAACATAA AGGATGGATA CAAATGACTT 4980

TACCTAAAAT TGGAAAGCCT GCAACACGCG CGCTAAATTC ACAAGGTATA TACACATTAG 5040

AAGCAGTATC ACAATATACG AAGTCATCTC TAATGGAGAT GCATGGCGTT GGTCCTAAAG 5100 

CTATATCAAT ATTGGAACAA GCTTTATTTC AG 5132 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

AAGTAAATTA TATTATGAAT TTGCCTGTCA ATTTCTTAAA GACATTCTTA CCGGAACTAA 60
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TAGAAGCAAT 


TAATAATGCy mAAGAAAAGA CAGCTAATAA TACCGGCTTA AAATTAATAT 


180 




TTGCAATTAA 


TTATGGTGGC 


AGAGCAGAAC 


TTGTTCATAG 


TATTAAAAAT 


ATGTTTGACG 


240 


5 


AGCTTCATCA 


ALAAGGlllA 


AATAGTGATA 


TCATAGATGA 


AACATATATA 


AACAATCATT 


300 




TAATGACAAA 


AGACTAT CCT 


GATCCAGAGT 


TGTTAATTCG 


TACTTCAGGA 


GAACAAAGAA 


360 


10 


TAAGTAATTT 


CTTGATTTGG 


CAAGTTTCGT 


ATAGTGAATT 


TATCTTTAAT 


CAAAAATTAT 


420 


GGCCTGACTT 


TGACGAAGAT 


GAATTAATTA 


AATGTATAAA 


AATTTATCAG 


TCACGTCAAA 


460 




GACGCTTTGG 


CGGATTGAGT 


GAGGAGTAGT 


ATAGTATGAA 


AGTTAGAACG 


CTGACAGCTA 


540 


15 


TTATTGCCTT 


AATCGTATTC 


TTGCCTATCT 


TGTTAAAAGG 


CGGCCTTGTG 


TTAATGATAT 


600 




TTGCTAATAT 


ATTAGCATTG 


ATTGCATTAA 


AAGAATTGTT 


GAATATGAAT 


ATGATTAAAT 


660 




TTGTTTCAGT 


TCCTGGTTTA 


ATTAGTGCAG 


TTGGTCTTAT 


CATCATTATG 


TTGCCACAAC 


720 


20 


ATGCAGGGCC 


ATGGGTACAA 


GTAATTCAAT 


TAAAAAGTTT 


AATTGCAATG 


AG CTTTATTG 


780 




TATTAAGTTA 


TACTGTCTTA 


TCTAAAAACA 


GATTTAGTTT 


TATGGATGCT 


GCATTTTGCT 


84 0 




TAATGTCTGT 


GG CTTATGT A 


GGCATTGGTT 


TTATGTTCTT 


TTATGAAACG 


AGATCAGAAG 


900 


25 


GATTACATTA 


CATATTATAT 


GCLTmTAA 


TTGTTTGGCT 


TACAGATACA 


GGGGCTTACT 


960 




TGTTTGGTAA 


AATGATGGGT 


AAACATAAGC 


TTTGGCCAGT 


AATAAGTCCG 


AATAAAACAA 


1020 


30 


TCGAAGGATT 


CATAGGTGGC 


TTGTTCTGTA 


GTTTGATAGT 


ACCACTTGCA 


ATGTTATATT 


1080 


TTGTAGATTT 


CAATATGAAT 


GTATGGATAT 


TACTTGGAGT 


GACATTGATT 


TTAAGTTTAT 


1140 




TTGGTCAATT 


AGGTGATTTA 


GTGGAATCAG 


GATTTAAGCG 


TCATTTCGGC 


GTTAAAGACT 


1200 


35 


CAGGTCGAAT 


ACTACCTGGA 


CACGGTGGTA 


TTTTAGACCG 


ATTTGACAGC 


TTTATGTTTG 


1260 


TGTTACCATT 


ATTAAATATT 


TTATTAATAC 


AATCTTAATG 


CTGAGAACAA 


ATCAATAAAC 


1320 




— GTAAAGAGGA GTTGCTGAGA TAATTTAATG AATCTCAGAA CTCCTTTTGA 


AAATTATACG ~ 




40 


CAATATTAAC 


TTTGAAAATT 


ATACGCAATA 


TTAACTTTGA 


AAATTAGACG 


TTATATTTTG 


1440 




TGATTTGTCA 


GTATCATATT 


ATAATG A CTT 


ATGTTACGTA 


TACAGCAATC 


ATTTTTAAAA 


1500 




TAAAAGAAAT 


TTATAAACAA 


TCGAGGTGTA 


GCGAGTGAGC 


TATTTAGTTA 


CAATAATTGC 


1560 


45 


ATTTATTATT 


GTTTTTGGTG 


TACTAGTAAC 


TGTTCATGAA 


TATGGCCATA 


TGTTTTTTGC 


1620 




G AAAAG AG CA 


GGCATTATGT 


GTCCAGAATT 


TGCGATCGGT 


ATGGGGCCAA 


AAATTTTTAG 


1680 




TTTTAGAAAA 


AATGAAACAC 


TTTACACTAT 


TAGGTTATTG 


CCTGTTGGTG 


GATATGTTCG 


1740 


SO 


TATGGCAGGA 


GATGGCTTAG 


AAGAGCCACC 


AGTCGAGCCC 


GGTATGAACG 


TTAAAATTAA 


1800 




ACTTAATGAA 


GAAAATGAAA 


TAACACATAT 


CATATTAGAT 


GATCATCATA 


AGTTTCAACA 


1860 



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G ATAATG A A A 
vj a x nn x vjaaa 


G A C* aTO ATTT 
v/iV^n X V*A XXX 


TAAAATTGCT 

l/uinni x ww x 


AGAAAGTCTT 

nu/vmu x v™ x x 


XCTTTGTTGA 


1980 




AAATGGTAGC 

#v«w» X vjvj x rvvjv 


X X *WJ X X V_/v\r\ 


TTGCTCCGAG 

X X VJV_ X WVV3AU 


AGACAGACAA 


TTTGCACATA 


AAAAGCCATG 


2040 


5 


GCCGAAATTT 

\J V* V.VJ>AAX\ X X X 


TT AA P ATT AT 

X X /vIVmA X X A> X 


TTGCGGG AC C 


GTTATTTAAC 

vj x x n xxx iwvp 


TTTATATTAG 

XXX X A X X 


CTTT AG T CCT 


2100 




AX X 1/11 X VJVJ 1 


V_ X X uL/ilni X 


ATPaAflRrAC 

A. X V^CLAVJVJ V-A.V_ 


GPrTAPRTPT 


ACTGTAGAAC 

/iv* x vj x nu/vi\< 


AAGTCGCAGA 


2160 


10 


TAAfTTATC PA 

X nny x A X V— V-A 




CAGGATTACA 


AAAAGGTGAT 


AAGATCGTCC 


AAATTGGCAA 


2220 


ATATAAAATA 

AX AX AAAAX A 


Tf^TG A A TTTfl 
X V. x vxAA 1 1 lu 


ATGATGTTGA 

a X VJ A X VJ X X VJ A 


T AAGG CGTT A 

XA^\VJVJV»VJ X lil 


GATAAAGTTA 


AAGATAATAA 


2280 




G A fG A GTG TT 
vjAv^vjAV, 1 Vj X X 


A A ATTTGA AP" 
nnn 1 X X UnnL 


GTG ATGGTA A 
VJ X VJ A X VJVJ X AA 


AArJVAARTPA 


GTTGAATTAA 

VJ X X \J-*m*± X X iVl 


CACCTAAAAA 


2340 


15 


C* aHYia A A A A 
VjAV- IvjAAAAA 


A A APTGAP^TA 
nnnL X VjAv. X A 


AA Vj X nnu 1 1L 


AGAG A CG AAG 


XA.XVJX X V» X VrVJ 


GATTCCAACC 


24 00 




AGTG A A 
Avjv*vjAvj 1 vjaA 


ft ft ft # -i-i"r 
v»A 1 Av—Av_ 1X1 


tta A appa AT 


iwlnl 1V,VJVJA 


TTT A A A AGCT 

XXX AAAAVJV. X 


TTTTAATCGfi 

X X X X **** X %«V7V3 


2460 




1 aLj 1 Av_ x iAl 


AX XXX IrtLMo 


P"*TGT A G T A GG 
Clul M.VJ X nuvi 


TATGTTGGP^T 

X A X VJ X X VJVJV. X 


AHTATATTTA 

^VVJ X *V X f\ XXX «v 


CGGGCGGATT 

V» VJVJVJ V* VJVJ** X X 


2520 


20 


L. X V— rvl X X\jAX 


ft THTTH ft ^TYI 


VJ X V-V-Vjvj X X VJVJ 


T ATTT ATP AT 

XAX X XAXV~AX 


AACGTCGACT 

AAV^VJ X v<VJAU X 


CAG TTGTTAA 

V^TAVJ X X VJ X X J±J*. 


2580 




Avjv-vjvjvj XnlL 


ft ^n^P ft ^^^^^P & ?^ 
AX X AuL X X AA 


TTCTTf nPlP 
X XvjvjI tnUnL 


TflPPITT A TTA 

X VJ V. VJ X X f\ X X A 


AGTGTAAACT 
rvvj x vj x A*v*v- x 


TAGGTATTAT 

X AVJVJ X *V X X *\ X 


2640 




/"^ * ft " i ~ i — i ■ ft ft ■ ■ — 1» 


V- V- X A X 1 v_v_ X vj 


papTa/iap<^ 

V~AV- X aVjAV-VJVj 


TGGTCGT A TT 

X VjVj X V.VJ X AX X 


TT ATTTGTT A 

X 1A1 X IVJl Xf\ 


TATATGAAGC 

X f\ X f \ X VJXU*IVJV. 


2700 


25 


GATTTTCAGA 


AAACv^Avjl 1A 


ata a a aaa^p 

AX AAAAAAvjv- 


nn a a a r* a a p**g 

VJVJAAAV-AAV-VJ 


ATTATTGOTA 

AX XAX XVJV^XA 


TTGGTGPGAT 

X X VJVJ X VJV.VJl X 


2760 




111 LAllij 1 1- 


Vj X X A X AA X Vj A 


TaTTapTaap 

X A X X AVj X nnL 


GTGG A A TG AT 
Vj X VJVJ AA X VjA X 


ATTCG A CG AT 

AX lV.V3nV.VJAl 


ATTTPTTATA 

AX X X V» X XAX A 


2820 




All XAvjvjAvjvj 


n Tft ft ft Tft ATT 
AXAAA1 AA1 X 


A TP! A aCP A AT 
A X VJAAvjv-AA X 


ppl A AGTTTT 
v_ Lnnnu X X X X 


TATAPPAAPH 

X A X nv. V-AAV-VJ 


ATGCGTG A CG 

f \ X VJVvVJ X VJ1^V^V3 


2880 


30 


1 Vj V- V_A 1 v_AvjA 


Ab LAunnb V~A 


fas RGTP1TP 


GTTTATTATT 
Vj X X XAX X A X X 


GAAATCGGGT 

VjAAAi X V»VJVJVJ X 


TTGATAAAAC 

x x un x rwwiv 


2940 




AAAGTACAAG 


TGGGATTTAT 


AGTTATTTAC 


CGCTAGCAAC 


ACGTGTGTTA 


AATAATATTA 


3000 


35 


CTGCAATTGT 


GCGACAAGAA 


ATGGAACGTA 


TCGATTCTGT 


TGAAATTTTA 


ATG CCAGCGT 


3060 


TACAACAAGC 


TGAATTATGG 


GAAGAATCAG 


GACGTTGGGG 


TGCATATGGC 


CCAGAATTAA 


3120 




TGCGTTTACA AG AT AG a CAT GGAAgACAAT TTgCATTAGG 


TCCaACACAT 


GAAGAATTAG 


3180 


40 


TTACATCAAT 


AGTAAGAAAT 


GAATTGAAAT 


CATACAAACA 


ATTACCGATG 


A CATT ATT C C 


3240 


aAATTCAATC 


TAAATTCCGT 


GATGAAAAGA GACCACGTTT TGGTTTAyTC 


GTGGGCGTGA 

VJ X VJVJ\JVi.V7 X >J-*W 


3300 




ATTTATTATG 


AAAGATG CAT 


ATTCATTCCA 


TGCTGACGAG 


GCATCATTAG 


ATCAAACGTA 


3360 


45 


TCAAGATATG 


TATCAAGCGT 


ATAGC CGTAT 


TTTTGAGAGA 


GTTGGCATTA 


ACGCAAGACC 


3420 




AGTAGTTGCA 


GATTCAGGTG 


CTATAGGCGG 


TAGCCATaCA 


CATGAATTTA 


TGG CATT AAG 


3480 




TGCTATCGGT 


GAGGATACAA 


TCGTTTACAG 


TAAAGAAAGT 


GATTATGCTG 


CTAACATCGA 


3540 


SO 


AAAAGCAGAA 


GTCGTTTACG 


Axe CAaATcA 


TaAGCATACT 


ACTGTGCAAC 


CTTTAGAAAA 


3600 




AATTGAAACA 


CCAAATGTTA 


AGACTG CGCA 


AGAATTGGCA 


GACTTCTTAG 


GTAGACCAGT 


3660 



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v7\-.vj x unj\«v_/\ 1 




ATGACATTAA 


ATT AAAAT CT 


TATTTCGGCA 


CAGATAATAT 


3780 




TGAATTARPA 


AUAt_AAGA<_G 


AAATTGTTAA 


TTTAGTTGGT 


GCAAATCCTG 


GTTCACTAGG 


3840 


5 


Tf (TY2T*A ITT 


V? A x AAAljAAA 


*^^ft ft ft * i'| m I Ml > j\ 

TCAAAATTTA 


^n^^^ TV TV n^TV TV tr 

TGCAGATAAT 


I HI 1 H 1 WM 1 WV m •« ^« 

TTTGTGCAAG 


ATTTAAATAA 


3900 




a A. X />VJ X x w x v~ 




ft ft ft qy/^'Wr' ft 

AAGA fGGTTA 


TCACTTAATT 


AATGTAAATG 


TAGGTAGAGA 


3960 




Sip X X \vrV%VvW X X 


VjA X VjAAX A 1 vj 


G CGATTTCCG 


TTTTATTTTA 


TV TV r*%f~* TV TV TV 

GAAGGCGAAA 


■m ^ ii|U|k m m Amn 

AGTTAAGTGA 


4020 


10 


X «\J X X ^VkVJ\J^« 




x 1 V»V- xGAAGG 


TV WyTV» TV TV OMN I'l 

TATTGAAGTT 


GGT CAAGTAT 


TCAAATTGGG 


4080 




TACTAAGTliT 


Tr^ftiiaTpa a 

1 LAUnn 1 LnA 


TGAATGLiAC 


ATT CTTAGAT 


TV TV TV TV ^>^T TV TV 

AACCAAGGAA 


AAG CTCAATC 


4140 


15 


X X lAnl InXVj 




/-» ft TV 'I'lV^PK ft T> 

UAATTGGAAT 


TTCTAGAACG 


CTAAGTG CGA 


TTGTTGAACA 


4200 


A A A Tf* A PR A T 


«A 1 AAITjVjAA 


TTGTTTGGCC 


fmV W TV fTT^ TV ^flH 1 % 

TAAAT CAGTT 


ACTCCGTTTG 


ATTTACATTT 


4260 




nrt 1 x 1U1 A 1 1 


ft A *TV"»f l, m ft A /""• TV 

Ml Ct-IAAGA 


AAGATG AT CA 


ACGAGAACTA 


GCAGATGCAC 


TATATGCTGA 


4320 


20 


nl 1 XAAXAl* X 


AAAl 1 xGATG 


tTV • 1 »• 1 V T» T* TV /"^✓^ TV 

TGTTGTACGA 


PtVI TV m/t/ T«1S^1 TV /** 

TGATCGTCAG 


GAACGTGCAG 


GTGTTAAATT 


4380 






GATTTAATTG 


M l || ■ 1 Tv ^rs"T iv rwn 

GTTTACCACT 


GCGAATTGTT 


GTTGGTAAAC 


GTGCATCGGA 


4440 




A^II/^TTA p P r TY"''"P A 
AlnjJ.nl 1 vj» 1 A 


bAAGlTAAAG 


AACGTTTAAC 


AGGTGATAGC 


GAAGAAGTTC 


ACATTGATGA 


4500 


25 


PTTH A*TY"2 A IT 


GTCATTACAA 


lv (Tik TV Tv rn TV nw^ tv 

ATAAATATGA 


TAACTTAAAA 


TAATTAAGAT 


CGAATGAATT 


4560 




A x AAlaAls 1 ALj 


bAAAAAbCTG 


TV TV A /"I TV TV TV m^wn 

AAAGAAATCT 


GATGCTTATG 


TCCTGCTCTT 


ATTArnTO 


4620 




A X A 1 MA luA 1 


T 7V 'IT^V^ A TV^ TV 

1A1 1CUATGA 


TV A TV TV fl^u^ TV / Mll^i 

AAAATGACTG 


TV TV ^ TV A TV #TTTV >**<T^ 

AAGACATAGT 


ATAATTAAAG 


ATAAATTTGT 


4680 


30 


ttt a a a a t a 


T A A *TV* IV ' T w 1 ' TV 

1 AA 1 C»A i TAG 


tv tv tv ^nTV n^Tv tv 

C CAAAT AT AA 


AG CATTT AAT 


TTTCTATCAT 


TACTATGCTC 


4740 




af a*FA ttTPT A 
■r\v_r\ x nn X v. X A 


AA1A1 l\aTTC 


/1» TV /""» TV /*l/*V»l»Ti TV 

GAACACGTAA 


TV TV ^•TTTV TV nwwn/l 

AAGTAATTTC 


TATTTAAGGT 


GGTAATTGTC 


4800 




TTW^f* A A Tf2 A 
X X v7V3l_AA X IxA 


uAbAbLAALA 


AAAATTTAAA 


GTGCTTGCTG 


ATCAAATTAA 


AATTTCAAAT 


4860 


35 


f AATVAfSATY"; 


\- lbAAAl 111 


Tv A Tv mm^iTi /^✓nrr^ 

AAATTCAGGT 


TV TV 4 Wllrf 1 TV ^TV 

GAACTGACAC 


GTATAGATGT 


TTCTAACAAA 


4920 




AACAf^AAOAT 


VjG\J AA 111 CA 


TATTACATTA 


Tv tv tv nwn/^m 

C CACAATTCT_ 


-TAGCTCATGA- 


AGATTATTTA - - 


-4.9-B0- - 


40 


M" 1 'ft n^HT* 1 ' A W J > ft fv 


AibLAAlAGA 


/^l TV\ TV /—» TV / T*l MIMII 

GCAAGAGTTT 


AAAGATATCG 


C CAACGTTAC 


ATGTCG'ri"l w r 


5040 


^VV—wVj X AA^»AA 


» TT*^ f* ft *"*■/"• ft ft 

A 1 (jvjv^A^VjAA 


ft ft ft TV~» ft TV 

I LAAGATCAA 


tv fTT/ a i/ n iTv tv mm TV 

CATGGAATTA 


TV TV fTT TV <■■!■■»« ■ vy^ 

AATACTTTGG 


GCACTGTATT 


5100 




rja nr a a a c* a 


V- 1 1 1AIL1 LL 


ft IV TA TV *"»f TT»I» TV TV TV 

AAAAGTTAAA 


/-^fT^S** TV TV rrVTV^ TV 

GGTCAATTGA 


AACAGAAAAA 


GCTTATTATG 


5160 


45 


w TC m \Y2f2 A A A T\/~" 
X V- IVtVaAAAAVj 


*Pft n* i'tv ft t\ t\ /""h 
TATTAAAAG T 


TV TV *T\rf*** / ««T»TV m^l W 

AATGGTATCA 


AATGACATTG 


AACGTAATCA 


TTTTGATAAG 


5220 




GCATGTAATG 


GAAGTCTTAT 


CAAAGCGTTT 


AGAAATTGTG 


GTTTTGATAT 


CGATAAAATC 


5280 




ATATTCGAAA 


CAAATGATAA 


TG AT CAAGAA 


CAAAACTTAG 


CTTCTTTAGA AgCACaTATT 


5340 


50 


CAAGAAGAAG 


ACGAACAAAG 


TGCACGATTG 


GCAACAGAGA 


AACTTGAAAA 


AATGAAAGCT 


5400 




GAAAAAGCGA 


AACAACAAGA 


TAACAACGAA 


AGTGCTGTCG 


ATAAGTGTCA AATTGGTAAG 


5460 



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GCAATAGAGG 


GTGTCATTTT 


TGATATAAAC 


TTAAAAGAAC 


TTAAAAGTGG 


TCGCCATATC 


5580 


GTAGAAATTA 


AAGTGACTGA 


CTATACGGAC 


TCTTTAGTTT 


TAAAAATGTT 


TACTCGTAAA 


5640 


AACAAAGATG 


ATTTAGAACA 


TTTTAAAGCG 


CTAAGTGTTG 


GTAAATGGGT 


TAGGGCTCAA 


5700 


GGTCGTATTG 


AAGAAGATAC 


ATTTATTAGA 


GATTTAGTTA 


TGATGATGTC 


TGATATTGAA 


5760 


GAGATTAAAA 


AAGCGACAAA 


AAAAGATAAG 


GCTGAAGAAA 


AGCGTGTAGA 


ATTCCACTTG 


5820 


CATACTGCAA 


TGAGCCAAAT 


GGATGGTATA 


CCCAATATTG 


GTGCGTATGT 


TAAACAGGCA 


5880 


GCAGACTGGG 


GACATCCAGC 


CATTG CGGTT 


ACAGACCATA 


ATGTTGTGCA 


AGCATTTCCA 


5940 


GATG CTCACG 


CAGCAGCGGA 


AAAACATGGC 


ATTAAAATGA 


TATACGGTAT 


GGAAGGTATG 


6000 


TTAGTTGATG 


ATGGTGTTCC 


GATTGCATAC 


AAACCACAAG 


ATGTCGTATT 


AAAAGATGCT 


6060 


ACTTATGTTG 


TGTTCGACGT 


TGAGACAACT 


GGTTTATCAA 


ATCAGTATGA 


TAAAATCATC 


6120 


GAG CTTGCAG 


CTGTGAAAGT 


TCATAACGGT 


GAAATCATCG 


ATAAGTTTGA 


AAGGTTTAGT 


6180 


AATCCGCATG 


AACGATTATC 


GGAAACGATT 


ATCAATTTGA 


CGCATATTAC 


TGATGATATG 


6240 


TTAGTAGATG 


CCCCTGAGAT 


TGAAGAAGTA 


CTTACAGAGT 


TTAAAGAATG 


GGTTGGCGAT 


6300 


GCGATATTCG 


TAGCGCATAA 


TGCTTCGTTT 


GATATGGGCT 


TCATCGATAC 


GGGATATGAA 


6360 


CGTCTTGGGT 


TTGGACCATC 


AACGAATGGT 


GTTATCGATA 


CTTTAGAATT 


ATCTCGTACG 


6420 


ATTAATACTG 


AATATGGTAA 


ACATGGTTTG 


AATTTCTTGG 


CTAAAAAATA 


TGGCGTAGAA 


6480 


TTAACGCAAC 


ATCACCGTGC 


CATTTATGAT 


ACAGAAGCAA 


CAGCTTACAT 


TTTCATAAAA 


6540 


ATGGTTCAAC 


AAATGAAAGA 


ATTAGGCGTA 


TTAAATCATA 


ACGAAATCAA 


CAAAAAACTC 


6600 


AGTAATGAAG 


ATGCATATAA 


ACGTGCAAGA 


CCTAGTCATG 


TCACATTAAT 


TGTACAAAAC 


6660 


CAACAAGGTC 


TTAAAAATCT 


ATTTAAAATT 


GTAAGTGCAT 


CATTGGTGAA 


GTATTTCTAC 


6720 


CGTAGACCTC 


GAATTCCACG 


TTCATTGTTA 


GATGAATATC 


GTGAGGGATT 


ATTGGTAGGT 


6780 


ACAGCGTGTG 


ATGAAGGTGA 


ATTATTTACG 


GCAGTTATGC 


AGAAGGACCA 


GAGTCAAGTT 


6840 


GAAAAAATTG 


CCAAATATTA 


TGATTTTATT 


GAAATTCAAC 


CACCGGCACT 


TTATCAAGAT 


6900 


TTAATTGATA 


GAGAGCTTAT 


TAGAGATACT 


GAAACATTAC 


ATGAAATTTA 


TCAACGTTTA 


6960 


ATACATGCAG 


GTGACACAGC 


GGGTATACCT 


GTTATTGCGA 


CAGGAAATGC 


ACACTATTTG 


7020 


TTTGAACATG 


ATGGTATCGC 


ACGTAAAATT 


TTAATAGCAT 


CACAACCCGG 


CAATCCACTT 


7080 


AATCGCTCAA 


CTTTACCGGA 


AGCACATTTT 


AGAACTACAG 


ATGAAATGTT 


AAACGAGTTT 


7140 


CATTTTTTAG 


GTGAAGAAAA 


AGCGCATGAA 


ATTGTTGTGA 


AAAATACAAA 


CGAATTAGCA 


7200 


GATCGAATTG 


AACGTGTTGT 


TCCTATTAAA 


GATGAATTAT 


ACACACCGCG 


TATGGAAGGT 


7260 
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CTGCCTCAAA T CGTAATTG A TCGATTAGAA AAAGAATTAA AAAGTATTAT CGGTAATGGA 73 BO 

TTTGCGGTAA TTTACTTAAT TTCGCAACGT TTAGTTAAAA AATCATTAGA TGATGGATAC 7440 
TTAGTTGGTT CCCGTGGTTC AGTAGGTTCT AGTTTTGTAG CGACAATGAC TGAGATTACT 7500

GAAGTAAACC CGTTACCGCC ACACTATATT TGTCCGAACT GTAAAACGAG TGAATTTTTC 7560

AATGATGGTT CAGTAGGATC AGGATTTGAT TTACCTGATA AGACGTGTGA AACTTGTGGA 7620

GCGCCACTTA TTAAAGAAGG ACAAGATATT CCGTTTGAAA CATTTTTAGG ATTTAAGGGA 7680

GATAAAGTTC CTGATATCGA CTTAAACTTT AGTGGTGAAT ATCAACCGAA TGCCCATAAC 7740

TACACAAAAG TATTATTTGG TGAGGATAAA GTATTCCGTG CAGGTACAAT TGGTACTGTT 7800

GCTGAAAAGA CTGCTTTTGG TTATGTTAAA GGTTATTTGA ATGATCAAGG TATCCACAAA 7860

AGAGGTGCTG AAATAGATCG ACTCGTTAAA GGATGTACAG GTGTTAAACG TACAACTGGA 7920

CAGCATCCAG GGGGTATTAT TGTAGTACCT GATTACATGG ATATTTATGA TTTTACGCCG 7980

ATACAATATC CTGCCGATGA TCAAAATTCA GCATGGATGA CGACACATTT TGATTTCCAT 8040

TCTATTCATG ATAATGTATT AAAACTTGAT ATACTTGGAC ACGATGATCC AACAATGATT 8100

CGTATGCTTC AAGATTTATC AGGAATTGAT CCAAAAACAA TACCTGTAGA TGATAAAGAA 8160

GTTATGCAGA TATTTAGTAC ACCTGAAAGT TTGGGTGTTA CTGAAGATGA AATTTTATGT 8220

AAAACAGGTA CATTTGGGGT ACCAGAATTC GGTACAGGAT TCGTGCGTCA AATGTTAGAA 8280

GATACAAAGC CAACAACATT TTCTGAATTA GTTCAAATCT CAGGATTATC TCATGGTACA 8340

GATGTGTGGT TAGGCAATGC TCAAGAATTA ATTAAAACCG GTATATGTGA TTTATCAAGT 8400

GTAATTGGTT GTCGTGATGA TATCATGGTT TATTTAATGT ATGCTGGTTT AGAACCATCA 8460

ATGGCTTTTA AAATAATGGA GTCAGTACGT AAAGGTAAAG GTTTAACTGA AGAAATGATT 8520

GAAAGGATGA AAGAAAATGA AGTGGGAGAT TGGTATTTAG ATTGATGTCT TAAAATTAAG 8580

TACATGTTCC CTAAAGCCCA TGCAGCAGCA TACGTTTTAA TGGCAGTACG TATCGCATAT 8640

TTCAAAGTAC ATCATCCACT TTATTACTAT GCATCTTACT TTACAATTCG TGCGTCAGAC 8700

TTTGATTTAA TCACGATGAT TAAAGATAAA ACAAGCATTC GAAATACTGT AAAAGACATG 8760

TATTCTCGCT ATATGGATCT AGGTAAAAAA GAAAAAGACG TATTAACAGT CTTGGAAATT 8820

ATGAATGAAA TGGCGCATCG AGGTTATCGA ATGCAACCGA TTAGTTTAGA AAAGAGTCAG 8880

GCGTTCGAAT TTATCATTGA AGGCGATACA CTTATTCCGC CGTTCATATC AGTGCCTGGG 8940

CTTGGCGAAA ACGTTGCGAA ACGAATTGTT GAAGCTCGTG ACGATGGCCC ATTTTTATCA 9000

AAAGAAGATT TAAACAAAAA AGCTGGATTA TCTCAGAAAA TTATTGAGTA TTTAGATGAG 9060 
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GAAATAATCA 


AGGTATTTAT 


TTAATGCGTA 


TGGCGTAGTC 


AAAGAAATAC 


AAAATTGTTG 


9180 




CTGGACACAA AATTATGCCC GTATTTCTTT TCAATGTCTT ACGAGTCTAT TCAAATGTAA 9240

TGGTGAAATA AAGGAACAAA CTTTTACAAG AATCTCTGAT TAATAGTGAA GTCATTTGTT 9300

TCAAGCATAA ACTTATGCTA TAATTAAGTT GCTTAAAAAT TAGTGAACTC AGGCAGAAGA 9360

GTGGGAGATT CCCGCTCTTT TCTATTTGCC AAAAAGGGAG GCCTGTATGA GTAAAATTAC 9420

AGAACAAGTA GAAGTGATTG TTAAACCAAT TATGGAAGAC TTGAATTTTG AACTTGTAGA 9480

CGTTGAATAT GTCAAAGAGG GTAGAGATCA TTTTCTTAGA ATCTCTATTG ATAAAGAAGG 9540

TGGCGTAGAT TTAAATGATT GTACGCTAGC TTCTGAAAAA ATAAGTGAAG CTATGGATGC 9600

AAATGATCCT ATTCCTGAAA TGTATTATTT AGACGTAGCG TCACCTGGTG CAGAACGTCC 9660

AATTAAAAAA GAACAAGATT TCCAAAATGC AATAACTAAA CCTGTATTTG TTTCTTTATA 9720

TGTACCAATT GAAGGTGAAA AGGAATGGTT AGGCATTTTA CAAGAAGTCA ATAATGAAAC 9780

AATTGTAGTA CAAGTTAAAA TCAAAGCAAG AACGAAAGAT ATAGAGATAC CGAGAGACAA 9840

AATAGCAAAA GCACGTCACG CAGTTATGAT TTAACGTGAT GAGGAGGAAA AAACGTGTCA 9900

AGTAATGAAT TATTATTAGC TACTGAGTAT TTAGAAAAAG AAAAGAAGAT TCCTAGAGCA 9960

GTATTAATTG ATGCTATTGA AGCAGCTTTA ATTACTGCAT ACAAAAAGAA TTATGATAGT 10020

GCAAGAAATG TCCGTGTGGA ATTAAATATG GATCAAGGTA CTTTCAAAGT TATCGCTCGT 10080

AAAGATGTTG TTGAAGAAGT ATTTGACGAC AGAGATGAAG TGGATTTAAG TACAGCGCTT 10140

GTTAAAAACC CTGCATATGA AATTGGTGAT ATATACGAAG AAGATGTAAC ACCTAAAGAT 10200

TTTGGTCGTG TAGGTGCTCA AGCAGCGAAA CAAGCAGTAA TGCAACGTCT TCGTGATGCT 10260

GAACGTGAAA TTTTATTTGA AGAATTTATA GACAAAGAAG AAGACATACT TACTGGAATT 10320

ATTGACCGTG TTGACCATCG TTATGTATAT GTGAATTTAG GTCGTATCGA AGCTGTTTTA 10380

TCTGAAGCAG AAAGAAGTCC TAACGAAAAA TATATTCCTA ACGAACGTAT CAAAGTATAT 10440

GTTAACAAAG TGGAACAAAC GACAAAAGGT CCTCAAATCT ATGTTTCTCG TAGCCATCCA 10500

GGTTTATTAA AACGTTTATT TGAACAAGAA GTTCCAGAAA TTTACGATGG TACTGTAATT 10560

GTTAAATCAG TAGCACGTGA AGCTGGCGAT CGCTCTAAAA TTAGTGTCTT CTCTGAAAAC 10620

AATGATATAG ATGCTGTTGG TGCATGTGTT GGTGCTAAAG GCGCACGTGT TGAAGCTGTT 10680

GTTGAAGAGC TAGGTGGTGA AAAAATCGAC ATCGTTCAAT GGAATGAAGA TCCAAAAGTA 10740

TTTGTAAAAA ATGCTTTAAG CCCTTCTCAA GTTTTAGAAG TTATTGTTGA TGAAACAAAT 10800

CAATCTACAG TAGTTGTTGT TCCTGATTAT CAATTGTCAT TAGCGATTGG TAAAAGAGGA 10860
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GATGCGCGTG 


AAGCGGGTAT 


CTATCCAGTA 


GTTGAAGCTG 


AAAAAGTAAC 


TGAAGAAGAT 


10980 




GTTGCTTTAG AAGATGCTGA CACAACAGAA TCAACCGAAG AGGTAAATGA TGTTTCAGTT 11040

GAAACAAATG TAGAGAAAGA ATCTGAATAA TAGGTTGGAG TGAAGTATCT ATGAAAAAGA 11100




AAAAAATTCC 




1AI 1 LIU 


CAAATGAAAT 


GCATCCCAAA 


AAAGATATGA 


11160 


10 


TTCGTGTTGT 




/"^Tv ftfVS/V^ft ft ft 


TCTTTGCGGA 


TGTTACTGGA 


AAGAAACAAG 


11220 


GCCGTGGCGC 


TAT" TV TV"" ■ *• i»»p/*p 

rt Art. J. V» 1 1 


ft ft ftOft*TY* , *I* f T*/^» 


CTATGGTTGA 


AAAAGCACAA 


CaAAAAGAAA 


11280 




TTTTAGAAAA 


AiA J. 1 1 1 AAA 


/•^ /"» T\ TV ■*!• TV IV ftO 


AGCAATTGGA 


TCCTGTTTAC 


AAAGAAATTA 


11340 


15 


TTAGATTAAT 




O TV f ft TV" 1 /"*/*' TV JV 

L>ACjA 1 LCCAA 


AATGAGTATA 


GATCAAATAT 


TAAACTTTTT 


11400 


AGGATTAGCA 




/ "l*T\ TA JVO*^T\ IV TV 

UTAAAvjTAAA 


AACAGGTGAA 


TCAGTCATTG 


TTAATGAGAT 


11460 




TAAAAAAGGA 


ft hTTTPTi »pp 

AA1 X 1 VjAAvjL. 


TCGTTATTGT 


TGCAAATGAT 


GCGTCTGATA 


ATACAGCTAA 


11520 


20 


ATTAATTACA 


/"^ ft fft R TV »TV^*T»7V 

<j A 1 AAATGTA 


AGAGTTACAA 


AGTTCCATTC 


AGAAAGTTTG 


GAAATCGAAA 


11580 




TGAATTGGGA 


ATAG CACTTG 


O TV 7V TV 1V^^**V^TV 

GAAAAGGTGA 


GCGTGTTAAT 


GTAGGGATTA 


CTGACCCAGG 


11640 




CTTTGCTAAA 


AAGTTGCTAT 


CAATGATAGA 


TGAATATCAT 


AAGGAGTGAT 


TATATGAGTA 


11700 


25 


AACAAAGAAT 


*PT*ft. ft ft T*ft T« 

1 lAt-ljAATAT 


TV TV TV TV TV t* 

GCGAAAGAAT 


TAAATCTAAA 


GAGTAAAGAG 


ATTATAGATG 


11760 




AGTTAAAAAG 


CATGAATATT 


GAGGTTTCAA 


AT CAT ATGCA 


AGCTTTGGAA 


GATGACCAAA 


11820 




TTAAAGCATT 


AGATAAAAAG 


• imi ft ft j\ TV TV TV ^* 

TlCAAAAAAG 


AACAAAAGAA 


OGACAATAAA 


CAAAGCACTC 


11880 


30 


AAAATAATCA 


CCAAAAATCA 


«v tv /"* tv tv*tv*tv tv tv 
AACAATCAAA 


ACCAAAATAA 


AGGGCmACAA 


AAAGATAACA 


11940 




AAAAGAATCm 


ACAACAAAAT 


TV TV T 1 TV TV ft TV 

AATAAAGGCA 


ACAAAGGCAA 


TAAAAAGAAT 


AATAGAAATa 


12000 




ATAAGAAAAA 


TAACAAGAAT 


ft TVf'TV TV TV ^/"^ JV /"» 

AATAAAC lac 


AAAATCAACC 


AGCTGCTCCA 


AAAGAAATAC 


12060 


35 


CATCAAAAGT 


GACATATCAA 


Oft TV /^!/*"T» TV »l*«t» TV 

CjAACK»TATTA 


CAGTAGGCGA 


ATTTGCGGAT 


AAATTAAATG 


12120 





_ _ TTGAATCATC AGAAATTATC 


ft ft ft ft ft TV T»T* TV 

AAAAAAJ. TXAT- 


-TCTTACrrGG- 


TATTGTTGCT - 


AATATCAATC - 


- 12-180- 


AATCATTAAA TCAAGAAACA ATCGAATTAA TTGCCGATGA TTATGGCGTT GAGGTTGAAG 12240

AAGAAGTTGT GATTAATGAA GAAGACTTAT CAATCTATTT CGAAGACGAA AAAGATGATC 12300

CAGAGGCAAT TGAGAGACCA GCAGTTGTAA CAATTATGGG ACATGTTGAC CATGGTAAAA 12360

CGACTTTATT AGATTCAATT CGTCATACAA AAGTTACAGC AGGTGAAGCA GGCGGAATCA 12420

CTCAACATAT TGGTGCATAT CAAATTGAAA ACGATGGCAA AAAAATCACT TTCTTAGATA 12480

CACCGGGACA TGCTGCATTT ACAACGATGC GTGCGCGTGG TGCaCAAGTA ACAGATATTA 12540

CTATTTTAGT AGTAGCAGCT GACGATGGTG TTATGCCACA AACAATTGAA GCAATTAACC 12600

ATGCTAAAGA AGCAgAAGTA CCAATTATTG TTGCAGTAAA TAAAATTGAT AAACCAACTT 12660
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GCGGCGAAAC AATTtTCGTc CACTTTCTGC ATTAAGTGGT GATGGTATCG ACGATTTATT 1278 0 

AGAAATGATA GGATTAGTTG CAGAAGTTCA AGAACTTAAA GCAAATCCTA AAAACCGTGC 12 840 

TGTTGGTACA GTTATCGAAG CTGAATTAGA TAAATCACGT GGTCCTTCTG CATCATTATT 12900 

AGTACAAAAC GGTACATTAA ATGTTGGTGA TGCGATTGTA GTTGGTAATA CTTACGGCCG 1296 0 

TATCCGTGCA ATGGTTAATG ACTTAGGTCA AAGAATCAAA ACGGCTGGTC CATCAACGCC 13 020 

TGTTGAAATT ACAGGTATTA ATGATGTGCC ACAAGCTGGG GATCGCTTTG TTGTATTTAG 1308 0 

TGATGAAAAA CAAGCTCGTC GTATTGGTGA ATCAAGACAC GAAGCTAGCA TTATACAACA 1314 0 

ACGTCAAGAA AGTAAAAATG TTTCATTAGA TAACCTGTTT GAACAAATGA AACAAGGTGA 13200 

AATGAAAGAT TTAAACGTTA TTATTAAAGG TGATGTTCAA GGTTCTGTTG AAGCTTTAGC 13260 

TGCATCATTA ATGAAAATTG ATGTTGAAGG CGTAAATGTT CGTATCATTC ATACAGCGGT 1332 0 

TGGTG CAATT AATGAGTCAG ACGTGACACT TGCTAATGCC TCAAATGGTA TTATCATTGG 133 8 0 

TTTCAATGTT CGTCCAGACA GTGGTGCAAA ACGTGCTGCA GAAGCTGAAA ATGTTGATAT 1344 0 

GCGTTTACAC AGAGTTATTT ATAATGTTAT CGAAGAAATT GAATCAGCGA TGAAAGGTTT 13500 

ACTTGATCCA GAATTTGAAG AACAAGTTAT CGGACAAGCT GAAGTTCGTC AAACATTCAA 13560

AGTTTCTAAA GTTGGTACTA TTGCTGGATG TTATGTTACT GAAGGTAAAA TTACGCGAAA 13 620 

TGCTGGTGTA CGTATTATTC GTGATGGTAT TGTTCAATAT GAAGGCGAAT TAGATACACT 13 680 

TAAACGTTTC AAAGATGATG CTAAGGAAGT TGCAAAAGGT TATGAATGTG GTATTACAAy 13740

TGAAAACTAC AATGACCTTA AAGAAGGCGA TGTTATCGAA GCATTTGAAA TGGTTGAAAT 13 800 

TAAGCGTTAA TTAAATAAAT TACAAGCTAA AAGTATAGTT AAGATTGATA TGCTCCCTAT 13 860 

AAATATTGCA CTTTTTAAGT GTCTACTTTA TAGGGAGCAT ATTTGATACT AGCTTTTGGT 13 920 

TTTFTATTAG AATAGATTAC CTATTAAAAG TTACGTTATA TGGACATGAT TTTGTATAAA 13980 

ATTTTGTGGT GGCCTAGAAT GATTTTTAAT GACAAAATAT AATGTCGACT ATTATTGGAA 14 04 0 

AATTTTCTGT TGaAATGCCT ATCTTACGGC AAACTTTATT TGATTTTATA GGCTTAATTT 14100 

ATTAAAATAA CGTGTGAGCT AAAATAATTG TTTAAGCATT GTTACACTAA AAAATGCAAA 14160 

TAACAATTGA ACTTAAAGAT AAAGAGGTGA CAAGAATGAG CAGTATGAGA GCAGAGCGTG 14220 

TTGGTGAACA AATGAAGAAG GAATTAATGG ATATCATCAA CAATAAAGTC AAAGATCCTC 142 80 

GAGTTGGTTT TATTACAATT ACAGATGTTG TTTTAACAAA TGATTTATCG CAGG CTAAAG 14 340 

TATTTTTAAC TGTATTAGGT AACGATAAAG AAGTAGAAAA TACATTTAAA GCACTTGATA 14400 

AAGCAAAAGG CTTCATTAAG TCTGAATTAG GTTCTAGAAT GCGATTACGT ATTATGCCGG 144 60 
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AAGATTTACA CAAACAAGAT AGATAATTTA GTGTTAGGTA TCTGGAAAAT GTTTGATAAT 14 580 

TTCTTAATAT CGGTATATTA ACATTAAACA GTTAATACAT AGATGTGTAG AAATAGTTAA 14 640 

CATTTTCCAG TTTTTTTATG AATAAATTTA GTTGATACGC TATTAAAATA TATTTTAAAA 14700 

AAGAAGGTGA CTATATGTAT AATGGGATAT TACCAGTATA TAAAGAGCGC GGTTTAACAA 14 760 

GTCATGACGT TGTATTCAAA TTGCGTAAAA TATTAAAAAC TAAAAAAATA GGTCACACGG 14 820 

GTACGCTTGA TCCCGAAGTT GCAGGCGTGT TACCGGTATG TATAGGTAAT GCAACGAGAG 14 880 

TTAGTGATTA TGTTATGGAT ATGGGCAAAG CTTATGAAGC AACTGTATCG ATAGGAAGAA 14 94 0 

GTACAACGAC TGAAGATCAA ACGGGTGATA CATTGGAAAC AAAAGGTGTA CACTCAGCAG 15000 

ATTTTAATAA GGACGATATT GACCGATTGT TAGAAAGTTT TAAAGGTATC ATTGAACAAA 15060 

TTCCGCCGAT GTACT CATCC GTCAAAGTAA ATGGTAAAAA ATTATATGAA TATGCGCGTA 15120 

ATAATGAAAC AGTTGAAAGA CCAAAGCGTA AAGT t AATAT TAAAGACATT GGGCGTATAT 15180 

CTGAATTAGA TTTTAAAGAA AATGAGTGTC ATTTTAAAAT ACGCGTCATC TGTGGTAAAG 15240 

GTACATATAT TAGAACGCTA GCAACTGATA TTGGTGTGAA ATTAGGCTTT CCGGCACATA 15300 

TGTCGAAATT AACACGAATC GAGTCTGGTG GATTTGTGTT GAAAGATAGC CTTACATTAG 15360

AACAAATAAA AGAACTTCAT GAGCAGGATT CATTGCAAAA TAAATTGTTT CCTTTAGAAT 15420 

ATGGATTAAA GGGTTTGCCA AGCATTAAAA TTAAAGATTC GCACATAAAA AAACGTATTT 15480 

TAAATGGGCA GAAATTTAAT AAAAATGAAT TTGATAACAA AATTAAAGAC CAAATTGTAT 15540

TTATTGATGA TGATTCAGAA AAAGTATTAG CAATTTATAT GGTACACCCT ACAAAAGAAT 1560 0 

CAGAAATTAA ACCTAAAAAA GTCTTTAATT AAAGGAGATA GAATTTATGA AAGTCATAGA 15660 

AGtGACACAT CCTATACAAT CTAAACAGTA TATTACAGAG GATGTTGCAA TGGCATTCGG 15720 

ATTTTTCGAT GGCATGCATA_AAGGTCATGA-CAAAGTCTTT-G^ 15780- 

TGAGGCACGC AGTTTAAAAA AAGCGGTGAT GACATTTGAT CCGCATCCGT CTGTCGTGTT 1584 0 

GAATCCTAAA AGAAAACGAA CAACGTATTT AACGCCACTT TCAGATAAAA TCGAAAAAAT 15900 

TAG CCAACAT GATATTGATT ATTGTATAGT GGTTAATTTT TCATCTAGGT TTGCTAATGT 1596 0 

GAGCGTAGAA GATTTTGTTG AAAATTATAT AATTAAAAAT AATGTAAAAG AAGTCATTGC 16020 

TGGTTTTGAT TTTACTTTTG GTAAATTTGG AAAAGGTAAT ATGACTGTAC TTCAAGAATA 1608 0 

TGATGCGTTT AATACGACAA TTGTGAGTAA ACAAGAAATT GAAAATGAAA AAATTTCTAC 1614 0 

AACTTCTATT CGTCAAGATT TAATCAATGG TGAGTTGCAA AAAGCGAATG ATGCTTTAGG 16200

CTATATATAT TCTATTAAAG kCACTGTAGT GCAAGGTGAA AAAAGGGGAA GAACTATTGG 16260 
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TGCTGTTAGT ATTGAAATCG GCACTGAAAA TAAATTATAT CGAGGGGTAG CTAACATAGG 163 80 

TGTAAAGCCA ACATTTCATG ATCCTAACAA AGCAGAAGTT GTCATCGAAG TGAATATCTT 16440 

TGACTTTGAG GATAATATTT ATGGTGAACG AGTGACCGTG AATTGGCATC ATTTCTTACG 16500 

TCCTGAGATT AAATTTGATG GTATCGACCC ATTAGTTAAA CAAATGAACG ATGATAAATC 16 560 

GCGTGCTAAA TATTTATTAG CAGTTGATTT TGGTGATGAA GTAGCTTATA ATATCTAGAG 16620 

TTGCGTATAG tTATATAAAC AATCTATACC ACACCTTTTT CTTAGTAGGT CGAATCTCCA 16 680 

ACGCCTAACT CGGATTAAGG AGTATTCAAA CATTTTAAGG AGGAAATTGA TTATGGCAAT 16740 

TTCACAAGAA CGTAAAAACG AAATCATTAA AGAATACCGT GTACACGAAA CTGATACTGG 16 800 

TTCACCAGAA GTACAAATCG CTGTACTTAC TGCAGAAATC AACG CaGTAA ACGAACACTT 16860 

ACGTACACAC AAAAAAGACC ACCATTCACG TCGTGGATTA TTAAAAATGG TAGGTCGTCG 16920 

TAGcATTTaT TAAACTACTT ACGTaGTAAA GATATTCAAC GTTACCGTGA ATTAATTAAA 16980 

TCACTTGGTA TCCGTCGTTA ATCTTAATAT AACGTCTTTG AGGTTGGGGC ATATTTATGT 17 040 

TCCAACCTTA ATTTATATTA AAAAAGCTTT TTACAAATAT TAACATTTAT TATATGTTAA 17100 

GCTAATATTG AGTGAATAAT AAGGTTACAA TGAGATAAAG ATGATATAAG TACACCTAGA 17160

GTAATAATCA AGATATTAAA AATAAAGTAT GTTTTTTTAA AAAATATAAC TTATATTTAT 17220 

ACTGATAAGG GTGGGACGAT AAGTCTATTT TGTAAATAAT AGATGGATAT CCCGCTCTCT 17280 

TTTTTTCCAA TTCAATATTT TATAACTAAT ATTAAAATAC GATAATAAAT GATATGATAT 17340

AACTATTAGA TTCAAGAGAG GAGATTTATA ATGTCTCAAG AAAAGAAAGT TTTTAAAACT 17400 

GAATGGGCAG GAAGATCTTT AACGATTGAA ACAGGGCAAT TAGCTAAACA AGCAAATGGC 17460 

GCTGTATTGG TTCGTTATGG AGATACAGTC GTGTTATCGA CGGCAACTGC ATCAAAAGAA 17520 

CCTCGTGATG GAGATTTCTT CCCATTAACA GTGAACTATG AAGAAAAAAT GTACGCTGCG 17580 

GGTAAAATTC CTGGTGGATT TAAAAAGAGA GAAGGACGTC CTGGTGACGA TGCAACATTA 17640 

ACTGCGCGAT TAATTGATAG ACCAATTAGA CCTTTATTCC CTAAAGGATA TAAGCATGAT 17700 

GTT CAAATT A TGAACATGGT ATTAAGTGCA GATCCTGATT GTTCACCACA AATGGCTGCA 17760 

ATGATTGGTT CATCTATGGC GCTTAGTGTG TCGGATATTC CATTCCAAGG GCCAATCGCC 17820 

GGTGTAAATG TGGGTTATAT TGACGGTAAA TATATCATTA ACCCAACAGT AGAAGAAAAA 17880 

GAAGTTTCTC GTTTAGACCT TGAAGTAGCT GGTCATAAAG ATG CGGTAAA CATGGTAGAG 17940 

GCAGGCGCTA GTGAGATTAC TGAACAAGAA ATGTTAGAGG CGATTTTCTT TGGTGATGAA 18000

GAGATTCAAC GTTTAGTTGA TTTCCAACAA CAAATCGTCG ACCACATTCA ACCTGTTAAA 18060 
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GAAGAAAAAG 


GACTTAAAGA 


AACAGTTTTA 


ACATTTGATA 


AACAACAACG 


AGATGaAAAT 


18180 




CTTGATAACT 


TAAAAGAAGA AATCGTCAAT GAATTTATCG ATGAAGAAGA TCCAGAGAAT 


18240 


5 


GAaTTACTTA 


tt a a a n a a n t 


'I" t* Ta T*/^ Tv TV m*?^ 
1 1A1 v>U\An 


nrv P TV TV TV ^V' TV TV *T* 

TTAAATGAAT 


ryn TV /"^»| TV TV TV i - ^ TV 

TAGTGAAAGA 


TV /" » TV TV / **|M| »/^^« TV 

AGAAGTTCGA 


18300 




CGTTTAATTG 


rnr ATY2 A A 21 21 


A A *W A ^ A TA 
A/4.X IAuALvJA 


GACGGCCGTA 


TV TV ^T^/*^ TV T*i^» TV 

AACCTGATGA 


AATCCGTCCA 


18360 


10 


TTAGATTCTG 




"I " 1 ~ 1 ' TA f^i " 1* FA TV 

X X 1ACv,1A<jA 


TV ^V"*/"* TV TP/ *«!*» 1 ' 

ACGCATGGI x 


CAGGTCTATT 


*Ti TV TV /VTV^ 

TACACGTGGT 


18420 


CAGACTCAAG 


v-rtw XXX LaVj 1 


X X XAAVJA1 1A 


GGTG CTTTAG 


/->» i^i/— » TV TV T»/*^ TV 

G CG ATT AT CA 


ATT AA TTG AT 


18480 




GGTTTAGGAC 


fTY2A Af2A A/"2A 


A A 2k Ta RP^TTP 
AAAAALsAi. 1 (_ 


TV TV*' P* TV fP TV i|Mi» 

ATGCATCATT 


TV PTV TV /•Mtwp/T'P 

ACAACTTCCC 


GAATTTTTCA 


18540 


IS 


GTAGGTGAAA 


w X vj^j X X 


ta rv* iv^ r*r+ r*r+ tv 


GGTCGTCGTG 


Tv tv tv mnvi/i TV TV 

AAATTGGACA 


TGGTGCGTTA 


18600 




GGTGAAAGAG 


PHTTIk A A A*T*A 


TATTATTCCT 


TV ipiV / OTV 'i'w 1 yi 

GATACTGCTG 


TA »1 WIWH/t/l/N Tv »TITV 

ATTTCCCATA 


TACAATTCGT 


18660 




ATTGTAAGTG 


nuuiALl 1 vjA 


TA™/**T\ TV HIVVP 

AxUAAATGGT 


TV T*r* TV T»/'*« 1 IP 

TCATCATCTC 


TV TV /"» /-^/"IfT*/^ TA TV m 

AAGCGTCAAT 


TTGTGGATCA 


18720 


on 


ACATTAG CAT 




/"'»/"*»/■■» /"^/ w /*^/™V^ 

<j<jG cgtaccg 


ATTAAAGCAC 


CAGTTGCTGG 


TATTGCTATG 


18780 




GGCCTTGTTA 


TA f*f**T*f* Tv TV /"* TV 

<JAwL* 1 (jAAGA 


T»TV P* /mi »7\ T* TV /"*/""< 

TAGCTATACG 


ATTTTAACTG 


ATATCCAAGG 


TATGGAAGAT 


18840 


25 


GCATTAGGTG 


TV Ti TV *TV~»r~» TV /T^» |» 


TAAAGTCGCT 


GGTACTAAAG 


AAGGTATTAC 


AGCAATC CAA 


18900 


ATGGATATTA 


A TA TV •PTV"' TV /^/*^/~> 

AAA1 JAjACVjCi 


TTTAACGCGT 


GAAATTATCG 


AAGAGGCTCT 


AG AACAAG CG 


18960 




AGACGTGGTC 


val 1 xAIjAAAT 


AATGAATCAT 


ATGTTACAAA 


CAATTGATCA 


ACCACGTACT 


19020 


30 


GAATTAAGTG 


C X 1 AVAjt-CrCL. 


TA TV IV TV/'<i|*'l</ , **T*7A 

AAAAGTTGTA 


TA /™*T*Tl TV"* TV /*! TV TV 

ACTATGACAA 


TTAAACCAGA 


TAAGATTAGA 


19080 




GATGTTATCG 


e* ta pv TPPT/^r 1 
v»AL V_ X vjCj 1 viG 


*PTV TV TV TV TV TV TV »T **« > 

TAAAAAAATT 


TV TV f^f^ TV TV TV f| >*T» "J\ 

AACGAAATTA 


mm/1 m/^ « « « y« 

TTGATGAAAC 


AGGTGTTAAA 


19140 




TTAGATATTG 


TA BPft TW"* 

AAv— AAvjA 1 viQj 


1 ACTATCTTT 


ATTGGTGCTG 


TTG AT CAAG C 


TATGATAAAT 


19200 


35 


CGTGCTCGTG 


A A TA Tf* T\ TTV TV 
AAA X v-A X 1\jA 


^/"*TV TV TV n^n*Tv /** TV 

bGAAATTACA 


CGTGAAGCGG 


TV TV /^T^ TV /^/"ifr*/"* TV 

AAGTAGGTCA 


AACTTATCAA 


19260 





_ GCCACTGTTA. 


A TA ^^'"^ f 1 • TV TTV^ TV 


TV TV TV TV T*TV 0/""»/~»'T» 

AAAAIACGGT 


-GCGTTTGTAG- 


-GCCTATTCCC- 


AGGTAAAG AT 


19320 




GCGTTGCTTC 


ALAi 1 XI— Av_A 


AA1 1 xi_AAAA 


TV TV T*TV 7V TV TTV 

AATAGAATTG 


TV TV TV TV TV /"i»T»/~'«/^ TV 

AAAAAGTGGA 


TV TV fflMfflTi IV 

AGATGTATTA 


19380 


40 


AAAATCGGTG 


ALALAA1 1V»A 


TA^*P r PTA TV /"* Tv T*T» 

Atj 1 1 AAvjAl X 


jv /'iipp » » TV TTV^ 

ACTGAAATTG 


TV *P TV TV TV TV TA F+f* 

ATAAACAAGG 


TCX5AGTAAAT 


19440 




GCTTCACATA 


n ta nr^ ta hpt j\ 


TA /** TV TV TV TV T» TV T* 

AGAATAATAT 


null 7\ TV TV f^ T T*/~* TV «T> 

1TAAAGTCAT 


TV fTV^ *1V y^y^i ft ^« TV TV 

ATGACGACAA 


TGTATCGTCA 


19500 


TGTGATTTTT TTATGCCACT TTTTACGAAG TGACCCGTTT TGAATTTGTT GTATTGAACA 19560

TTTTAAAACG CTTTATTATT TTGTGTGCAA CTGTTAATTA TCCTGTATGT ATAGTGATTA 19620

ATAGTGTACA TCAAGTGTTT TTTAACTTAT AATGAATAGT GAGTTTATAT ATGGACGGGT 19680

AACAAATTTA GGAGGTAAGA TTTTGAGTTT AATAAAGAAA AAGAATAAAG ATATTCGCAT 19740

TATACCATTA GGCGGTGTTG GCGAAATTGC TAAAAATATG TATATCGTTG AAGTAGACGA 19800

TGAAATGTTT ATGTTAGATG CTGGACTTAT GTTTCCAGAA GACGAAATGC TAGGTATTGA 19860
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w 






CCTTACACAC GGACATGAGC ACGCGATTGG TGCAGTGAGT TATGTTTTAG AACAATTAGA 19980

TGCACCAGTA TATGGATCTA AATTGACAAT AGCGTTAATT AAAGAAAATA TGAAAGCCCG 20040

TAATATTGAT AAAAAAGTTC GCTACTATAC AGTTAATAAT GATTCAATTA TGAGATTCAA 20100

AAACGTGAAT ATTAGTTTCT TTAATACGAC ACACAGTATT CCTGATAGTT TAGGTGTTTG 20160

TATTCACACT TCATATGGTG CCATTGTGTA TACAGGTGAA TTTAAGTTTG ACCAAAGTTT 20220

ACATGGACAT TATGCACCAG ATATTAAACG TATGGCAGAG ATTGGTGAAG AAGGCGTATT 20280

TGTCTTAATC AGTGATTCTA CTGAGGCAGA GAAACCTGGA TATAATACTC CGGAAAATGT 20340

GATTGAACAT CATATGTATG ATGCTTTTGC AAAAGTGCGA GGTCGCTTGA TAGTTTCATG 20400

TTATGCTTCG AACTTTATAC GTATTCAGCA AGTTTTAAAT ATTGCTAGCA AGCTAAATCG 20460

TAAAGTGTCA TTTTTAGGAA GATCACTTGA AAGTTCATTT AATATTGCTC GTAAAATGGG 20520

GTATTTCGAC ATTCCTAAAG ATTTGCTAAT TCCTATAACA GAAGTTGATA ATTATCCTAA 20580

AAATGAAGTG ATAATTATAG CTACTGGTAT GCAAGGAGAA CCTGTAGAAG CCTTAAGTCA 20640

AATGGCGCAA CATAAGCATA AAATTATGAA TATCGAAGAA GGCGATTCTG TATTTTTAGC 20700

AATTACGGCT TCTGCTAATA TGGAAGTTAT CATTGCGAAT AcATTAAATG AGCtTgTtAC 20760

GnCTGGCGCA CATATTATTC CAAATAACAA AAAGATTCAT GCTTCAAGTC ATGGTTGCAT 20820

GGAAGAATTA AAAATGATGA TTAATATTAT GAAACCTGAA TACTTTATTC CTGTACAAGG 20880

TGAATTTAAA ATGCAGATAG CACATGCGAA GCTAGCAGCT GAAGCAGGTG TTGCACCAGA 20940

AAAGATTTTC CTTGTGGAAA AAGGAGATGT CATTAATTAC AACGGTAAAG ATATGATATT 21000

AAATGAAAAG GTAAATTCAG GAAATATTTT AATAGATGGC ATTGGTATTG GGGATGTAGG 21060

AAATATCGTG TTGAGAGACC GTCATCTTTT AGCAGAAGAT GGTATCTTTA TTGCTGTTGT 21120

AACGTTAGAT CCTAAAAATA GACGTATAGC TGCGGGACCT GAAATTCAAT CTCGTGGGTT 21180

TGTATATGTA CGTGAAAGTG AAGACTTATT ACGTGAAGCA GAAGAGAAAG TACGTGAAAT 21240

AGTAGAGGCT GGTTTACAAG AAAAACGCAT AGAATGGTCT GAAATTAAAC AAAATATGCG 21300

TGATCAAATT AGTAAACTAT TATTCGAAAG TACAAAACGT CGTCCTATGA TTATTCCAGT 21360

AATTTCTGAA ATTTAATCAA AAAGTCATTA ACATAAAAGA GGTCAGAACA AGTCACTGAA 21420

ATATAATGGT TGTCATGGAC AATTTACTTA TATTTTATGA TAGTCAATTG AAGGGGTAAC 21480

GATTAATCTG TTATCTTAAG TAAATTGATA CATAGATGAT ATTGTTCTAA CCTCTTTCAT 21540

CGTCTGTTTG GACTACATAT TCTAAACATC AAATAGGAAA TTATATATAA TAACGTCGTT 21600

TTAACTAAGG CAACATAAGG AGGTGCGTCA ATTGGCACAA GCAAAAAAGA AATCGACAGC 21660
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GATACGTTAT GTCATAGCTA TTTTAGTAGT 


TGTATTAATG 


GTGTTGGGTG 


TTTTCCAATT 


21780 




AGGAATAATA GGTCGTCTAA TTGACAGCTT CTTTAATTAT TTATTTGGGT ACAGTAGATA 21840

TTTAACATAT ATTTTAGTAC TCTTAGCAAC TGGTTTTATT ACATACTCTA AACGTATTCC 21900

TAmaACTAGA CGAACGGCTG GTTCGATTGT ATTGCAAATT GCATTGCTAT TTGTATCACA 21960

GTTAGTTTTT CATTTTAATA GTGGTATCAA AGCTGAAAGA GAACCTGTAC TTTCTTATGT 22020

GTATCAGTCA TACCAACACA GTCATTTCCC AAATTTTGGT GGCGGTGTAT TAGGCTTTTA 22080

TTTATTAGAG TTAAGCGTAC CTTTAATTTC ATTATTTGGT GTATGTATTA TTACTATTTT 22140

ATTATTATGC TCAAGTGTTA TTTTATTAAC AAACCATCAA CATCGTGAAG TTGCAAAAGT 22200

TGCACTGGAA AATATAAAAG CTTGGTTTGG TTCATTTAAT GAA 22243




(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5510 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:


TTATTAATnA TTAATATTTT TATTTTTAAA AATAAAGCGA GGAGCTATCA ATGGAACAAA 60

TTACTTCTGC ACAAAATAAT AGAATTAAAC AAGCGAACAA GCTAAAAmAG AAACGTGAGA 120

GGGATAAAAC TGGATTAGCT TTAATTGAAG GTGTGCATTT AATTGAAGAA GCTTATCAAA 180

GTGGAATTGT AATTACACAA TTATTTGCAA TTGAACCGGC AAGATTAGAT CAGCAAATTA 240

wCGCATACGC GCAAGAAGTT TTTgAAATAA ACATGAAAGT TGCTGAATCT TTATCAGGTA 300

cagtSacacc ACAAGGGTTT TTCGCAATCA TTGAGAAGCC GCATTATGAT ATTTCTAAAG 360

CACAACAAGT ATTGCTCATC GATCGTGTTC AAGATCCTGG AAATTTAGGC ACATTAATTA 420

GAACTGCGGA TGCTGCTGGA ATGGATGCTG TAATAATGGA GAAGGGTACG ACAGATCCTT 480

ATCAAGATAA AGTGTTGCGA GCGAGTCAAG GTAGTGTTTT CCATTTGCCA GTTATGACAC 540

AAGATCTCGA TACGTTTATT ACTCAATTTA ATGGTCCTGT TTATGGTACA GCACTTGAAA 600

ACGCAGTGgC ATACAAAGAA GTTACTTCAA GTGATTCTTT TGCATTACTA TTAGGTAATG 660

AGGGAGAAGG TGTTAATCCT GAATTATTAG CACATACTAC ACAAAATTTA ATCATACCTA 720

TTTATGGTAA AGCTGAAAGT TTAAATGTAG CGATTGCAGG TAGTATTTTA CTTTATCATT 780

TGAAAGGTTG ACCGTGTTGA AAGTTTTCCG ATATAATTAT AATTAATTGT TTAACAGAAC 840
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ATAAATAATT GTTTTAGGGA GAATAATCGT GACTGCAAGT TATTCCAATT ATTTAAAGTC 96 0 

TTTTCACCTT TTTGGTTACT TAAAGAGATT TAAGTCGGAA AGACAATCCG TTATCAATAT 1020 

TAAACAAGTG TATGCTTAGG CATAAATTTG GGTGGTACCA CGG AAATGAC TTTCGTCCCT 108 0 

TATTTTTTAA GAGGATGAAA GTCTTTTTTT AGTTAAACAA CAAATATGAT AAATAGAAAA 1140 

TGAATAGTTC GAATAGGGAG GTCAGTGACA TATGTCTGAA CAACAAACAA TGTCAGAGTT 1200 

AAAACAACAA GCGCTTGTAG ATATTAATGA AGCAAATGAT GAACGTGCAC TGCAAGAAGT 1260 

T AAAG TG AAA TACTTAGGTA AAAAAGGGTC AGTTAGCGGA CTAATGAAAT TGATGAAGGA 1320 

TTTGCCGAAT GAAGATAAAC CTGCGTTTGG TCAAAAAGTG AATGAATTGC GTCAAACAAT 1380 

TCAAAATGAA TTAGATGAAA GACAACAGAT GTTAGTTAAA GAAAAATTAA ATAAGCcAAT 1440 

TGGcTGAAGA AACAATTGAT GTATCATTAC CAGGTCGTCA TATTGAAATC GGTTCAAAGC 1500 

ATCCATTAAC ACGTACAATA GAAGAAATTG AAGACTTATT CTTAGGTTTA GGTTATGAAA 1560

TTGTGAATGG ATATGAAGTT GAACAAGATC ATTATAACTT CGAAATGCTG AATTT AC CT A 1620 

AATCA CACCC TGCACGTGAT ATGCAAGATA GTTTCTATAT TACGGATGAA ATTTTATTAC 168 0 

GTACGCATAC ATCACCAGTG CAGGCACGTa CGATGGAATC ACGTCATGGT CAAGGTCCAG 1740

TTAAAATTAT TTGCCCTGGT AAAGTGTATC GTCGTGACTC TGATGATGCG ACACATAGTC 1800 

ATCAATTTAC ACAAATCGAA GGATTAGTTG TTGATAAAAA CGTTAAAATG AGTGATTTGA 186 0 

AAGGTACTTT AGAATTGTTA GCTAAGAAAT TATTTGGTGC TGATCGTGAA ATTCGTTTAC 192 0 

GTCCAAGTTA CTTCCCATTC ACTGAACCTT CTGTAGAAGT TGATGTGTCA TGTTTTAAAT 1980 

GTAAAGGAAA AGGTTGTAAT GTGTGTAAAC ACACAGGATG GATTGAAATT TTAGGTGCTG 2040 

GAATGGTACA TCCTAATGTA TTAGAAATGG CTGGTTTTGA TTCTTCAGAG TACTCTGGAT 2100 

TTGCATTTGG TATGGGACCA GACCGTATTG CAATGTTGAA ATATGGTATA GAAGATATTC 216 0 

GTCATTTCTA TACTAATGAT GTGAGATTTT TAGATCAATT TAAAGCGGTA GAAGATAGAG 2220

GTGACATGTA ATGTTGATAT CAAATGAATG GTTGAAAGAA TATGTAACAA TCGATGATTC 2280 

TGTAAGTAAT TTGG CAGAAC GTATTACGCG CACAGGTATT GAAGTGGATG ATTTAATTGA 234 0 

CTACACAAAA GATATCAAAA ATTTAGTTGT CGGCTTCGTT AAGTCAAAAG AGAAACATCC 2400

TGATGCTGAT AAATTAAATG TTTGCCAAGT TGATATCGGA GAAGACGAAC CTGTACAAAT 2460 

CGTTTGTGGT GCACCGAACG TTGaTGCAGG ACAATATGTC ATTGTTGCTA AAGTAGGTGG 2520 

CAGATTGCCT GGTGGTATTA AAATTAAGCG TGCCAAATTA CGCGGTGAAC GTTCAGAAGG 2580 

TATGATTTGT TCGTTACAAG AAATTGGTAT TTCAAGTAAC TATATACCGA AAAGTTTTGA 264 0 
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ATATTTAGAT GATCAAGTAA TGGAATTTGA TTTAACGCCG AATCGTGCAG ATGCTTTAAG 2760 

TATGATAGGT ACTGCTTATG AAGTTGCAGC ATTATATAAT ACAAAAATGA CTAAGCCAGA 2 820 
GACAACATCA AATGAGCTTG ATTTATCTGC AAATGATGAA CTGACTGTGA CAATTGAAAA 
TGAAGATAAA GTACCATATT ATAGTGCACG TGTTGTTCAC GACGTGACAA TTGAACCCTC 
GCCAATTTGG ATGCAAGCAC GCTTAATAAA AGCGGGTATA CGTCCTATTA ATAATGTTGT 
TGACATTTCA AATTATGTGT TATTAGAATA CGGTCAACCA TTGCACATGT TTGATCAAGA 
TGOGATTGGT TCACAACAAA TTGTTGTTCG TCAAGCTAAT GAAGGCGAAA AAATGACAAC 

ATTAGATGAT ACAGAACGTG AATTATTAAC GAGCGATATT GTCATTACTA ATGGACAAAC 3180

TCCAATTGCA TTAGCTGGTG TTATGGGTGG CGATTTTTCA GAAGTTAAAG AACAAACATC 324 0 

AAATATAGTG ATTGAAGGTG CTATTTTTGA TCCAGTTTCA ATTCGTCATA CATCAAGACG 33 00 
TTTAAATTTA CGCAGTGAAT CATCTAGTCG TTTTGAAAAA GGAAT AG CT A CTGAATTTGT 
AGATGAAGCA GTCGACCGTG CATGTTATTT ATTACAAACT TATGCAAACG GAAAAGTGCT 

AAAAGATAGA GTGTCTTCAG GAGAACTTGG TGCATTTATT ACACCAATCG ACATCACTGC 3480 

TGATAAAATT AATCGCACTA TTGGATTTGA TTTGTCACAA AATGATATTG TTACTATTTT 3540 

TAATCAACTA GGGTTTGATA CAGAAATAAA TGATGATGTT ATTACAGTGC TAGTACCATC 3600 

AGGTCGTAAA GATATTACAA TTAAAGAAGA TTTAATTGAA GAAGTTGCAC GTATATATGG 3660

ATACGACGAT ATTCCATCAA CGTTACCTGT CTTCGATAAA GTTACTAGTG GTCAGCTAAC 3 720 

TGATCGCCAA TATAAAACTA GAATGGTTAA AGAAGTGTTA GAAGGTGCTG GATTAGACCa 378 0 

AGCTATTACG TATTCGTTAG TTTCTAAAGA AGATGCTACT GCaTTTTCGA TGCAACAGCG 3840

TCAAACAATT GATTTATTGA TGCCAATGAG TGAAGCGCAT GCGTCATTAC GTCAAAGTTT 3900 

ATTACCACAT TTAATCGAAG CGGCATCATA TAATGTGGCA CGCAAAAATA AAGATGTAAA 3 960 

ATTATTTGAA ATCGGCAATG TCTTCTTTGC TAATGGAGAA GGTGAACTAC CAGATCAAGT 4 020 

TGAATATTTA AGTGGTATTT TAACTGGAGA TTATGTAGTC AATCAATGGC AAGGTAAGAA 4 080 

AGAAACGGTT GATTTCTATT TAGCAAAAGG TGTCGTGGAT CGAGTATCTG AAAAGTTAAA 414 0 

TCTTGAATTT AGTTATCGCC GTGCTGATAT TGaTGGATTA CATCCAGGTC GTACTG CTGA 4200 

AATCTTATTA GAGAATAAAG TTGTTGGTTT TATTGGTGAA TTACATCCAA TATTAGCAGC 4260 

TGATAATGAT TTAAAACGTA CGTATGTTTT TGAGTTGAAT TTTGATGCAT TAATGGCTGT 4320

GTCGGTAGGT TACATTAATT ACCAGCCAAT TCCGAGATTC CCAGGCATGT CTCGTGACAT 43 80 

TGCATTAGAA GTAGATCAAA ATATTCCAGC AGCTGATTTA TTATCAACGA TTCATGCACA 444 0 
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AAAAGGTAAA AAATCAATTG CAATACGTTT AAATTATTTA GACACAGAAG AAACATTGAC 4 560 

AGATGAGCGC GTTTCAAAAG TACAAGCGGA AATTGAAGCA GCATTAATTG AACAAGGTGC 4620 

TGTTATTAGA TAATGATTTA AACCCCATGT ATAAGGATAT CTGAAGTAGA TTGATATCCC 4 680 

TAACATGGGG TTTTATTTTT GGGTTCACCA ATTTGGTTCC AATGCATTTA AAAAGTCAAA 474 0 

GAGGAACAGC GGAATACAGA TGATGcTTCG CACAACTGCA TAAAAGCCTC TAATGATTAA 4 800 

AAATCAAAGA GGCTTTAAAA TTTTTTGGGC TTTTTCACGA TTTTTAAAAT GCTTTTTTGA 4 860 

AATGGTATCT AAACGTGAAA GACCGTATTT TTTTATAATT TTGGCGGCGA TTACATCGAC 4 920 

TTTAGCACCG GCACCTTTAG GAATCGTCAT ATTAATATTT TTTGATATTT GATCCATATA 4 980 

TGTAACAAAT GCGTATCGAG AAATTATGCT TGCCACTGCA ATGGCTAATG ACTTCGATTC 504 0 

TCCTTTTGTT TCAAATTTTG TTTTCTTTGG AAGTGGTATA TCTGATAATG CGTAATGG CT 5100 

ATACACTTCG CGTTTTGCGA ACTGATCAAT GACGATATAG TCTAATTGAG ACGAATCAAT 5160

TTTTTCAAGT ACATTTTTGA TGGCTTCATT ATGAAGGGCA GCTTTCATTT TTACTTGAGT 522 0 

CCAGCCTTTT GCTTGCTGAA TATTATATTT TTCATTGTGT AGTGTTAATA ATGAATGTGG 5280 

TATGAAAGTA ACCAATTGCT CAGCAAGTTC TACAATTTTG GTATCGGTTA ATTTTTTTGA 5340

ATCATCTACA CCCAAAGTTT TTAAAATAGG GACATGCTCT TTGGTAACGA AAGCAGCACA 54 00 

CACAGTCAAC GGACCAAAGT AATCGCGACT TCCAGCCTCA TCACTACCAA TACAGTTAAA 5460 

TTG r TCATAC ATTAaAGTTg TcCAgAAAAG AATTAGCCAT ATTTnCCTTT 5510 
(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9623 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

GnTTATACTT ATAAATTTTA CGGGGGTAAT ATAATACTtA TTTACCTGTA ATATATGATA 60 

ATTCTTCAGC GGCAGCTGCG TTGATAGTTC TATGAGAAAT GATACCTAAT CCTTTAACAT 120 

TGGATTCTGA AATAACGATA GAACCATCAC TGTTAACTTT TTCAACAAAT GCTACATGAC 180

CGTAATGTTG ATCTGCACCA AATTGTCCAG CCTCAAATAC AACAGCAGCA TGACGTTTTG 240

GTGTATGACT TACTTGATAA TCACGGTATT GAGCTCGATT ATTCCAATTA TGTGCATCAC 300

CTAAATCACC TGAGATAGAT GTACCAAATT GTTTCATACG GTTATATACG TACCAAGTAC 360 



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ATGAATCATC 

CGTCAAACTG 

GATCTGTAGC 

ATTTCCATGT 

AGTAATCTTT 

ACAATTGATT 

CTTTGATACC 

ATTCTAAGAT 

GATGTGCATC 

TAAATTGTCC 

CACGTGTATC 

TCAATTCATC 

TTTTAGATTG 

AAATAGAGTC 

TTGCTTTATC 

AATTTGGTTG 

TGTCTGCTTT 

TGTTTTTATT 

TATCCGAATT 

ATTTATTTTT 

CAAT&ATATT 

TGTCTTGGTT 

TAGAAGTTTC 

CTTTTTGAGG 

TTGATAGCAA 

TATAATAATT 

AGTTGATAGG 

GCAAAAATAA 

TGTGACAATG 



ATAATCCTTG 

AGTTAATTGA 

ATATGTTTTA 

TGGTTTATAA 

TAGTGATTCT 

ACCATCAGCT 

GAATAAATTA 

TGCTTGGGCA 

TTTAGCAATT 

GCTATCATCA 

CTTTTGATTA 

TTGTGTTGGT 

AGATGCATAA 

TAAAGCCGAA 

GTCACTTGCT 

CTTATTAGAT 

ATCTTGTTTA 

CGAATCATTT 

TAAATTGAAT 

GGTTAGCAAT 

GTTAGAGTCT 

ACTTGTATTA 

ATCGTCATTA 

TGTATCAGCA 

ATAAATTAAA 

AAGTGTGATA 

TATCAATCGA 

ATTAATTT A C 

AAGAACGCAT 



ATAGAACGTT 

TAGTGTTTAA 

GATAAGTGTG 

ATTGTTCGAT 

TTCGTGCTTG 

TCTAATGTGT 

TGGTTTGGTG 

ATCATGACAG 

GATTTGACAA 

TTGTTAGATA 

ACATCGTTAT 

AACTGTGGAT 

TCTTTTTGTG 

TCTGACATTG 

GGTTGACTAT 

GTACTTGGTT 

GATGATTGCG 

GTTGACTTTT 

AAGTTTTGGA 

TGGTTTATAT 

GAAGTGCTGT 

TTTTTGTCTG 

GATTTTTTTG 

TAAGCGgTAG 

ATTTTATTTT 

ATAAACTATG 

CTAAATATCT 

AAAAAATATA 

TTAAATTATT 



CATATTTATC 

TAATACTGTT 

ATGTTG CATC 

TGCCATCAAT 

GATATTTTCG 

TAAAAGGAAC 

ACwTAGCTAA 

ACGCATAAAT 

ATTGACGTGT 

TACTAGGATC 

TGAATGATTG 

TCTTTGTATT 

TTTTCTTTGC 

ATTGATTATC 

TTGATTGATT 

TTGTATTGTT 

TATCAGTGTC 

CGCCATTACG 

TTAAAGTTGT 

TGGTTTGTGG 

CGTCTATAGT 

CTTTATCAAT 

AATCATGAGA 

GTGAAaCTAA 

T AGG CAT ATT 

ATTTGTTATA 

TCCAGTAAAT 

TAAAAAATAT 

TAAAATGGAT 



TAAATCTGGC 

TAATTTCTTA 

TTTATAAGAA 

ACCATTTTTA 

GAATCCAGCA 

AGAATTCCCT 

AGCACTACGA 

ATCGTTATCT 

ATCTTTTGAG 

TGTTTCGAAT 

AGCAGGTTTA 

AGATTTTTCA 

ATCTTCACTG 

TTTCGATGAA 

AGGTTGTGTT 

TGATTTAGGT 

ATTTTTGATG 

AGGTTGTTCG 

TAATGAGTAA 

TAAATTCTTA 

TTTAAATTTT 

ATCTTTACTT 

TGTTGTCTTA 

AGTAGGTAAT 

TCGTATTCTC 

ATTTATCGTA 

TGATTATACT 

GAATAATTCC 

CTGAAGAAAG 



ATG CGTTCAT 
G CATAGTTTG 
TCGGCTTCCG 
ATAAGGTCAG 
TTAATACTAT 
TCaAAAGCAC 
CCTGAGTCAG 
TGACCAATGC 
TCAACAACGT 
AATGATGTTG 
GATTTATGTT 
TTTTTGTCTT 
TATTGATCCA 
GATTTTTGAT 
GGCTTTGGCG 
GCTTTTTGAT 
CTATTGTCAC 
TAATCAGAAA 
TTATCATCGT 
TAAATAAAAT 
TTGTCGTTAT 
GTAGTATCCT 
GCTGTAGTAT 
ACGAGCGTAG 
CCTTGAAAAA 
TGCTGAAAAT 
AATTCACAAC 
TACATAGGAG 
TAG CT AAGAC 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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TAACTTATGG GCAATGTGGG ATCCATATGG CAACACGGGA CACATCAAGG TCGCAGTCGT 2280 

TAATGAAGAT AAAGGCGACA CAATCAGAGG GAAAAAAGTT AATGTCGGTA ATACGATGGT 2340 

TAATACACTC AAGAAAAATA AAAGTTTTGA TTGGCAGTTT GTAAGTAGAG AGAAAGCTGA 2400 

TCATGAGATA AAAATGGGTA AATATTTTGC AGGTATTTAC ATCCCATCTA AGTTTACACA 24 60 

TGAAATTACA GGGACACTAC GTAAGCAGCC TCAAAAAGCA GATGTAGAAT TTAAGGTGAA 2520 

TCAGAAGATT AACGCTGTTG CGTCTAAGCT AACAGATACT GGTTCGTCAG TTGTCGTTGA 25 80 

AAAAGCGAAT GAACAATTTA ATAAAACAGT AACTCGAGCA TTATTAGAAG AAGCTAACAA 2 64 0 

AGCAGGTTTA ACTATTGAAG AAAATGTGCC GACAATTAAC AAGATAAAAA ATGCGGTATA 2700 

TTCAGCAGAT AAAGCTTTAC CTAAGATTAA TGACTTTGCG AATAAAATTG TATATTTGAA 2760 

TAACCACCAA GCGGATTTAG ATAAATATGC CAATGATTTT AGAAAACTAG GAAATTATAA 2820 

AGGTGATATT TTAGATGCTC AGAAAAAATT AAACGAaGTC AATGGTGCTA TTCCGCAACT 2880

TAATGAAAAG GCTAAGTTGA TATTAGCTTT AAATAATTAT ATGCCGAAAA TTGAAAAAGC 294 0 

GTTAAATTTT GCAG CTGATG ACGTGCCAGC GCAGTTCCCT AAAATTAATC AAGGACTTAA 3000 

CATTG CG AG T CAAGGTATTG ATCAAGCTAA TGGACAGTTA AATGATGCCA AAGGCTTCGT 3 060 

CACACAAGTT AGAAGTAGAG TCGGTGATTA TCAAGATGCA ATTCGACGCG CGCAAGATTT 3120 

AAATCGAAGA AACCAGCAAC AGATTCCTCA AAATAGCGCG GCGAACAACG AAACATCAAA 318 0 

TAGTGCACCT GCAGCTGGTA ATGGTGTAGC ATCAACGCCA C CAAGTGCAC CAAGTGGCGA 324 0 

TACTGCACCA AATAATAATG TTACGCAAAA TACCGCACCA AATAGTAATA ATGCGCCTGT 3300 

ATCGACTACA CCACAAAGTA CAAGCGGGAA AAAAGATGGT CAAAGTTTTG TAGATATAAC 3360 

AACAACACAA GTCAGCACAG CTAACGAGAA CACACAAAAC ATTACAGATA AAGATGTTAA 34 20 

ATCAJtTGGAA GCGGCATTAA CGGGCTCTTT ATTATCATTA TCAAATAATT TAGATACCCA 34 80 

AGCGAAAGCC GCACAAAAAG ATAGTCAGGC ATTACGTAAT ATTTCGTATG GGATTTTAGC 3540

ATCGGACAAG CCTTCTGATT TTAGAGAGTC TTTAGATAAT GTTAAGTCCG GTTTAGAATA 3600 

CACAACGCAA TATAATCAAC AATTTATCGA TACATTAAAA GAGATTGAGA AGAATGAAAA 3660 

TGTTGATTTA TCAAAAGAAA TTGATAAGGT AAAAGCAGCT AATAATCGAA TTAATGAATC 3720 

ATTAAGGTTA GTTAATCAAT TAAGCAATGC ATTAAAGAAT GGTAGTTCAG GAACTGCTGA 3780 

AG CT ACT AAA TTACTAGATC AACTTTCAAA ACT AG ATT CA T CATT AT CAT CATTTAGAGA 384 0 

TTATGTTAAA AAAGATCTTA ACAGCTCTTT AGTATCAATA TCACAACGTA TTATGGATGA 3 900 

ATTGAACAAA GGGCAAACTG CATTATCCAA TGTTCAGTCT AAATTAAATA CAATTGATCA 3 960 
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AACAGTATTA 


CCAAGTATTG 


AACAACAATA 


CATTAGTGCT 


GTTAAAAATG 


CTCAAGCAAA 


4080 




C ITCTCGAAA GTGAAAAGTG ATGTAGcTAA AGCTGCTAAC TTTGTGCGCA ATGACTTACC 4140

AtAbliAGAA CAGCGATTAA CTAATGCGAC AGCAAGTGTG AATAAAAATT TACCAACGTT 4200

Ax 1AAATG G T TATGATCAAG CGGTAGGATT ACTAAATAAA AATCAGCCAC AAGCGAAAAA 4260

GGCTTTATCA GATTTAGCTG ATTTTTCTCA AAATAAATTG CCTGATGTTG AAAAAGATTT 4320

GAAAAAAGCG AATAAAATTT TCAAGAAATT AGACAAAGAT GATGCAGTCG ACAAATTAAT 4380

CGACACACTT AAGAATGATT TGAAAAAGCA AGCGGGTATT ATTGCAAATC CTATTAATAA 4440

GAAGACTGTT GATGTTTTCC CAGTTAAGGA TTATGGTTCA GGTATGACAC CATTCTATAC 4500

TGCACTGTCA GTATGGGTAG GTGCACTCTT GATGGTAAGT TTATTAACGG TTGATAATAA 4560

ACATAAGAGT CTAGAGTCAG TCTTAACGAC AAGACAAGTG TTCTTAGGTA AGGCAGGATT 4620

CTTTATAATG CTTGGTATGT TGCAAGCACT CATTGTATCG GTTGGAGATT TGTTAATCCT 4680

AAAAGCAGGA GTTGAGTCAC CTGTATTATT TGTACTTATA ACGATTTTCT GTTCGATTAT 4740

TTTCAACTCA ATCGTATATA CGTGCGTATC ATTACTTGGT AACCCAGGTA AAGCCATTGC 4800

AATCGTATTG CTTGTATTAC AAATTGCAGG TGGTGGGGGA ACATTCCCAA TTCAAACTAC 4860

GCCACAATTT TTCCAAAACA TTTCGCCATA CTTACCATTT ACGTATGCAA TTGATTCATT 4920

ACGTGAAACA GTAGGCGGTA TTGTTCCGGA AATCCTAATT ACAAAATTAA TTATATTAAC 4980

GTTATTTGGT ATAGGATTCT TCGTTGTAGG TTTAATTTTA AAACCTGTAA CAGATCCATT 5040

GATGAAGCGC GTATCTGAAA AAGTTGACCA AAGTAACGTT ACAGAATAAA AATTAAATCC 5100

ACACATTAGG GTTATAGCTC CTTAATGTGT GGATTTTTAT GTTTTTAGAC AGAAGAGATA 5160

GTAA iiivii? TCTTTTATGG GACGGTTGTT ATCATTGCTA TTATCCAGGA TGACTTACTA 5220

TAGGACTAAT ATTACCGACA AAGTGAATAT CCTCGTCTTC CGTAGTTAAA ATAAAGCTAG 5280

AACLiI"l"l"11i GATGTCATAG TGCTTATCGT TTACTGTTAA AGTACCAGTA CCATCGATAA 5340

TTGTAACTAA GCAATAAGCA TGTGGTTTAT TGAATTTTAA ATCTCCATGA ATATCCCATT 5400

TATATACTGC AAAATATTGA TTATCTACAA ATTGAGTTAC AGTGTGTGTG TCGATGTGAG 5460

TTGTTATAGG AGTAGTATTT GGTTCATGAT TGCCTAATTC AATCACATCT TTACTTTGCT 5520

CTAAGTGCAA ATCACGCAAT TGACCATTTT GATCTCGTCT ATCATAGTCA TAAATACGGT 5580

ATGTCGTATC GGAGGATTGT TGTGTCTCTA AAATTAAAAT ACCCGAACCA ATGGCATGGA 5640

CAGTGCCAGC AGGAACATAA TAAAAGTCAC CGGGCTTAAC AGGTATACGT TTGAAAAGAC 5700

TGTCAAATTC ATGATTATCA ATCATGTCTA TTAACGTCTG TTTATTATGT GCATGTACGC 5760
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GTTCGCCTTC GTGTTTTAAA GCGTAGTCAT 
CATTGGCATC TAATACTTTA GTTAGCAGAG
ATTCACGATG TTGTGACCAA AGTTGATCTA
TTGTATTAGG ACCATTTGGA TGTGCAGAAA
GGATATCATA GTTAAATGCT TTTAATGCAT
GTTGTAAAAA TAATGCCATA GTTAAAACTC
TCTGTAGTAC TGTTTGCATT AATTAGTGAT
GATAAGCGCT GAAGTATTTT TAAATGTGTA
AATATCAATT GAGGTAGACT ACCATCTAGA
ATAACAGCTA CAATCGGTTG TTTTACAACA
ATGCCAATAG CTGTCGTAGm tCcATTTCAC
TGTGCTCTAC ATAACGGCAA ATTTTAAGTT
GAGACATGTC GTGATCAGTA ATTATCATAG
TGAGATGTGA ATGTTTCGCG GTGTTATCTA
TATCTGTATC ATGAAGTTGC GTGTGTTGCG
TGCGTTTTAA TAATAGTACA GTAGTCATTG
TAAACCATAA TACATGATCA ATACCACCTA
CTCTATCGCC GACACCACCA ATGGCTGCAA
TTGCAGGTAT AATGCGCAAT GGATCTTGGG
CAAATAGTCC CATAGTGAAG GAAGCCTTAC
ACTTTTGAAC AAACGTTGCT AAACCTAAAC
CCATACCCAT AACGGCGTAA TTACCTTCAG
CCTTGTTTAC TGGACCGCCC ATATCGAAGG
TAATAATATT AGCACCTTGC ATACTTTTTA
AAATTGGTGC ACCGATTAAA AATATAAATA
GAATAATAAT GATAGGCATA ATTGGTGCCA
ACTTTGCGAT ATAACCTGCT AAGAAACCAG
CATCACTGCC ATAAAAACTA CCGTCAGCAG
GACCGGGcTT GTCAGCGATA CTAACAGCGA
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CATCTGGGTG AACTTGAACA GATAATTTAT s 880 

GGAAACTATC TCGTGAATCA TTATCGAATA 5940

GGGTCATATC CTTGTATGGA CCATTGATAA 6000 

TTGCCCAGCA TTCACCAGTT GTTTCATTAG 6060 

GACCGCCCCA AATTCTGTCT TTAAAAACGG 6120

CTCTATATTT TCATTAATAA GTTATAAATT 6180 

TGGCGTGTCT CATCATTCAT TAACGCTTTA 6240

TCCTGACTGT TGTTTGGTAC GGCAATTAAG 6300 

CTGTCCCATT TAACACCATG ATTATTTTTC 6360 

TCAGACTTTG CATGTGGAAT GGCCACGTTC 6420

GTTCTAGTAT TGCATTTTTT AAATGCGATG 6480

TATGAATCAA CATATCAATT GCTTCGTTTC 654 0 

TTTGTTGATC AAAAACATGA GAAGGTTTAT 6600 

CATTGTCAAC CTCTGTATCA TGTTGTGTAA 6660 

CTGGTGCATC TACTGCTATA ACTGGTGTAT 6720 

TGACAAGACT ACCTACTATC ACTGCAAAGA 6780 

ATACAGCCAC GATTGGACCT CCATGTGCGA 6840

TGACTGATGC AATCATTGCA CCAATGATGT 6900

CTGCGAAAGG AATAGCACCT TCAGTAATAC 6960

CCATTTCTCT TTCGGAATGA TTGAATTTAT 7020 

CGATTGGTGG TGTACATACA GCAACTGCGA 7080 

CAATAAGTGC TGAGCCAAAT AAAAATGCTA 7140 

CAATCATCGC ACCTATAATC ATCGCAAGTA 7200 

ACCAGGTTGT TAATGCCTCA AAAATATTAG 7260 

TCAATCCTAC AACGACCGAT GAAATAATGG 7320 

TTGCTTTTGG AACTTTAATA TCTTTAATCC 7380

CAACAATACC ACCTAAAAAT CCTGCGCCTG 7440 

CGATAGCGCC GCCAATCATA CCAGGAACAA 7500 

TATATCCAGC TAGTATTGGA ACCATAAATT 7560
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ATCCTTTTGA TGTCGTTtCA CCGCCTAGAG TCAGCGCGAT GGCGATAAGG AGTCCACCAA 76 80 

CTACGATAAA AGGAACCATA AACGATACAC CGTTCATTAA ATGTTGATAC ACCATTTGAA 7740

TACCATTTTT AGACTTACCG CGATCTTTCG AATGATAATT TGTTTCAGAT TGATAAATAG 7800 

GCGCATCTTG ATTAATGATA CGTTGAATTA GACCTCTCGG ATTATGAATC CCTTCGCGAA 7860 

CATTTTCATT AATCAACCGT TTACCAACAA ATCGGGACAG ATCAACTTGT TTATCAGCTG 7920

CAATTATGAC ACCGTCAGCT TCTTCGATGT CTTGCGTAGT TAAAACATTT TCAGCACCAA 7980 

CACCGCCCTG TGTCTCTACT TTAATATCCA CACCCATTTC TTTTGCTACC TGCTCAAGCT 8040

TTTCTTGAGC CATATATGTA TGTGCAATGC CATTTGGGCA TGAGGTAATA GCTACAATTT 8100

TCATAAAATC ATCTCCTTTT CTATATTGTA AGCGTATTCT CGATACTAAA AAAAAGAATA 8160 

ATTACCGTTA CTAGTGGCAA TTATTCTTGT AAGTATTCAA ATAACTGTTG CTTTAAACTA 8220 

TGATCATCTA AACTACATAA ATGGTTCACT GAATCATCAT CCAAGTTAGC AATTAATTGC 8280

ATCATTTGTT TTGTAAAAGC TTTGTCTTTA TGCGAAATCG CTAAGAAAAA GACAAGTTTG 8340 

ACATCGTGTT GTCGCCAAGG AAAAACATCT TTTGTGCGAA AAATAAGCAC ATGTGATTGT 8400

AAAACTTTTT CAGGATCTCC ATGAGGAATC GCCATAAAAT TACCTATGTA TGTAGAAGAT 8460 

GATTTCTCAC GCTCTAAAGC TGATTCGATA TATCCTTCTA CAATCGCATG ATGTGCTTGT 8520 

AATATTTTTT GAGCTTCTTC AAAAATTTGC ACAGTATGCC GTGATTTTTG TTCAGTATTT 8580 

ACGACAAGGA AATTGACAGT GTCCATATGA TGATGTGCTT GAACCGGATT TTGCTTTTGC 8640

TTCACAACGT GTCTGATTTT GTGACGATCA TCTTCAGAAA ATAATGGTGC AACCTTGATA 8700 

GTCGTCAGGT GCTTAGGAAG TATGTTTAGC GTTTGTTTAG GAATATCATG GGTCGTTATT 8760 

AATAAATCTA CATTGTCAAA GTGATAGTGT GTTATATTTT CTAGTTTAAT CGTATTTATC 8820 

ACTGACAACT CTTCGGATAA GTTATTTATT TTAGTTTCTA AAAAATTCGA CACACCTAGA 8880

CCATAATAAC AAGCAATGAC TACATTTAAT TGTGTTTTGG TACGACGCTC GATGGCAGCT 8940

TGAAAATGAA TTGTTAAAAA TGCAATTTCA TCTTCGCTCA TCTCTATATC AGTATCAATT 9000 

GCTAATTTAT CAATCGCTTC AAAAAGTGTG TTAAACACAA AGGGATAGAG TTTTTTAATC 9060 

TCTATAACTA AAGGATTGTT TAAATAAATG TTTTGAGTGA TACGTAAATA TGCTTTACTA 9120 

AAATGATTAT ATAAATTTTG TTGTAAAATC GAATCTTCAT TGAAAGGTAC ATGAATACGT 9180

TGCTGCATCA ATTCGATTAA GCGATCAATA TAACTTTGTA TAAATATACG TTCTATGCCA 9240

ATATCGAGTT TATTAAAATG ATAAGCAATA AAGAATGAAA ACATATTGAT TACTTTTTCG 9300

TTCAAGTCAT AACCTAATCT TTCGTTGATT TGCTTAATGC AAGATTGAGA TATCAATTTT 9360 
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AGATGAATTA AAAGCTGTTG TATTTGAATA TCAGTTGTTT CAATACTATG TTGTTGAAGT 94 80 

GTCTCTTGTA TAATATGCGA AATCATCCTT TGGTGTGAAT CAGGTAATTC aTTTAAAATT 9540

AGGTCTTCAA CATGTACATG CCCTGATGAT AATTGATTTA AATGGATGAT GGCATTAGTG 9600 

ATATCATTAT CTGTTCCATC GAC 9623 
(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

ACCGTGGAAA CACGTCTAGT CAATCAGAAA GCGATAAAAA TGTGACTAAA TCATCTCAAG 60

AGGAAAATCA AGCAAAAGAA GAATTACAAA GCGTTTTAAA CAAAATTAAC AAACAATCAA 120 

GTAAGAATAA TTAAAAAATT TTGATATTGT CTATGTTTAT AGTTCACAAG CCATTCAACG 180 

TATTGTAAAC TAAGGATAGT GTATTTTTTT AATAGTAATT TGTCAGGAGG TGCCTATCTA 240 

TGGAAGAACA TTACTACGTA AGTATTGATA TTGGATCATC AAGCGTAAAA ACAATAGTAG 300

GCGAGAAATT TCACAATGGT ATAAATGTGA TAGGTACAGG ACAAACCTAC ACGAGCGGTA 360 

TAAAAAATGG TTTAATTGAT GATTTTGATA TTGCGCGACA AGCAATCAAA GACACAATTA 420

AAAAGGCATC AATCGCTTCG GGTGTTGATA TTAAAGAAGT TTTCCTGAAA TTACCTATCA 480

TTGGAACGGA AGTTTATGAT GAATCAAATG AAATCGACTT TTATGAGGAT ACAGAAATCA 540

ACGGTTCACA TATCGAAAAA GTATTAGAAG GTATTAGAGA AAAAAATGAT GTGCAAGAAA 600

CAGAAGTAAT TAATGTGTTC CCGATTCGTT TTATAGTCGA TAAAGAAAAT GAGGTTTCAG 660

ACCCTAAAGA ATTAATTGCC AGACATTCAT TAAAGGTTGA AGCAGGCGTA ATTGCTATTC 720

AAAAATCGAT TTTAATTAAT ATGATTAAAT GCGTAGAAGC ATGTGGTGTT GATGTATTAG 780

ATGTTTACTC TGATGCATAT AACTATGGTT CAATCCTAAC AGCTACTGAA AAAGAGTTAG 840

GTGCATGTGT CATTGATATT GGTGAAGACG TTACGCAAGT TGCTTTTTAT GAACGCGGTG 900 

AATTAGTAGA TGCTGATTCT ATCGAAATGG CAGGGCGTGA TATTACaGAC GATaTTGCAC 960 

aAGGrTTaAA CACTTCTnAT GAAACTGCTG nAAAAAGTTA AACACCAATn TGGTCATGCA 1020 

T 1021 
(2) INFORMATION FOR SEQ ID NO: 168: 
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(A) LENGTH: 7963 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

TAATCTATTA TAAAAACTGT CCATACCCTT TGATTACCTT CTCTTCAGGT ACAGGCCACA 60

CTTGAGGCCA TAAGCCATAT GCTTGCTGTG AATAAAATTG TGCCATTTGT AACAATATAA 120 

TATATACAAA TAAACACCCA ATAATTGCTG TCACTAATGG ATATGATAAC CAAACCATTA 180

ATAAAACTGC AATAATTACT AACCTAAAGA TAATATTAAA TGCGTCTCTC CCTCTTATAA 240 

AGCTTCTAAT AAATAAGAAT AAATACATCG CATTAGAGTT AAATTTACTA CCCTTTGGAA 300 

CTGGTAAAAG TATATCTAGA TAACTTCTTC TGACTGCAGA TTCTTTCAAA TGTTTTACAT 360

CGGTGAACAT ATTAACAAAT TTATAATAAT TCATATGATG TCGATGTTCG ATTGCAATCA 420 

TTTTCTCCCA AGGATACAAA AAGCCTGGTT TATATTTTTT AACTAAAAAT TCTATTAACA 480

CAGGCAAAGC AACCATCACA AATGCGATGT ACCATTTTGG AGCTAATAGT AAGTAATATG 540

TTAGAGCAAA GGTGATGAAT GATATTAAAT TAACTTGCCA TGTTTTAAGT CCCGATTGAT 600 

ACCATTGCCA TCTTAAGCGT AAACCAACAT ATGGAAAAAT TAATGCACTG ACTCCAAAAC 660 

AAATATAAAA TGCCACATTA TGTTGATTAA TATTGTAAAA CAACGGGAAC ATTACAATAA 720

CAATAATGAG TTGGATTAAT ATGCGCGCAA AGTAACTATA TAAAATCGCA TGACGCATAA 780

ATTGAGACAT GTGTTTTTCA AATGGTAATA AAAAGATTTT ATCCgCTTCT TTTAACAGTG 840

GTCsCmTTGG AAAAATAGrT GTCAACGCAA CAATCACTGC TGCTATTaAT GAAAAATTGa 900

^i^TOGTTGG ^^^^TTT^_^^Q^TVc^C CATATCCArA AATAAATGCA CCCAGCAAAA 960

TAAGTAAAAA GACCATGAAA TGACCATTAA ATATAAACTT ATTATAATAA TTTTtCTCTT 1020 

TACGAAGGGC ATGTAATCTT TTATTAAATA ATGTGGTAGC TTGGTTACGC ATGTACATCT 1080 

CCACCTTGCG TCACATGAAT ATATATATCG TCTAATGTTT GATTATGTAA GCCAGTTTGT 1140

TGTCTCAATG CTTCTAAATC TCCAAATGCA ACGACTTCAC CTTCGTCTAG TATGaTAAAA 1200 

CGATCACAGT AACGTTCAGC TGTTGCTAAA ATATGTGTAC TCATTAGAAC GGTTCTACCT 1260 

TCGTTTTTCT TTTCAACCAT TAAATCTAAC ATGGATTGAA TTCCTAATGG ATCTAGGCCA 1320 

AGGAATGGTT CGTCTATAAT ATACAATTCG GGATTAACGA TAAACGCACA AATAATCATG 1380 

ACTTTTTGTT TCATCCCCTT AGAAAAATGA CTCGGAAAAA CTTTCAACTC ATTTTCTAAA 1440 

CGGAATGTCT TTAATAATGG CATTGCTCGA TTCATCGTTT CATCACGATC AATATCATAT 1500 
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TCCGGAATAT AAGATAACTT TCTTCTATAA GCCTCTATGT CATCATTAAT GTTGATATCT 1620 

GAAATTGATA GAGATCCTTC CATAGGTGTA AGCAATCCTA GCATATGTTT AATCGTTGTA 1680 

CTCTTACCAG CGCCATTAAG GCCAATAAGT CCAACAATTT CGCCTTTGTT TAATTCAAAA 1740

TTTATATCTT TAATTACAGG GCGTTTTCCA TATCCACCTG TAAGCTGTTC TACTTTAACT 1800 

GTCATAAGGC ACCTCCATGA CTTATATTGT ACCAAAAATT ATAAAATGCT CATATTAAAT 1860

ACACATGTCC TAATATCGAA TTTTTAGCGA CAATGTTATA ATGAATGGTA ATACTAGTTG 1920 

AAAAGGAGTG TAGTCATCAT GTCAGAAACA ATTTTCGGCA AAATTTTAAC TGGAGAAATT 1980 

CCTAGCTTTA AAGTATATGA AGACGATTAT GTCTATGCCT TTTTAGATAT ATCACAAGTT 2040 

ACTAAAGGAC ATACGTTATT AATTCCTAAA AAAGCTTCTG CTAATATCTT TGAAACTGAT 2100 

GAAGAAACAA TGAAACATAT CGGTGCAGCA TTACCTAAAG TAGCAAATGC TATTAAGCGT 2160

GCATTTAATC CTGATGGTTT AAACATTATT CAAAATAATG GTGAGTTTGC AGATCAATCT 2220 

GTATTTCATA TTCATTTCCA CTTAATTCCT CGATACGAAA ATGATATTGA TGGATTTGGT 2280

TATAAGTGGG AAACACATGA AGACATTTTA GATAACGATG CAAAACAACA AATTGCTGAA 2340

CAAAXTCAAG CACAATTTTA AATGTATGCT TAATCTAAGC TCGAACGGGT ATAATATGAT 2400

TAATATTATA ACAATTGCGT TTGAAGTGAT AACATCAAGG TTAGCAATTT TAAACAAAAT 2460

GAGTTATCAA GATAACAGAT GTTAAAAGTG AGGAGAATAT AAATGAAAGC ATCACGCATT 2520 

CTATTCGGTA TCGGTGTTGG CGTAGCAGCT GGTTTTGTAG TTGCACTTCA AGGACGTGAC 2580 

GACAAAAGTG TCAAGAACAA CACGATCGAT CGTACTGCCC CTACTGGTTC AAAATCAGAA 2640 

CTACAACGTG AATTTGAAAC GATTAAACAA AGTTTTAATG ACATTTTAAA CTATGGTGTT 2700 

CAAATTAAAA ACGAAAGTGC GGAATTTGGT AGTTCAATTG GTGGTGAAAT TAAGTCATTA 2760 

CTTGGAAACT TCAAATCTGA CATTAATCCT AATATTGAAC GTTTACAGTC ACACATCGAA 2820 

AATTTACAAA ATCGTGGCGA GGATATTGGA AACGAAATTT CTAAGTAGCA GGTTACGTTC 2880 

TCGATCACAA CTATTTTTAT TAGTAACAGC ATATTTATTT TTTAAAATTA AATGCCAAAT 2940

AAACGAGATG ACATTAGAAA TTAGATATTT CTTGTCATCT CTTTTTTAAA ACTCAAATGA 3000

ACTTATGTTT ACAAATTATA GGAAGACATT GTTTGTAGTG ATTTTCGCTT AAATCATATT 3060 

TATGAATTGA TTGAAAACAT TGCTTAGGAT TCATTGTGTT ATCCtTGCAC TTTGATTACG 3120 

CTTTACTTAA ATCATTATCG ACAAACAACA TACTTATATT TTCATTGAGC CGAACCTTAT 3180

ATACACATTA CATATACCTT ACTTGCACAA ATTATTAATC TGGTGTTTAT TATAATTACA 3240

TATCACTATA TTTTTAGCAT TTGTATAACT TAGTTGGTCA AAAGATGCTT TTGCATATGC 3300 
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X X X V— n X f\r\<j i. 


vj A Ivj L I I TAT 


TAGCAAGAAT 


ATGTGTTCGC 


AGAAATTTGT 


TCTGCATTCT 


3420 






1 XV* X/"VV»Vj\_ TAGTCAATCA GACAATTTTA CCAATCCCCA CTTTCGCGTT TCAAATCAAA 3480

CAATACGTr'fl CTCCTTTCTT CTTATATAAC AATTCTTCTA ACATGATATG TTACTATTGA 3540

^\ x x «^n— x unnL CTGAGTTAGT TATAATCTAA CTTATATTGA AAAGAGATGA GGCGTAAGAT 3600

ATGTTTTTAT GTAAAAGACA AATTGATATC AATGCACGAT TTGGTTTGCC TAGAATTGCA 3660

xxx n x wio x v? CAGTTGCAAC CATCATTATG T^XttAGTTA GTTATGAAGT AATGTATTTT 3720

TT A TOT* A IV f A CGCCATTATC AGATAGACAT TTTCTCATCT TTTTATTACT TGTATTTATG 3780

& fY2*F IV TVT* a *"P TACATAAAAG TATACATTTA TTATTTTTCT TACCATATAG AAAATCGTTT 3840

&A&flTTr a t a AGTTAACTAA AAGAAAATGG CTTATATTCT ATAATACCTA CGTCAATCAA 3900

v~v- X\» X AVJAtJLA AATTTTATTT TTGCATTAAC TTAATATTGC CGTTAATTAT CTTATCTGCA 3960

A X\J X X V_lj ill ATCTAACAAT TTCATTCCCG CAATATGGAC ATTATTTTAT GTTCTTATTG 4020

TCGGTATTTC CATTACAGAT TTATTATATT TAAAAATAAT TATATTTTCT 4080

AAA XAX\j{jAC AATATATAGA AGAACATAGT ACAGGTATTA ATATTTTGAA AAAAATTAAA 4140

AAX LLnlAl V_ ATTTATAACA AAATAATTAT AGCAAGGTGT TATTATTTGT TTTTAGGCTA 4200

A \» X Aft 1 Aye 1 tACAATCAAA TGTATATAGA CCTTGTTTTT TTATTTTCAT CAATTTCTAC 4260

LLL 1AAACL1 AATGCTCTAG TCTGATGTCA TGGGTTATTG ATTGGTGATA ATATAAAACT 4320

nlul inlAl 1 CACGATGATT AACTTACAAA GGAGTTTCAA CTATGAAGAT GATAAACAAA 4380

CGGTAACAGC TAGTGCTTTA TTATTAGGCG CTTGTGGCgC TAGTGCCACA 4440

AAAATACATT AATTTCTTCT AAAGCTGGAG ACGTAACAGT TGCAGATACA 4500

aTYlA A A A A & & TCGGTAAAGA TCAAATTGCA AATGCATCAT TTACTGAAAT GTTAAATAAA 4560

— ni l mnVjv,x vj- ATAAATATAA AAATAAAGTT AATGATAAGA AGATTGACGA ACAAATTGAA 4620

AAAATY! f*A & A AGCAATACGG CGGTAAAGAT AAATTTGAAA AGGCCCTTCA ACAGCAAGGT 4680

X XA#iWivjLv.Vj ATAAATATAA AGAAAATTTA CGTACTGCTG CTTATCATAA AGAATTACTA 4740

TfAfiATA a a a TTAAAATCTC TGATTCTGAA ATTAAAGAAG ACAGCArGAA AGCTTCACAC 4800

AX 1 X InAl 1A AAGTTAAATC TAAGAAAAGC GACmAAGAAG GCTTAGATGA TAAAGAAGCG 4860

AAACAAAAAG CTGAAGAAAT TCAAAAAGAA GTTTCAAAAG ATCCAAGTAA ATTTGGTGAA 4920

ATCGCTAAAA AAGAATCAAT GGATACTGGT TCAGCTAAAA AAGATGGCGA ATTAGGTTAT 4980

GTTCTTAAAG GACAAACTGA TAAAGATTTT GAAAAAGCAC TATTTAAGCT TAAAGATGGT 5040

GAAGTATCAG AGGTTGTTAA ATCAAGCTTT GGATATCATA TTATTAAAGC TGATAAACCA 5100
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AAAAATCCAA 


AATTATTGAC 


TGATGCATAC 


AAAGATCTAT 


TAAAAGAATA 


CGATGTTGAC 


5220 




TTTAAAGATC GTGATATTAA ATCAGTTGTC GAAGATAAAA TCTTAAACCC TGAAAAACTT 5280

AAACAAGGTG GCGCACAAGG CGGACAATCC GGCATGAGCC AATAACACAA AACCGAGCGA 5340

CCGTGGTTCA AAAATCATAC CACGGCCGCT CGGTTTTTTC GCATTAAAAA TCGGACAGAT 5400

GAGCTCATGT TTCAGTATAC TCATCTGTCC GATATCTTTT AATTCTTAAT CGAGTGATTC 5460

AGGATTGTAG AATCTACGAT TTTCAAGACC AAATATTTTA TCTGTAAACT GACCCTTGTC 5520

Acrrri-rrriA TATGCCTTTT CAAACATATT CATTCTAGCA TCGATATTAT CGATATAGCA 5580

TAAAATTTCT GCTTCTTTTA AGTATGGCAG TTTTGGAGAA CCATACTCTA ACTTACCATG 5640

ATGAGATAAA ATCATATGTC TTAACAACAT GATTTCTTCT CCTTCAATGT TCAATTCACG 5700

AGCTGCTTCA ACTACTTCAT CACTCGCAAT CGAGATGTGT CCTAATAAGT TACCTTCGAC 5760

TGTATACGAC GTCGCAACAG GACCACTCAA TTCTCTAACT TTACCAATAT CATGCAAAAT 5820

AATACCACTA TATAACAAAC TTTTGTTTAA CAATGGATAA ATGTCaCAAA TTGATTTTGC 5880

AATACGTAAC ATCGTTAATA CATGATAGCT TAAGCCACTC GCAAAGTTAT GaTGATGAGA 5940

ACTAGCAGCT GGATATGTGT AAAATCGTTC TTGATATTTT TTCAATAAAT GACGTGTGAT 6000

ACGTTGTAAA TTAGCATTTT CAATATCTAG CAAATAATGA GAAATCTCTT CTTGTATTTC 6060

TGCCGGTGAT AAAGGTGCAC CATCTACAAA TTGTTCTGTT TTTAATTGAT CTTCAGTTGT 6120

CGCTAGTCTA ATTTGGTTGA CTTTCATCTG TTTATTTCCG CGATAGTTTA TGATGTCACC 6180

TTTAACATGT ACAATTTCTT CAGGCTTGAT TGTTGCCATA TCATTTTTTG TAGCCGTCCA 6240

AAATTTCGCT TCAATTTCAC CACTTTTATC TTGCAAATGT AATGTCATAT AATCTTTACC 6300

TTGTGCTGTT ACACCCTGTG TAGCTTTATG CACTAAGAAA AAGTGATCAA CTGAATCTCC 6360

GGGATTTAGA TTCTCTATAT TTCTCATCGT TTCCCGCCTT CCTCTATTTT GTTTAATGTA 6420

ATCACTTCTT TTGATGGAAC AATATTATCT TTTACACATG TAAAGTATAG TACTTGATAG 6480

TGTTCTGATA ATGATCGTAA ATAATTCAAC ATTTTTTCAG TACGTTTTTT ATCAAAATGA 6540

ACAAATGCAT CATCAACAAT TAATGGGAAC GGATAATATG GTCTTAGTAC CTTAATTAAA 6600

CTGATACGTA AAGCTACATA AAGTAATTCT TTTGTAGATT GACTTAGTTC AACAGGATCA 6660

TATAATTGAC CATTAACATG TTTAACCGTA ATTGAATCTT CATTATAGTT AATCATCGTA 6720

TATCTGCCAT CTGTTAAATG CTTCAATATT TCTACCGCTT CATTAATAAC TTGAGGCAAA 6780

CGTTTATCTT TAATTTGTTT AATGTGTTCA TCAACTAAAC TTTGTAAATA ACTTAAACTT 6840

GCCCAATCTT TTGCGATATC ATTAAGTTGA TTTTTAAGAC TGTGATATTC ATGTCTTAAA 6900
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GCTTGCATTT CAAGATATTG CTCATTATAT TCGTCAACTT GAGTAGCCAA TAAATGATCT ~ 7020 

TCTTCTTCAA GTTGTGCAGT TGTTTTTTCA CTTAAACTAG AACTTAATTC ATAAGAATAG 7080 

TTTTGGTTCT CAAGATATTT AGTTAAATCA TTAAAACGAC TCAAATTACT AGTATAAGTT 7140 

TGGTAATCTT CATGATGTTG GTAAAAATCT TCTTCAGTAC CAACATTGAT AAAATCGAAT 7200 

AGTGCTGTAA TTTCTTTATT ATTTTCTTCT AATTGAGCAT TTAAATGATT TAATTCATTT 7260 

GTAACAAGTT TGGTATTTTC AGCATTAATA CGCCATTTTT CATTCGTGTC TTCAGCTGAT 7320 

TTCAACCATT GTtGCACATC GTGGAATAAA GATAATTTGT TGAAATAAAC AAATTGTGAT 7380 

TTTGTAACAG CTTCAGCATG ATTGTAGAAT GTATCTAATT CTTGAACCAA TTGCTGGCGT 7440

TGTTGATTTA AATCACTGAT ATGTTGATCT AATGCTTTAA TATTCGCCAT TGTAGAAATA 7500 

CTATCAACAA TTAAATCATT TGAAATTTTA GATGATAAGT ATAATTCATC CTTAACGTTC 7560 

TCAACTGTCG ATTGTAATTC ATCATGACGC CCTTTCGCAT CATTTAAACG ACCTTCAATA 7620 

TACTGACGTT TCTCTTCTAA AATATCTTTA TTTTTCAAAG CTTGTTGCCA GTGATCACGA 7680

ATGCGATATT GCTCATCAAG ATCAAAATCT AAGTCATAAT TTTCATCTAA AATGGCTAGT 7740 

TGTGCTTTAA TTTCTTCGAT TTCATCTGTG ATGGCCTCGC TATAATCTAC TTCTTTTGAT 7800

TTAGACATGA TGATACCGAT AACAAATACT AAAGTTAATA CTGCGAAAAT AATACCAAAC 7860 

AACATGTTGT TTGAAATAAA TGAGAAGGCA GTTAAACCAA TACCTACTAA TGTTAAAAGr 7920 

ATAAACGTTG TTCGkAACAA TTTTTGACGT TTTTGttTTT CTT 7963

(2) INFORMATION FOR SEQ ID NO: 169: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

ATATTGTCTT TACAATAGTT TGCTATGGAG GTAATTAACC AATAGGAGGA ATTTATAATG 60 

GCAGTAATTT CAATGAAACA ATTACTAGAA GCGGGTGTTC mCttCGGTCA CCAAACACGT 120

CGTTGGAACC CAAAAATGAA AAAATATATC TTCACTGAGA GAAATGGTAT TTATATCATC 180 

GACTTACAAA AAACAGTGAA AAAAGTAGAC GAGGCATACA ACTTCTTGAA ACAAGTTTCA 240

GAAGaTGGTG GACAAGTCTT ATTCGTAGGA nCTAAAAAAC AAGCACAAGA ATCAGTTAAA 300

TCTGAAGCAG AACGTGCTGG TCAATTCTAC ATTAACCAAA GATGGTTAGG TGGATTATTA 360 



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GAAGATGGTT TATTCGAAGT ATTACCTAAA 
GACCGTTTAA TCAAATTCTT AGGCGGAATT
TTCGTAGTTG ACCCACGTAA AGAGCGTAAT
CCTATCGTAG GTATCGTTGA CACTAACTGT
GCAAACGACG ATGCTATCCG TGCGGTTAAA
TTAGAAGGTC AACAAGGCGT TTCTAATGAA
GATGAAAAAG AAAAATCAGA AGAAACAGAA
GACTTAGATA TAGTTTAAAT GGGTGATAAG
AAAATCGAGG CAAATTACAA ATATTCAATT
CTAAAATCAT AATATATAAn ATGATAACTT
AGCAAAACTT GTTAAAGAAT TACGTGAAAA
AGCGCTAACT GAAACTGATG GTGACATCGA
TATTGCTAAA GCAGCTAAAA AAGCAGACCG
AACTAAAGGT AACGACGCAt TATCGTTGAA
AACGAAGGTT TCCAAGAGTT AGTTAAAGAA
GAAACTGTTG AAGCTTTAAT GGAAACAACT
ATTAAAGAAG CAATTTCAAC AATCGGTGAA
ACTAAAACTG ATAACGATGC TTTCGGCGCT
TTAACAGTTG TTGAAGGTTC AACTGACGAA
GCTGCAATCA ACCCTAAATA TGTTTCTTCT
GAAAQAGAAG TTTTAAAACA ACAAGCATTA
AAAATGGTGG AAGGACGTTT ACGTAAATAC
TCGTTAAAAA CCCTGATGTA ACAGTTGAAG
TTGACTTCGT ACGCTATGAA GTAGGCGAAG
ATGAAGTTAA AGGACAAATG AAATAATCTG
TTAATGTTGC TTTATTAAAA TGTAAATCAT
CTTGTATATG TTACATATAT TCACGATAGA
TATAAACGTG TAGTTTTGAA ACTAAGTGGT
ATAAATCGAG TAATTATTAA AAGTGTTGCT



AAAGAAGTAG TAGAACTTAA AAAAGAATAC 480

CGTGATATGA AATCAATGCC TCAAGCATTA 540 

GCAATTGCTG AAGCTCGTAA ATTAAATATT 600 

GATCCTGACG AAATTGACTA CGTTATCCCA 660

TTATTAACTG CTAAAATGGC AGATGCAATC 720 

GAAGTAGCTG CAGAACAAAA CATCGATTTA 780 

GCAACTGAAG AATAATCAAC TGTTGAATCT 840

ATATTAATGC TTATCACCTT TTTTAAAAAG 900 

AGAGTATTGG CAATCTTGCC TATAATAATG 960 

ATTGGAGGAA TAATGAATGG CAACTATTTC 1020 

AACTGGCGCG GGTATGATGG ATTGTAAAAA 1080 

TAAAGCGATT GACTACCTAC GTGAAAAAGG 1140

TATTGCGGCT GAAGGTTTAG TACATGTAGA 1200

ATCAACTCTG AAACAGACTT TGTTGCTCGT 1260 

ATCGCTAATC AAGTATTAGA TACAAAAGCT 1320 

TTACCAAATG GTAAATCAGT TGATGAAAGA 1380

AAATTAAGTG TTCGTCGTTT TGCTATCAGA 1440

TACTTACACA TGGGTGGACG CATTGGTGTA 1500 

GAAGCAGCAA GAGACGTTGC TATGCATATC 1560 

GAACAAGTTA GCGAAGAAGA AATCAACCAC 1620 

AATGAAGGTA AACCAGAAAA CATCGTTGAA 1680 

TTACAAGAAA TTTGTGCTGT AGATCAAGmT 1740

CTTTCTTAAA AACAAAAGGT GGAAAACTTG 1800 

GTATGGAAAA ACGCGAAGAA AACTTTGCGG 1860 

TCATAAAGTA AAACAAGGAA GAAGACACCT 1920 

TCTAATAAAA CGACAACTGT GTCTTCTTTA 1980

GAGGATAAGA AAATGGCTCA AATTTCTAAA 2040

GAAGCGTTAG CTGGAGAAAA AGGATTTGGC 2100 

GAGCAAGTGG CTGAAGTTGC TAAAATGGAC 2160
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TTAGGTATGG 


&.CC , ci r rcinb. ivp 


TP f **|V* A TTP A P 


ATGGGTATGC 


TTGCAACTGT 


AATGAATGCC 


2280 




TT AG CATTAC 


AAGATAf^TTT 

**rtvj/\ X MO XXI 


ALxAALMA lib 


P A TTPTP A TA 
VjATTGTGATA 


CACGAGTATT 


AACATCTATT 


2340 


5 


G AAATG AAG C 


AAf5Tflf2 PTCl A 


A C CTT AT ATT 


PP TPP T PP*TV* 


CAATTAGACA 


CTTAGAAAAG 


2400 




AAACGCGTAG 


TTATTTTTnr 

* XfkX X X X X OL 


TP P A PPT A M" ! ' 

AoLALtVj X A X 1 


PJP 7A A A PPP A T 

UvjAAAL L lAT 


ACTTCTCTAC 


AGATACTACA 


2460 


10 


GCGGCATTAC 


GTG rTT2 c A rz ta 

Vw X X >J V— /A\_J.T\ 


AL» 1 1 LjAAvjLA 


G ATG TT ATTT 


TA AHV^PPPA A 

TAATGGGCAA 


AAATAATGTA 


2520 


GATGGTGTAT 


n x x \v x VJwiun 


TPPTA A a PTE. 
1 L L 1 AMAvj X A 


AACAAAGATG 


CGGTAAAATA 


TGAACATTTA 


2580 




ACGCATATTC 


X VJV. X X L~r* 


A P A A pr"P0 w T ' A 


CAAGTAATGG 


ATTCAACAGC 


ATCCTGATTC 


2640 


15 


TGTATGGATA 


ATA & rATrrr 

AXAMLMx ILL 


ol 1AALTGTT 


TTCTCTATTA 


TGGAAGAAGG 


AAATATTAAA 


2700 




CGTGCTGTTA 


x vjoo x unnnn 


OA A ALtVj X ALLj 


TTAATTACAA AATAAATTTA 


GAGGTGTAAA 


2760 






AO ATTHTT7A 
A\ — A 1 1A1 1 MA 


TP AAA PT A IV A 

1 <j AAAl i AAA 


TCAAGAATGC 


AAAAATCAAT 


CGAAAGCTTA 


2620 


20 


TCACGTGAAT 


A AOL X MALM 1 


P A PTP P A PP A 

LAO I (jLA(jCjA 


AGAG CTAATT 


CAAATTTATT 


AAACGGCGTA 


2880 






ML 1 M I LaV? 1 OL 


A PPA A /"* A /""* /-+rrt 

AL lmalmlCT 


GTACAACAAT 


TAGCAAGCAT 


CAATGTTCCA 


2940 






XML A lul AMI 


TT CT CCATAC 


GACAAAACTT 


CTGTAGCTGA 


CATCGAAAAA 


3000 




CiCdA t A at a n 

Van. x X nu 


LMLtL 1 MAL 1 1 


ACjCsTGTTmAC 


CCAACAAGTG 


ATGGTGAAGT 


GATACGTATT 


3060 




gctgtapptt: 


PPTTa t\ ptap a 
LL X x AALMoA 


A P A A A A 

ACjAAlGTAGA 


AAAGAGCGCG 


TTAAAGATGT 


TAAGAAAATT 


3120 




OdTCzmLdZi. tap. 

vsv» x v»xr\M\J AA\j 


PT7A 7A TAPTATP 
L 1 AAALt i A 1 L 


T , /~" <T*«-» /»>i/~» «v A A 

TG TT CGAAAT 


ATTCGTCGTG 


ATATGAATGA 


TCAGTTGAAA 


3180 


30 


AAAP.ATTCA2V& 


MAMA IVjbLbA 


/"» A 'l«l> 7\ /™«*T»/^ A A 

CATTACTGAA 


GATGAGTTGA 


GAAGTGGCAC 


TGAAGATGTT 


3240 




CAGAAAGrA A 


P A 7\ /-« A A TTP 

LMoALAA 1 a l 


A A *T* AAAA A A 

aaTmmmAGAA 


ATTGATCAAA 


TGATTGCTGA 


TAAAGAAAAA 


3300 


35 


GATATTATGT 
w<r"i x x x x vj x 


P A P^T A T R. 7A AA 

LftVJ X M x AAAA 


PTJ\ TV T* A T* A PA 

L I aATATAlm 


ATGACATATT 


AAAATGCCAG 


TATTAAACGA 


3360 


TAATGTAAPA 

x^v% x w innwi 


TTTA 71 7a hTRT 
X X XMMMMx LKj 


GCATGTTTAA 


TTAAATCAAA 


GATGCATGTG 


ATAATTTAAA 


3420 




TTCAGAATHA 

— — X-X >wn\2srwx wn — 


fiPlTl 7A 7A 7A TAT 
oLft AMMAAA x — 


GGTGTTTAAA - 


CAAGTTAATT- 


AAAGATATAG 


-TTTATAAATA 


— 3480 - 


40 


ATAGGCATTA 


fifiTAT 7A TTP P 
v»Vj X M X A X luL 


TATA ATA A A P 

1 A 1 AA I AAAo 


TTATGTAATT 


TTTAACCTCA 


GTATGTATGT 


3540 




CAC^ATTTfTG 

V-nVB^rV X X X V— X \J 


rsTrtrii & a p*pp 

vjlol AAAL 1 \j 


T A PP^ A PTP A 

1 ALLwAb I LA 


GACTTTGGTA 


CAGTTTTTTT 


ATTTGCTTAT 


3600 




TCAATGCATT 


AAATfTArTaT 
MMM X OALj 1 M 1 


P A T A A A A TP A 

LiA A AAAA 1 vj A 


TAATGATTGT 


TTAGTAACTT 


ATACTATATG 


3660 


45 


Ar* Af2 Ari att; 3i 


TP TV PP r*"T*0/^/^ 


A/^/^A A A P A 

AGGAAAGACC 


ATGTTTAAAA 


AGCTAATAAA 


TAAAAAGAAC 


3720 




ACTATAAATA ATTATAATGA AGAATTAGAC TCGTCTAATA TACCTGAACA TATCGCTATT 3780

ATTATGGATG GTAATGGGCG ATGGGCTAAG AAGCGAAAAA TGCCTAGAAT TAAAGGTCAT 3840

TACGAAGtAT GCAAACAATA AAAAAAATTA CTAGGGTAGC TAGTGATATT GGTGTTAAGT 3900

ACTTAACTTT ATACGCCTTT TCCACTGAAA ATTGGTCAAG ACCTGAAAGT GAAGTAAA 3958
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



ATTAAAACAA CTTAATATAC CTATTTATGG TGGTCCTTTA GCATTAGGTT TAATCCGTAA 60

TAAACTTGAA GAACATCATT TATTACGTAC TGCTAAACTA AATGAAATCA ATGAGGACAG 120

TGTGATTAAA TCTAAGCACT TTACGATTTC TTTCTACTTA ACTACACATA GTATTCCTGA 180

AACTTATGGC GTCATCGTAG ATACACCTGA AGGAAAAGTA GTTCATACCG GTGACTTTAA 240

ATTTGATTTT ACACCTGTAG GCAAACCAGC AAACATTGCT AAAATGGCTC AATTAGGCGA 300

AGAAGGCGTT CTATGTTTAC TTTCAGACTC AACAAATTCA CTTGTGCCTG ATTTTACTTT 360

AAGCGAACGT GAAGTTGGTC AAAACGTAGA TAAGATCTTC CGTAATTGTA AAGGTCGTAT 420

TATATTTGCT ACCTTCGCTT CTAATATTTA CCGAGTTCAA CAAGCAGTTG AAGCTGCTAT 480

CAAAAATAAC CGTAAAATTG TTACGTTCGG TCGTTCGATG GAAAACAATA TTAAAATAGG 540

TATGGAACTT GGTTATATTA AAGCACCACC TGAAACATTT ATTGAACCTA ATAAAATTAA 600

TACCGTACCG AAGCATGAGT TATTGATACT ATGTACTGGT TCACAAGGTG AACCAATGGC 660

AGCATTATCT AGAATTGCTA ATGGTACTCA TAAGCAAATT AAAATTATAC CTGAAGATAC 720

CGTTGTATTT AGTTCATCAC CTATCCCAGG TAATACAAAA AGTATTAACA GAACTATTAA 780

TTCCTTGTAT AAAGCTGGTG CAGATGTTAT CCATAGCAAG ATTTCTAACA TCCATACTTC 840

AGGGCATGGT TCTCAAGGTG ATCAACAATT AATGCTTCGA TTAATCAAGC CGAAATATTT 900

CTTA€CTATT CATGGTGAAT ACCGTATGTT AAAAGCACAT GGTGAGACTG GTGTTGAATG 960

CGGCGTTGAA GAAGATAATG TCTTCATCTT TGATATTGGA GATGTCTTAG CTTTAACACA 1020

CGATTCAGCA CGTAAAGCTG GTCGCATTCC ATCTGGTAAT GTACTTGTTG ATGGTAGTGG 1080

TATCGGTGAT ATCGGTAATG TTGTAATAAG AGACCGTAAG CTATTATCTG AAGAAGGTTT 1140

AGTTATCGTT GTTGTTAGTA TTGaTTTTAA TACAAATAAA TTACTTTCTG GTCCAGACAT 1200

TATTTCTCGA GGATTTGTAT ATATGAGGGA ATCAGGTCAA TTAATTTATG ATGCACAACG 1260

CAAAATCAAA ACTGATGTTA TTAGTAAGTT AAATCAAAAT AAAGATATTC AATGGCATCA 1320

GATTAAATCT TCTATCATTG AAACATTACA ACCTTATTTA TTTGAAAAAA CAGCTAGAAA 1380

ACCAATGATT TTACCAGTCA TTATGAAGGT AAACGAACAA AAAGAATCAA ACAATAAATA 1440
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GCTTTTTCTT TATATATGAT GAGCTTGAGA 


CATAAATCAA 


TGTTCAATGC 


TCTACAAAGT 


icrn 
ISOO 






TATATTGGCA GTAGTTGACT GAACGAAAAT 


GCGCTTGTAA 


CAAGCTTTTT 


TCAATTCTAG 


i con 

A D 4 U 




5 


TCAGGGGCCC CAACATAGAG AATTTCGAAA 


AGAAATTCTA 


CAGGCAATGC 


GAGTTGGGGT 


icon 






GTGGGCCCCA ACAAAGAGAA ATTGGATTCC 


CAATTTCTAC 


AGACAATGTA 


agttggggtt; 


1 7^ A 

1 / 4U 




10 


GGACGACGAA ATAAATTTTG AGAAAATATC 


ATTTCTGTCC 


CACTCCCGAT 


TATCTCGTCG 


t p on 




CAATATTTTT TTCAAAGCGA TTTAAATCAT 


TATCATGTCC 


AATCATGATT 


AAAATATCAC 


i or n 

lOOU 






CTATTTCTAA ATTAATATTT GGATTTGGTG 


AAATGATGAA 


CTCTTTGCCT 


CG TTTAATTG 


T a o n 




15 


CAATAATGTT AATTCCATAT TGTGCTCTTA 


TATCTAAATC 


AATGATAGAC 


TGCCCCGCCA 






TCTTTTCAGT TGCTTTCAAT TCTACAATAG 


AATGCTCGTC 


TGCCAACTCA 


AGATAATPAA 


o rt a f\ 






GTACACTTGC ACTCGCAACA TTATGCGCAA 


TACGTCTACC 


CATATCACGC 


TCAGGGTR C A 






20 


CAACCGTATC TGCTCCAATT TTATTTAAAA 


TCTTTGCATG 


ATAATCATTT 


TGTGPTTTAfS 








CAGTTACTTT TTTTACACCT AACTCTTTTA 


AAATTAAAGT 


CGTCAACGTA 


CTTGATTOAa 
^» x x v^nx lUnn 








TATTTTCACC AATTGCCACA ATGACATGAT 


CAAAGTTACG 


GATACCTAAA 


v~ x x x X 






25 


CTGCTTCATC TGTAGTGTCT GCAACAACCG 


CATGAGTAGC 


GATATCACTA 


x**X X ^_rlX X \—r\ 








CTCTATTTTC ATCATGGTCG ATGGCCATTA 


CATCCATGTC 


TAATG CATT C 










CGATACTACC TCCAAAACGA CCTAGACCGA 


TGACTACATA 


TTCTTTACCC 


AT ACT CtlCCC 






30 


TCCATTAAAT GATTTTCATC AATTCATTGA 


AAATATAAAT 


TTAAAATTAT 


TATAAATGAfJ 








TACCCCAACT AAATTATCTA AATG CAGTAA 


TGCAAGTAAA 


TGAAAGTTGG 


GGTATCGTCT 

V7VJ J, X V- VJ X V- A 


"7 ^ it* n 

^ O O KJ 




35 


CAACTTATGA TTTCTTTCCT TCAACATATT 


CTTTGTCGAA 


AACAAATAAT 


C TT AAT AAT A 

w X Xi^f^X^TrtXrti 






ATATTAACGA TGGAAGTAAT AAAAGTAAAC 


CTAAAATAAA 


G AGAAT CACT 


AATGTCCAGP 

ran a. vj x vwiviv 








._ CCAKTCTGG ATTAACATAT -GCATCTGTAA- 


irrTTTACAAA 


CGGATATAAA- 


-AGGTATGGCA 






ATTTACTAAT TCCATAGCCA AAGAACGCGA ACATCATTTG TAAAATAACA AATACAAAAG 2820




CCAAACCATG TTTTTTCTTA AAGAATGTTA 


ACAATGAAGC 


TAATG CAAAG 


AAT AA G AAA f 


A O V V 






TTATACCAAA CATCCACCAA TAGTCAAAAA 


CAGCTGAATA 


AAAATGTTCA 


GAATTTTGAA 


2940




45 


TGCGTAATGA TAGAAATACG AATAAACAAA 


TGATAATCAT 


CGGCGGCCCT 


AAAAATATGT 


3000 






GCCATTGTCT TGTTAAATTA TATGCTGGTT 


CGTCATTTGC 


TTTTTTAGCA 


TAATATGTCA 


3060 






AAAATCCTGA TGAAATATAT AAAACTGAAA 


TAATTGCCAA 


GAATACTACA 


GACCAAGCAA 


3120 




SO 


ATGGGCTTAA TAATAACTGC ACCCAATCTA 


GATCGATAAC 


ATTGTTTCGA 


ACATTAATAT 


3180 






AGCCACCTTC TGTAATAGTT AAAG CAGTAG 


ATAATGAAGC 


TGGAATTAAT 


AATCCACTTA 


3240 
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AACTGTTTCT CAACGAT AT C ATAATCAGTG CTATTGAACC TGGTATTAAC AATACCGTGC 3360 

CTAAATATTT GATTGACTCT GGAAAGAAAC CTACGAATCC TACGAAGAAG AAAACAAAGA 3420

ATACATTCGT AACTTCCCAA ACTGGGTTTA AATAACGTGA AATTAAGTGA TTAATTTTCT 3480

TTTCATCACC AGTTAACTTT GAATGCAATG CGAAGAAACC TGCCCCAAAA TCTATAGAAG 3540

CAATAATGAT ATAGCAAAAT AAAAACAACC ATAACACTGT TATACCTATA AATGCATAAA 3600 

TCATTTTTCT ATTTCTCCTC CTTGCTTCTT GGCTAAACGA TTTACATCTT CATACGCCGG 3660 

TTTATTTTTA AACATACGAA TTAATACGTA TGCACATGTA TACATTAAAA TGATGTACAA 3720 

TATGCCAAAT AAAATTGTAA CGAaGGTTAT TCCGCCTGCT TGTGTTGCTG CTTCTGCCAC 3780

GCGCATATAA CCACGAACAA TCCAAGGCTG TCTACCCATC TCTGTTAAGA ACCATCCAAA 3840 

TTCTATAGCT AGCATTGAAG CTGGGCCTGT TAATAATATT CCATAAAGCA TCCATTTATG 3900

AGTAGAAAAC TTTCTAAGCT TTTTAAACAT TAAAGTTAAG ACATAAACAC CTGAAATGAC 3960 

AAAACATAAA ATTCCCATCG TTACCATTAA ATCAAAGAAA TAATGGACGA TCATAGGCGG 4020

ATGTAAACTT TTTGGAAAAT CATTTAACCC TTGTACTTTA GTTTTGACAC TATTATCTGC 4080

TAAGAAACTC AATAGTCCAG GTAATTCAAT CGCACCTTTA ACTTGCTGAG TCTTTTCATC 4140

TAACACACCA AATAATAATA ATTTGGCATG GGAAGATGTA TCGAAATGCC ATTCATAAGC 4200

TGCTAATTTT TCAGGTTGGA ATTTATGCAA AAATTTTGCA GATAAATCCC CTGCCAACAT 4260 

AGAAAGTAAT GTTGAAAAGA ATCCAACTAT CATAGACATT TTCAAAGCTT TCTTATGGTA 4320

GACAGTATCT TTAGGTTGAC GATTACGCAA TAATTTAAAA GCTGCTATTG ATGCAATAAC 4380

AAATGCCATC GTCATACCGG CTGTAGTAAT TACGTGAAAT GATCGAACTA TAAACGATGG 4440

GTTAAACATC GCTTCTATAG GTTGAACATT GACCATCTTT CCATTCTTCA ACTCAAAACC 4500 

TGCAGGCGTA TTCATAAATG AATTCACTGA AGTAATGAAG AATGCTGAGA AAGAGCCACC 4560 

AATAATTACT GGTATACTAA TTAAGAAATG TGTCCATTTA TTTTTAAAAC GATCCCAAGT 4620

ATATAAATAT ATACTTAAGA AAATAGCTTC AAAGAAGAAC GCAAATGTTT CCATAAATAA 4680 

TGGAAGTGCA ATAACGTGTC CACCCATTTC CATAAATGTA GGCCAAATCA ATGATAATTG 4740

AAGTCCTATA ATTGTACCTG TAACAACTCC CACTGCTACA GTAATTGTAT AAGCTTTAGC 4800

CCATCTTTTG GCCATAGCTA TATATTGAAG ATCATTTTTG CGAATACCTA AAAATTCTGC 4860

AATTGCGAAC ATTAAAGGCA TACCAACACC AATCGTTGCA AAAATGATAT GAACTGCTAA 4920

AGTCATAGCT GTCAAAAACC GACTGATTTC AACTGTATCC ATTTAAAAAC ATCACCTTTT 4980

TCTTTTTTTG ATGACAACAC AATGAACTTA ATTATAATTG CTATAATGTG TATTTTTAAA 5040 
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GAATTTCAAT GTATAATTGT GTATATTACA TTAGAATAAA GCACGAAGGA GCATGATACA 


5160 




TGTCAGAAAT AATCGTTTAT ACGCAGAATG ATTGTCCACC TTGTACATTT GTAAAAAATT 


5220 


5 


ATCTAAATGA GCATCACATT GATTTTGAAG AGAGAAATAT CAACAATCAA CAATATCGAA 


5280 




ACGAAATGAT AGATTTTGAT GCTTTTTCAA CTCCGTTTAT TTTGTTGAAT GGC


5333 




(2) INFORMATION FOR SEQ ID NO: 171: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 








ATACGTGACC CTTTATCCGA AAATTTCTTT TCATATTCTG TTAAAATATT 


ACTO CC. AT CC 


60 




TCTTCTTGAT GTAAATTTAG ATTTATTTTT GTAAAATACA TTCCAAATTG 


AGACATACTT 


120 




TCTAAACTGT AGGCAAATAG TCCTCTGTTA TCAGTTTTAA AATGTAAATC 


x \— x x v»-^v x \«-/» 


180 


25 


TTTAAGATTT GTTGATACAA CGCTAAAAAC GTATGATACG TTAAACGTCG 


XXX X \J *wXA X \JX\ 


240 




CGATTTTtTG GCCATGGATC TGAAAAGTTC AAATAAATAC GCGAAACTTC 


GCCGT CTTT A 


300 




AAATATTCAT TTAATTCAAT GGCGTCATTA CAAATAATCT TTAAATTTGT 


TAAACCCATC 


360 


30 


TCTTTAACTT TATCCAATAC TTTATAAACG ATACTTTTCT CACGTTCCAT 


TGAAATATAG 


420 




TTAATATGAG GATTTTGAGC AGCTAATGTT GTAATAAACT GCCCCATACC 


CGAACCAATT 


480 




TCAATGTGTA TCGGTTGCGT TTTaTCAAAC CATTCAGTCA TTTTCCCTGc 


ATGTTGACCG 


540 


35 


TCCATGTCAA CCAATTCAGG ATGATCTTTT AAATAATCTT CAGCCCATGG 


TTTGTATCGA 


600 





_ _ ACTCXCATAT- TTTATTCTCC -TCTTAAATAA- ACATGTTACT ATTCATAACT 


TCATTTAGGA" — 


660 


40 


ATTTAAGCCA AGTGTTCATA TrrTTHTHTr Trrri'ifirrr Tw«%'rKnohT 


TGAACAAGAC 


720 


CTATAGATTG AATTACCGTA TACCATTTCA TACGTTTATT TAAATTCAAG 


CTCTCTTGAA 


780 




CACCATATGT TTCAAGCCAT TCAGACCATT GTTGTTGTGG AACATAGTTG 


TAAAGCAGCA 


840


45 


TTCCGATATC AATTGCCGGG TCTGCAATCA TTGCACCTTC CCAATCAACT 


AAAAATAGTT 


900 




CATCTCGATC GGATAATAAC CAATTATTAT GATTCACATC ACCATGTACA 


ACAGTGAAAA 


960 




AACGCGAATC TAAACTCGGT ATATG CTCTT CTAAATAGGT TAATGATTTT 


CTCACAATAT 


1020 


SO 


GATGTGTTAA AACTT CTCTT GATAAAGAGG CATTAATTTT ATTAAGCATA 


ATCTCAGGAG 


1080 




TAATAGGTTC CATTTCCATA CGCTTTAACA TACTTAATAA AGGTCTAGAA 


TTGTGTATCT 


1140 
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TTTTCCAATG TTGTG CTGTA ACAACCTCGC CTGTTTCTAT GCGTTTCGTC CATACTAATT 1260 

TGGGCACAAT ACCTTCTGCT GATAATGCCG CAATAAATGG ATTTGAATTT CGTTTTAAAA 1320 

ACAACTTTTG TCCATCTTGT TCAGCCATAT ATGCTTCACC AGATGCACCA CCTGCTGAAT 1380

CAAGTGTCCA CCCTAATTGA TAAAACTGCT CCAACTCGTC CACCTCACTT TCAATTAGAA 1440

AATGGCTCTA GAAATAGGTT TTTCAAGAGC CATATATTCT AATTTATAAC ACCATACTGG 1500 

TACAAATATT ATGTCCAGAT AATTATTGTA AATCCTCAAC CAATGCCTAC ATTACACGAC 1560 

TAAATTTAAA TCGTAATGTC TGTCATTGAC ACCATACATT CTATAGTCAC TTACTTGACA 1620

TATAATGTTA CCGTGTCTAA AACTACATGT TTTTGAATCT CTGTAGGCGA TAAACTcTAG 1680 

TTTTCAAAAT AATTGCTATC CCATTTTCAT GGTTAGCATA AATTTATGAA CTGTAACATT 1740

TACGTACTTA GTAAAATATG ATGCACATCA TATTTGTrAC TCATAGAAAA TTTTATAAtT 1800

TTTATCATTA TATTTCAACT GAAAATGAGA AACAAAATGG CACTTTTTAC TAATATGTGT 1860

TTTCTAAACA ACACTTTTAA GCTTCGTTTT AAATTATAAC ATAATTCACT TACGAAAGTT 1920 

GATAAATTTA AGTAATTTAA TCTAAAAATA TGATGAAAGA ATTTTAAATA CTGTGTGACT 1980 

CTATATACTT TTCAAATCCT TCTTGTAGTT GACGTGTAAT TGGGCCAACT TTACCATCAT 2040

TAACTGGTTC ACCATCTAAT TTAATAACAG GTGTAACCTC AGCTGAAGTA CTTGAAACAA 2100 

TAACTTCATC TGCGTTTTTC AAGAAATCTA CAGTAAACGT TTCTTCTTTA AATGGGATGT 2160 

TATAGTCTTC GGCAATTTTT TTAATTACAA TTCGTGTAAT ACCATTAAGA ATATAGTTGT 2220 

TAATCGGATG TGTATAAATC ACACCGTCTT TAATTGCATA AGCATTACTT GAAGATCCTT 2280

CAGTTACAGT TtCACCTCGA TGTTGAATTG CTTCAACTGC ATTATATTTC ACAGCATATT 2340 

CTTTTGCTAA TACATTcTCC TAATAAGTTC AAGCTTTTAA TGTCGCAACG TAACCATCGG 2400

ATATCTTCAA CGGTAACACC ATTCACACCA TTTTCTAAAT GATCATAAGG ACGATCATAA 2460 

CTCTTTGTAT AAGCAACAAT TGCTGGTTCT ACTTCAGGTG TCGGGAAGCT ATGATTCCTT 2520 

TCAGCTACAC CACGCGTTGC TyGAATATAA ATTGCCCCAG TTTCAATTTG ATTCATATCA 2580 

ACTAATTTAC GAGATAGTTC AATTAATTCT TCTACAGAAT AATTTAAATC TAAACCAATC 2640

TCATTGGCAC TACGTwAAAw TCTTTGATAA TGTTCTGTTA CTGTAAATAA CTTACCATTA 2700

TATACTCGAA TGTATTCATA AATACCATCG CCAAATACGT ATCCTCTGTC GTTGTATGAA 2760

ACCTTTGCTT CACTTGGACT TACAAACTCA CCATTTAAAA AAATTTTTTC CATATATTAT 2820

TCCTCCACGC ATAATGAATA AATTGCTTCT AAGTAAATAC TAGTTGCGTT AAATAACTGT 2880

TTTTTAGTGA TATATTCATT TTTCTGATGC ATTAAATCTT CAGAATCACT AAACATTGCG 2940
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TCAGTCATAT 


CATTTGTTTG 


ATTTCTATAT 


GCAGTAACTA 


ACTTTTGTAC 


AAAAGGATCA 


3060 




TTTTTATCAA 


CATAATGTGG 


TGGTTGGACT 


TTACCTAATT 


TCACTTCAAA 


GCCATATTGT 


3120 


5 


TGAATCTCAT 


TTGCAAAACG 


ATCCATAGCT 


TTTTCAAATT 


CAAATCCTTC 


TGGGTAGCGT 


3180 




AAGTTGATAC 


CGAAAAGACC 


TGCGTTTTCA 


TTATCATATG 


TAATAACACC 


AATGTTAGTT 


3240 


10 


GTCACGTCAC 


CCATGACATC 


TGTATGGAAT 


TTCATTCCCA 


TCTTTTCACC 


AAAATCTGAA 


3300 


TTAAATAAGT 


AG CG ATT ACT 


AAATGCTACA 


AACGCTTGTG 


CATTATTATC 


AAGATTTAAT 


3360 




GATGCTAAGA 


ATTTTAGTAA 


GTAAAGACCC 


GCATTCACAC 


CGATAGATGG 


ATCCATACCA 


3420 


15 


TGAACCGCTT 


TACCTTCAAC 


TGTTAAAACT 


AGAATGCCAC 


TATCAACAGT 


ACTATCACCT 


3480 




TGTAAATGAT 


TTTGTTCTAA 


AAAGTACTCA 


AAGTCTTGAA 


TAACATCTGT 


CATATTTTCT 


3540 




TTAACAAGCA 


CTCTTGCTTC 


TGCATGATCA 


GGTACCATGT 


TGTAACGTTC 


ACCAGATTTA 


3600 


20 


AAAGTTATTA 


ATTCATAATC 


AGGTTCATCT 


TGATCTTCAG 


TAAGTTTATT 


TTGAACTAAA 


3660 




TCAAATGTTG 


TAATGCCTTT 


TTCACCATGA 


ATACATGGAA 


ATTCTGCATC 


TGGTGCAAAA 


3720 




C CTAATGTTG 


GCATTTCTTC 


TGTTTTAAAA 


TAGCGATCCG 


TACATTTCCA 


ATCAGATTCT 


3780 


25 


TCATCCGTAC 


CAATAATCAT 


ATGAATACGT 


TTCTTCCAAT 


CCACATTCAT 


ATCTTCTAAT 


3840 




ATCTTAATTG 


CATAATAAGC 


AGCAATTGTT 


GGACCTTTGT 


CATCAAGTGT 


ACCTCTAGCT 


3900 




ATGATAGCAT 


CTTCTGTTAC 


AACCGGCTCG 


AACGGATTAC 


TATCCCATCC 


ATCACCAGCA 


3960 


30 


GGAACAACGT 


CAACATGACA 


TAAGATACCT 


AATACGTCAT 


TTCCTTTACC 


TGCCTCAATT 


4020 




CTTCCTGCAA 


TATGATCCAC 


ATCATGTGTT 


GTAAATCCAT 


CTCTATGTGC 


AATTTCATAC 


4080 


35 


ATGTAGTCTA 


ATGCCTTACG 


AGGACCTGGA 


CCAACTGGTG 


CGTCTTCTGA 


TGCTTTTGCA 


4140 


TCATCTCTCA 


CACTTTCAAT 


TGCTAATAAT 


CCTTTTAAGT 


CATTAATGAT 


TTGATCTTCG 


4200 




TATTGTTGAA- CTTTTTCTTT 


CCACATTCGA- 


AATCGACTTC 


CTTTTTTCTA TAAG TTAAAT 


~~ 4260~ 


40 


TCTATTTTAC 


ATGAAAAGAT 


ATAAAAACTA 


CAATAAGATG 


TCAGAAAATA 


ATAAAAAGGA 


4320 




ACAAAACGAT 


GCTATTGATA 


TGACACAAAT 


CATAAATAGC 


TGCTTTGTTC 


CTTTTTTAAT 


4380 




TTATATATTT 


AAAATACACA 


TATTCAAGAG 


CTCGAGATAT 


AAGTCAATGT 


ACTAGGCACA 


4440 


45 


CAATTTAATA 


TTGACAGTAA 


TTAACCGAAC 


GAAAATGCGC 


CCCGGGGCCC 


CAACATAGAG 


4500 




AATTTCGAAA 


AGAAATTCTA 


CAGACAATGC 


AAGTTGGCGG 


GGCCCCAACA 


TAGAAGCTGG 


4560 




CCAATAGTTA 


GCTTTCAATA 


ATGTGCAAGT 


TGGGGTAAGG 


GCCCCAACAC 


AG AAG CTGGC 


4620 


50 


CAATAGTCAG 


CTTTCAATAA 


TGTGCAAGTT 


GGGGTAAGGG 


CCCCAACACA 


GAGAATTTCG 


4680 




AAAAGAAATT 


CTACAGACAA 


TGCAAGTTGG 


CGGGGCCCCA 


ACACAGAAGC 


TGGCCAATAG 


4740 
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50 



TAAAGAAATA CGTTTTCTTT AGATATTAGT ATTTCTTATG AATGAGTTTC ACGCATGTAT 
TCTTCTTTCT ATATGCATAT TAGCTATGAC TAACGATAAA GAACCTGAAA CACTAATAAA 
TGTCCTATAG TTTACAATAT TATATTGGCA GTAGTTGACT GAATGAAAAT ACGCTTGTAA 
CAAGCTTTTT TCAATTCTAG TCAACCTTGC CGGGGTGGGA CGACGAAATA AATTTTGCTA 
AAATATGATT TCTGTCCCAC TCCCTTATCA TTTCTGTCCT ACTCACATCT TATTCTTTAT 
CAGAT AATG C A T TTTTATTC TTTTTTAAAT CTTCTTCAGT GACGATACGT AAATTATTAT 
TTGGTGTGCG CCACCTTCAT CATCAAATTT ACCTTTTTCA ATACTTTCGT CAGTCTTATT 
GT CAT ATTCG GTAAATTTTG ATTTTTCTTC TTTGAAAAAT GCTTTTGGAT TATTTTTTAA 
TCTATTAGCA TATTCTTTCG GATTTGTTTT TACTTCTTTA ATTGTTTCAT TAGCAATTGT 
TCCTAATTGC GTCGCTTTAT CCTTAGCATT ATCTTTATAG CTTTGAGGAT CTTGTTTATA 
TTTATTATAT TCcTGcTTTC AGCTTGTCAC GACTATCTTT ACGTGTAACA AGTACAGCTG 
CTACAGCGCC AC CT AT AC CT AAAATCGCTT TAAATAAATT ACCTTTTGCC AT AT CAATCG 
TCTCCCTTTT ATTTATAATT TAATTTGTCA AAATCATTTT CAGTTAATAA ACGATATTCT 
CCTGAATCTA AATTGCTGTC CAATTCTAAA TCAGCAATTT TGATACGTCT TAAATGTAAT 
ACCTCATTTT GAATGCTATG AAACATTCGT TTAACTTGAT GATATTTTCC TTCATAAATT 
GTTACGTGTG ACGTTTGATT ATCAATATAA GTTAATATTG CAGGCTTAAC CTTGCCATCA 
GTCAGTGT t A CACCCTCTTT AAAAGCTTGA ATGTCGTCTT CAGTGATAGG ATTTGCTGAA 
ATAACTTCAT ATTTTTTAGA AACATGTTTG TTTGGACTCA TTAATTCATG ATTAAAATCA 
CCATCATTCG TTATCAATAA AAGCCCTTCT GTATCTTTAT CAAGACGACC AACCGGAAAA 
ATATTTAGAT GTTGGT ATT C AGGTATTAAA TCAATAACGG TTTTTGAATG ATGATCTTCA 
GTTGCTGATA TATAACCTTT TGGCTTATTT AACATAATAT AG ACATTTT C AATGT ATT CT 
ATTAATTCTC CACGAACTGT TATCTTATCG TTTTCTGGTT CTATATGTGT TTTTGGTGAT 
TTAATTACTT GTTCGTTGAC ATTTACAAGG CCTTTTTTAA GTAACTGTTT GACCTCATTA 
CGTGTACCGA CGCCCATATT TGCTAAAAAT TTAT CTATTC TCATCGTAAA AACCTAACTC 
TACGTCTTAA TTTTTCAGGA ATTTCACCTA AGAATTCGTC CGCAAGACGC GTTTTAATTG 
TGATTGTACC GTAAATTAGA ATACCTACTG TAACACCTAA AATAATAATG ATTAAGTAAC 
CAAGTTTAGT AGGTTCTAAG AATAGATTTG CAAGGAAAAA TACTAATTCT ACACCTAGCA 
TCATAATAAA TGAATACAAG AATATTTTTG CAAAATGAAT CCAACTATAG CTGAATTTAA 
ACTTCGCATA TTTTTTAAGA ATATAGAAAT TACATC CAAT TGCAAATAAT AATG CG AT AC 



4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
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Av- 1 A viATAAC 


TACAGAAGCT 


AAAATAACAT 


AAACTGTTAA 


TTTCTGTTTA 


TCTATACCTT 


6660 






TGCCGTTACA 


CTTAATAGTG 


AAATTAGTAT 


TGCTACAGGC 


GCATAATAGA 


6720 


5 




A CTAC CAT CA 


TGGTTAGGGT 


CATGACCTAA 


AACAATTGGA 


TCGTAACCAT 


6780 




AlaAAAAt. 1\j I 


GAATAATGGT 


TGTGCCAAGG 


CCATAATTCC 


AATACTAGCT 


GGAACAGTTA 


6840 


10 


lAAAuATTAA 


TACACCAATA 


GATGTTCTAA 


TTTGATGATG 


CATTTCATGT 


AAGCGACCTT 


6900 


/^T»0/~»a TV fA'TY""T > 


TTTTGTAATA 


TAAGGAATTA 


AACTCACTGC 


AAAACCAGCA 


CTTAATGATG 


6960 




I V_vj(jAATV-AT 


TACAATTTTA 


TTAGTTGACA 


TATTTAGCAT 


ATTAAAGAAT 


ATATCTTGTA 


7020 


IS 


At. I vj l uAAGG 


TATACCAACT 


AAAGATAAAG 


CACCGTTATG 


TGTAAATTGA 


TCTACTAAGT 


7080 




TAAATAATGG 


ATAATTCAAA 


CTTACAATAA 


CGAACGGTAT 


ACTATAAGCA 


ATAATTTCTT 


7140 




TATACATCTT 


GCCATATGAC 


ACATCTATAT 


CTGTGTAATC 


AGATTCGACC 


ATACGATCAA 


7200 


20 


TATTATG CTT 


ACGCTTTCTC 


CAGTAATACC 


AGAGTGTGoLA 


TATACCAATA 


ATCGCACCAA 


7260 




CTGCTGCTG C 


AAAAGTAGCA 


ATACCATTGG 


CTAATAAAAT 


AGAGCCATCA 


AAGACATTTA 


7320 




GTACTAAATA 


ACTTCCGATT 


AATATGAAAA 


TCACGCGTGC 


AATTTGCTCA 


GTTACTTCTG 


7380 


2S 


ACACTGCTGT 


TGGCCCCATA 


GATTTATAAC 


CTTGGAATAT 


CCCTCTCCAT 


GTCGCTAATA 


7440 




CAGGAATAAA 


GATAACAACC 


ATACTAATGA 


TTCTTATAAT 


CCAAGTAATA 


TCATCGACTG 


7500 




ACCAACCGTT 


TTT AT CATG A 


ATGTTTCTAG 


CTAATGTTAA 


TTCAGAAATA 


TAAGGTGCTA 


7560 


30 


AGAAATACAG 


TACCAAGAAA 


CCTAAAACAC 


CGGTAATACT 


CATTACAATA 


AAACTCGATT 


7620 




TATAAAATTT 


CTGACTTACT 


TTATATGCCC 


CAATAGCATT 


ATATTTCGCA 


ACATATTTCG 


7680 


3S 


AAGCTGCTAA 


TGGTACACCT 


GCTGTCGCAA 


CTGCAATTGC 


AATATTATAT 


GGTGCATAAG 


7740 


v-GTATGTGAA 


CGGCGCCATA 


TTTTCTTGTC 


CACCAATTAA 


ATAGTTGAAT 


GGAATGATaA 


7800 




AAAGYACGC G- 


GAATAGGTTG- 


GTAATTAATA- 


TACTAATGGT AATTAAAAAG 


GTTCCACGCA 


"7860 


40 


CCATTTCTTT 


m /WMMW m _LL- 

ACTTTCACTC 


ATTACGAATC 


TCCCTATCTC 


ATGTTTATTA 


AAGTTTTGTA 


7920 




AAC 1 AAAAGC 


TGTTTCTCTG 


TAAAATCATT 


TTTCATTATT 


ATGAATATAT 


CACAAAACTT 


7980 




TATTTCATTG 


TCGTATATTC 


AATGAATTAT 


CATAACAAAA 


TTATCAACAC 


ATTGTCATTG 


8040 


45 


AATACTAGAT 


TTTGATTAGA 


ATATTACGAA 


ATTTCATATA 


AACATTATAC 


TACTATTTGA 


8100 




GATGAACATC 


GCATAACAGT 


AGAAAAATCA 


TTCTTATCAT 


ACACATACAT 


cttcattttt 


8160 




TATGAAGT7C 


ACATTATAAA 


TATATTCAAC 


ATAATTGTCA 


TCTCATAACA 


CAAGAGATAT 


8220 


SO 


AGCAAAGTTT 


AAAAAAGTAC 


TATAAAATAG 


CAATTGAATG 


TCCAGTAACA 


AATTTGGAGG 


8280 




AAGCGTATAT 


GTATCAAACA 


ATTATTATCG 


GAGGCGGACC 


TAGCGGCTTA 


ATGG CGGCAG 


8340 
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GTAAACTCAA AATATCTGGT GGCGGTAGAT GTAACGTAAC TAATCGATTA CCATATGCTG 84 60 

AAATTATTAA GAACATTCCT GGaAATGGGA AATTTTTATA TAGTCCCTTT TCAATTTTTG 8520 

ATAATGAATC CATCATAGAT TTTTTTGAGT CTAGGGGTGT TAAATTAAAA GAAGAAGATC 8580 

ACGGGCGTAT GTTTCCAGTT TCCAACAAAG CACAAGACGT GGTTGATACA TTAGTGACAA 8640 

CTATCGAACG CCAACATGTA ACGATTAAAG AAGAAGAAGC TGTTAGTAGA ATCGAAGTTA 8700 

ATACAGACCA AACTTTCACT GTACATACTC AAAATAATAG TTATGAAAGC CATTCGCTAG 8760 

TGATTGCTAC AGGTGGTACA AGTGTCCCTC AAACTGGTTC AACTGGTGAT GGTTATAAGT 8820 

TCGCACAAGA TTTAGGTCAT ACCATTACTG AGTTATTCCC GACCGAAGTT CCAATTACAT 8880

CAGCTGAACC TTTCATCAAA TCCAATCGTC TAAAAGGTTT AAGTTTAAAA GATGTTGAAT 8940

TGTCAGTACT TAAGAAAAAT GGTAAAAAAC GCATCAGTCA TCAAATGGAT ATGTTATTTA 9000 

CTCATTTTGG TATCAGTGGT CCAGCTGCAT TAAGATGTAG TCAGTTTGTT TATAAAGAAC 9060 

AAAAAAATCA AAAGACACAG CACATTTCTA TGGCAATCGA TGCATTTCCT GAATTAAACC 9120 

ATGAACAATT AAAACAACAC ATCACATCAT TATTATCGGA CACACCAGAT AAAATCATTA 9180 

AAAACAGTTT GCATGGTCTA ATTGAAGAGC GCTACTTACT GTTCATGCTG GAACAAGCAG 9240

GAATCGATGA AAATACCACA TCACATCACT TATCAAATCA ACAATTGAAC GACTTAGTAA 9300

ATATGTTTAA AGGGTTTGTA TTTAAGGTGA ACGGGACATT ACCTATAGAT AAGGCATTTG 9360

TCACAGGTGG TGGTGTGTCA CTTAAAGAAA TTCAACCTAA AACAATGATG TCTAAATTAG 9420

TTCCGGGATT ATTTTTATGT GGTGAAGTAT TAGATATACA TGGTTATACT GGTGGTTATA 9480

ATATTACAAG TGCACTCGTA ACAGGACATG TCGCTGGATT ATATGCCGGA CATTACTCAC 9540

ATGCATCAAT GGAATAATAG TATAAAATTT GGTTCGATTC TCTTTAGTAG ATCAACTTTT 9600 

TCATTCAAAT AAAAATGACC TTAATATAAC TGAGTCACTA AAAAGTGTCG TTATATTAAG 9660 

GTCATTTCGT TAATTATGAT TCTTTTTCGT TTTTAGTACG TCTTCTAGCT AACAAAGCCG 9720 

CACCTGTAAT CAGTG CAAAT TCTTTCAATG GTAAATCCAT TCCTTCAGAA CCTGTATTTG 9780 

GAAGTTCTTT TTCAACTTTG CGCGATTCAT GTGTCTCTTC TTTTTTAATA GGCGTACAAA 9840 

CTTTTGGAGC TGGCTGAATT TCTTTTGGTG ATACTTTCGT CGCTTCAGCT GGTAATTTAA 9900 

TTGCTAAAAT TTCATCAACA ATGAATTGCG TGTGTTGTTT GATGTCATTT AATGTCGCAT 9960 

CTTCATCAAT CATTCTATTG CCATCTGCAA CATATTGATC AATTAATACT TTTACTTTAG 10020 

CTAATTGTTC TGGTGTTGCG ATCGCTTTGA ATTTCGCATA TGTTTGTTGA GCAATGTTAT 10080

CAATTCGCAG TAAGCTATTT TCTTTTTCAG TAATTACTGC TTCTATATCG CTTAATGCAA 10140
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CATCCATTTG TAATTTTAAA GCAGTTATAG CTTTTAATGC ATCAGCCTTA TTACGATTAC 
TTACTTTTCG ATAATTTTGC ACTAAAGCAG TGACGCGTGC AAGATCATCA TTAATCGTTT 
TTTCAGCATC TGGCTTTTTA ATAGGATGTA CATCTAAATC ATGTATTGTT TGTAGATTTA 
ATGATGCTGT TTTATCAACT TGTGCATTGC TACGATCTTG ATCAATTTGT CCAATAGCAG 
TGTCATAAAT ATTTTGTAAC TGTGCTAATA TACTATTTCT TTCTTCTACC GTTGCTTGAA 
TATTCGCTTC AATTGCTTGT TTTTTATCGT TGAATAATGT TGTCAATTGT TCTCGAGCAG 
ACGCCTTTCT GTTAATAACA GGTTCGATTT CACGAATTTC GTTTTTCTCA TCATGCAATA 
AATATGCCAC ATCTG CATTA GTCACTGCAC TAGCAATTTG TTGTTTAGCT TTAATTAACT 
CTTTTTCAAC TTGTGCTATT GCAATATTTT GTTCTTCATC TGTCGCTTCG TTATTTGCTT 
TAATTAAATT AATTTTATTT GTAGCGATAT TTTGAATTTG TTGTAATGCT GTTGCTTTAA 
CTGTTGTCGC TGGTTTAATT TTTGAAATAA TATTTTGAGC ATTTATACTA TCTTGATTAA 
CTTGGGCAGT CTTATCTGCA TGATTGATCT GATCAATAGC CTGATTAAGT GCTTGTTCTA 
CTAAATGTTT AGCAGCTAGT CTTTCTTCTT CAGTTGATAA ATCGCTTTGA TCGATTAGTG 
CATTTTGAGC TTCGGCTTTT ACACCAACAG ATTGACGCGC TGCTGGTTTA ACTTGAACTT 
TAGGTAAAAT CACTTTGATG TTGTCGTTGC CATCAGTCnC AGTnCGATCC ACTTCTGCAT 
TCGTTTTGTT TTGTG CAATG TCATTT 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11126 
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45 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

TTGCCCCGCA CGGCGGTGTG nTTCCTAGAA ATAATGAATA TAAAGaGAAA TATATAACAA 60 

CGATTTTGAA TTATGAACCT GGTGATATCG TTACAATCAA ACGTGTGAGA GATAAGACCG 120 

ATTTGCTAAT ATATTTGTCT AGTAAAGATA TTTCTATTGG TAATGAAGTG GAAATTGTAT 180 

CGAAAGATGA AATGAATAAA GTAATTATCA TTAAACGTAA TGATAATGTA ATTATTGTCA 240

GTTACGAAAA TGCAATGAAC ATGTTTGCTG AAAAATAAAA TAAAGAAGCC ATAAAGATAT 300 

CCATGATTGA ACTGATAAAG ACATATGGAT AATTGCTTTA GGCTTCTTTT TTATTAGTTA 360 

ATTTATCAAG TGAGTATATT TGAGTAAAAT ATTCACTGCA TAAAGATTGA AGATAATCCA 420
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CTGTGGACTC 


GGACGCTGGA 


Tk TV S**T*/ m * TV TV 1 WTt 

AAGTCAATTT 


TV /— » /— ♦ TV TV TV^/^ T^/*^ 

AGCAATCG x C 


r* Tv TA /"••p TA J*** » rrw-p 
LAAv. X /\Vj/\ X 1 


Vj X nvjnrW X 4. V— 






< Tv t\ * TV »t» 

GCCTAA 1 AAT 


TV f~* 7\ /" > ^»«T» TV TV TV TV 

ACACCTAAAA 


<-p Tk rt-iTV qyTNp TV *T> TV 

T ATATU. \*A 1 A 


tv /•*TV , R,T , TY" , *IY^ 

AL iUU lVj X \» 


AvJi/ivj X M.vj X X 


HAATTTCTAC 


ouu 


5 


TATATTTTCA 


TL X~X~x~x AATA 


rp 71 t\ 7k TV T* TV f*** TV TV 


CA X viA 1 AuAA 


AX 1/vviul XX 


TAACAACAAT 


CCA 




GGGTGAGCCT 


TTTC t AGATG 


TTAAAATTAA 


AAAATAAAI A 


AA X A X X A 




/ ^ 0 




TATAAAGAAA 


CTAGG T ATCT 


GATAATGGCT 


/"V TV /•"/-» /'T* TV 7\ TV 

CGACGCTAAA 


r* f*T Ti T & 71 T 7i 
CC IHlUnnln 




Tan 


10 


TGGCATAAAA 


»T*Tl TV TV /"*»('"• TV JV 

TAACCACCAA 


TCGTTGTAAG 


CCA I XAioCCX 


v»t- X /\OxV ivjiv. 


X nun x x w 






AATTGCGAAT 


CCTTGTTGTA 


ATGTCTGTTG 


TCGCTCTCGT 


LtVji>\v_ X lul J. t\ 


PAATflArTAA 


Q fl fl 


IS 


ATcITTTGCA 


CGGCCACCAG 


CGAGTTTATT 


TV A TV J\ /"^*r* TA TV 

AAACAG 1 ALA 


X oAv_v-Att>\ X X 


v^nlvJlUl Xn/\ 




AACAGGGATA 


rr* TV otrwiwn TV TV TV TV 

TAGTTTAAAA 


#TV~1 TV TV >T>/ MM TA T\ 

TGAGATCTAA 


TV T^TV /"" I" l' 1 /^ T\ TA TA 

ATAG1 TLAAA 


AV— . ALjvj^— X 1A1 


V7 1 V. 1 /-^ X X X V-J 






ATGAATAGCA 


ATATAACAAG 


CTGCAACAAT 


TV TV TV fT^ TV TV ftV^ 

AACGATAAIaj 


rpTv T^Tv T'TV »r»«T' TV TV 

lAlAIAi 1AA 


X iuAni, X X 


1UOU 


20 


CGTATTAAAA 


AAGTTTGATA 


AATAATTCAT 


TGTTAACCTC 


A X A X Anb A X A 


■ j — 1 * Tv TV ■ I w I **X* TA TA TA 
X lnnl X ±f\mn. 






GTTTGCTTAT 


CACTTATTAT 


AAATGATATT 


y-1 /-^ y^ TV TV TV TV 

GGCATCAATA 


■»■ M 1 1 TV TV / " |» 

VjC^j 1 X AbAL X 


TT TA CZ TA PTT 2i 
X XAVjALJ. xav. 


lzuu 




CTTAGTTAAA 


CTAATTTTAA 


TTTTTGAAAA 


ggtgaatatg 


TGTTAAAATA 


AAGCAAAATC 


1260 


25 


ATTTCGATAT 


AAATAGGATG 


AATATAAATA 


CTG TT AAT AT 


TGATTACACT 


AACAX AAXAA 


X. u 




TGAAATAAGA 


TAGGAGATTC 


CTGTTATGAC 


TGTTGAAGLAA 


AuA 1 LLAA X A 


f^^CfCh. TA TAr^T 
CAVjCCAAAVj x 


i t o n 


30 


TGACATTTTA 


GGGGTCGATT 


TTGATAATAC 


TV TV TV TV «TV *»l »1 */""» 

AACAATGTTG 


/~i tv TV TV TVPTTT 1 

CAAAivjCjl lv» 

/-^rp 71 TV /""» TV (^<~* Ti. 

Kj 1 AALAbL CA 


TA & TA T! r P7A r T*T*7A TA 
AAAAXAX X AA 

TA C , C*C r VC TA TA 7AT" 
nLV, C X Vj/W\ X 


1 A A f\ 
1DUU 




GCAAATCAAT 


CAACGAATAA 


T L'iTrri'ATA 




AGTGAATTAC 


GCGACGACAC 


ATCAAGCGTA 


TTTAGAGTTA 


ATAAATCAAG 


CGAGCTATAT 


1 ft 




TGTTG CTG AT 


GGGACAGGAG 


TAGTCAAAGC 


TTCGCATCGT 


TTAAAGCAAC 


CTCTAGCGCA 


T £0 ft 


35 


TCGTATACCT 


GGTATTGAGT 


TGATGGATGA 


ATGTTTGAAA 


ATTGCTCATG 


TAAATCATCA 


T fiRft 
lOOU 




AAAAGTATTT TTGCTAGGGG CAACTAATGA AGTTGTAGAA GCGGCACAAT ATGCATTGCA 


1 74 ft 


40 


ACAAAGATAT 


CCAAACATAT 


CGTTTGCACA 


TCATCACGGT 


TATATTGATT 


TAGAAGATCA 


X O \J \J 


GACAGTAGTG 


AAcGnAnTTA 


AACTGTTTAA 


ACCTGATTAC 


ATATTTGTAG 


GTATGGGATT 


1860 




CCCTAAACAA 


GAAGAATGGA 


TTATGACACA 


TGAAAACCAA 


TTTGAATCTA 


CAGTGATGAT 


X 7 4& V/ 


45 


GGGCGTAGGT 


GGTTCTCTTG 


AAGTATTTGC 


TGGGGCT AAA 


AAGAGAGCGC 


CTTATATCTT 


X J O \J 




TAGAAAATTA 


AACATTGAAT 


GGATATATAG 


AGCATTAATA 


GATTGGAAAC 


GTATTGGTAG 


2040 




ATTAAAGAGT 


ATTCCAATAT 


TTATGTATAA 


AATAGCCAAA 


GCaAAAAGAA 


AAATAAAAAA 


2100 


SO 


GGCGAAATAA 


TCATGATGAC 


AAAAATAAAA 


CCGAGGAAAT 


CCTTAAATGG 


AGATTCTCGG 


2160 




tttittcggt 


TTATTTAATA 


ACGAAGCGGG 


ACT CATCGAG 


TTTGTTTCTA 


AATTCTTTTT 


2220 
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Av~V_\_» Inn X L. X 


1 1 1 AACTTTC 


CGCCTTCAAT 


C CAAG CAATC 


•W** IV / Wl 1 IV 1* IV Tk 

TTAGTACAAA 


2340 




ATTGTCTf'AP 


X lo ILL I AMlj 


X X A X GACTAA 


CGAAAAAGAT 


GGTTTTGTTT 


TGCTCTTTAA 


2400 


5 




^P^P^P A ^P/ 1 ** I ' A A a 
X X lAlLlnnn 


CATTTTTGTG 


CAAAAGTTTG 


GTCACCTACA 


GAT AAAG CTT 


2460 




CGTCAATGAC 


1 AAOA inXL I 


f^f* A TT ft ft /**■!»/ * 

wAl X AAL1G 


TGATATTAAT 


qirt ft ft ft ft ft ^*/^ TV 

TGAAAAACCA 


AGTTTTGCAC 


2520 


10 


vWvl Awl_Alw 1 


X un/ilnL I X X 


M*!' ft ft OTPPT"P 

1 1AAL I GGXX 


GATAAATAAA 


CTCACCAAGT 


TCACTAAATT 


2580 


CAATAATPTT 


nbu lul Ln X I. 


^"*/ "I" I'M* A A ' 1 1 w 1 1 

XXX AA 1 1 X 


i_ x~x 1TCGCTT 


ft ft * tv ?\ 

AAAG CCCATA 


IV RID IV ^% tk >»<«WM 

CATAACATTT 


2640 




TAAATTfViAT 


* *p*p*wi»p ft ft ^P^^ 

AX 1 x 


lu X AAG x X 


GTCCACTCAA 


GCCAGCACTA 


ATTGCGATAA 


2700 


15 


v-UU X i IL 


A ft** A TT A /"'/'"' ft 


TC CA CTTTG C 


^1 » Tk m /nil m /"^ 

CAACAGTAGG 


fk ^1 Tk IV IV IV fk 

CGACAAAGAA 


CCGCCAATGA 


2760 




X AX luviV^nA 


1 l\iA X X XG 


CCGGAACCAT 


TGATGCCAAC 


AAGCCCTATG 


ACGTCACCTT 


2820




^ A ^R*TV^^'*t 1> I w 'P 
UninloLX 1 1 


T A A A PT A A TP 

X AML xaaxg 


*TV*AT*/"**T* ft ft ft/** 

TCATCTAAAG 


* fk jk ik ik nr/i rr> 

CGAAAAATGT 


TTTG1 1 1 1 l'A 


TGTTTGGGAA 


2880 


20 


X wnvLuLn x v. 




TCTTTATTTG 


TACGATAAAT 


ACGATATTCT 


TTTGTTACAT 


2940 




xxx lxvi X\j X 1 


X AU.UGAAAvJG 


TT CATTTGTA 


GACCTTCCTT 


ATTCACATTT 


ATCTAGATTA 


3000 




*T7i 71 TUT A. OTA 
1 AA lnlAL 1 A 


^*TV^ A ft ft /"•T^T" 


GTTAAATTTT 


AAAACCTGTT 


GTAAAGTGTA 


TAGAAGATTT 


3060 


25 


Ti^T^PA^PTTA TO 
XVJ 1 X AX X AX V. 


AGAG X VjGG 1 G 


TTTTGACACA 


IV ft ft ^/^fTVTl n ik rr^ 

AAATGTTAAT 


CATCAATGAT 


AACAATGATA 


3120 




^F^^^O h K A A h O 

X 1 l/ww\L X 


AAACTTATTT 


CAACTTACAT 


GATTGTATAC 


TATAATGTAT 


TTGTAATAAA 


3180 




<- IMAIHI 1 X X 


ft ft ft/* ft TV r y T s ft P 

AAAGAACTAG 


ACAATAATTT 


TGATAGCATC 


CATGTATAGT 


GATAGTATTT 


3240 


30 


AvJuVLAAl X A 


! i ft rp 7\ * rp ft p»p 

X X ATAATACT 


ATTTAGTTAA 


GTAGAGAAAT 


AGTTAAACAT 


TTGAAAGTGT 


3300 




GGTTTAATGG 


AATGTCAGCA 


ft T 1 ft f~*{~* ft IV /"» iv /-» 

ATAGGAACAG 


TTTTTAAAGA 


ACATGTAAAG 


AACTTTTATT 


3360 


35 


TAATTCAAAG 


ACTGGCTCAG 


TTTCAAG TT A 


ft ft ft ri x i >■> m/^ ft tv 

AAA X x AT CAA 


TCATAGTAAC 


TATTTAGGTG 


3420 


TGGCTTGGGA 


ATTAATTAAC 


CCTGTTATGC 


AAATTATGGT 


TTACTGGATG 


G , l"i"X"i~i'GGAT 


3480 




-TAGCSAATAAG- aagtaatgca 


CCAATTCATG- 


GTGTACCTTT^ 


TG TTTATTGG 


TTATTGGTTG ~ 


~ 3540 


40 


GTATCAGTAT 


GTGGTTCTTC 


ATCAACCAAG 


GTATTTTAGA 


AGGTACTAAA 


GCAATTACAC 


3600 




AAAAGTTTAA 


TCAAGTATCG 


AAAATGAAcT 


TCCCGTTATC 


GATAtACCGA 


CATATATTGT 


3660 




(2) INFORMATION FOR SEQ ID NO: 173: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13868 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
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ATTAATCACT TGTTGTGTAG AGTCTTGTCC GTTTTGGTTA TGATTGTTAG CCATGATATA 120 

CCTCCCTTAC AACACTCGTG GACCAGAAGT TTTCTGATCT CTCACATTAA CTTCTAACTT 180 

ACGTACTGGC ATTTCTGTGA AATATTCTAC ATTCTTTTTA ATATCCGAAC GAATTGCTTC 240

AGTTAAAGAT TGAACTTGAA CATTATTTGG TACGAAAAAG TCAGTTTTAA TGTCGATATA 300

AGATTTATTT TTTTTGTTAT ATAGTTTCGC AACTACATTT GGTTGTCTTA CTTGATCATA 360 

TTTTGCAACC GTATCGAATG CCGTCTTTTC AACAGCTTTA CGAGATACGT AAACATGACC 420

ATCATCGAAG TCTTTGTATA ATCCAGGTTT TCGATGCGTA GGTTTGAAGA TACTAAATAC 480

TAATATAAGA CCTATTAATA TCAATAGTGC AGCAAGTGAA ATAAGTAATG GTTGGAACCA 540 

TTCAAATTGA AGGAAGTAGT CTTGATATTC AGTTATACGT CCATCTTGGA TATACATGAA 600 

TAACAGGAAC CCCACGaTTA CTACTATTAA TAAGCCAAGG ATAAAGTTTT TAAGTCGTTT 660 

CACCCCTAAC GACACCTCCT TAGTTAAAGT TAATTTAAAA A CAT ATT AAA TATGTACCCA 72 0 

TCAGTTTTTT TCTTAAACAT AATAAATTAA TAACTTTAAA TTTATTTTTA ATATATAAGA 78 0 

TGAAGTACCA TTTAGTAATA TATTCCCTAG TTTTTGTAAA TAAAACCTCA TTATTAATTA 840 

25 ATTyTCGTCA ATATGTTTTG AAGAACGATA TTCTAAAATA TCTGGGTCAC GATGTTTAAT 900 

TAAAACCTTA TTACTATTTC TCGGTTTCTC CTCACTCAAA GATTTTATAA GCGACCATAT 960 

CAT CG CT AT A ATGACCACGG AAAATGGTAA CGCAGCAATG ATTAATAAAT TTTGAATTGC 1020 

30 TTGAGTACCA CCTGTGTAAA TCATGATGAT TGCAAATAAT G C CATAATG A TACCCCAACT 1080 

CACTTTGACA AATGACTTCG GATTAATATC ACCACTTGAA CTCAACATAC CTAAAACATA 114 0 

AGTTGCCGAA TCCGCTGATG TAACAAAGAA AATCATAATA ACAAGTAAAG TAATTAAGCT 1200 

TAATACAAAA CCTAGCGGAT AATGTTGTAG CGTCGCAAAA GTTGCTGTTT CTGTCGCAGC 1260 

TTTASCAATA TCGGCAATAT GATTATCTTG TAAGTAAATT GCTGACGCGC CGAATACCGC 1320 

AAAGAATATA AAGCAAACTA ACGCCGGGAC AAAAAGTACA CCTAGAATAA ATTCTTTAAT 1380 

CGTACGTCCT TTTGACACAC GTG CAAT AAA TATACCTACA AATGGTG CCC AAGATATCCA 144 0 

CCATGCCCAG TAAAAGATTG TCCAATTTTG TAACCATTGG AATTTTTGAC CACCTGTCGG 1500 

AATGCGTAAA CTCATACTAA AGAAATTTGC AATATAATTA CCTAGAC CAT TCGTAAATGT 1560 

ATTTAAAATG TATAGCGTTG GCCCAACAAT AAAAAGACCA ATAAGTACTA CAAAAGCAAG 1620 

TAACATGTTG AT ATT ACT C A ACGTTTTGAT ACCTTTATCG ATACCTGACC ATGCTGACCA 1680 

SO AGTAAATAAT ATGGTTGCAA TGACAATCAA GATTACTTGC ATCGTGAAGT TACTCGGTAC 1740 

ATTAAATAAA AAATGTAAAC CTTCGTTTAT TTGCAATGCA CCGAAACCTA ATGTTGCAGC 1800 
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CATTGCCTTT TCACCTAATA AAGGCGTCAA TGTAGCG CTG ACTAAGCCAG GATATCCTTT 192 0 

ATGAAAGCTA AAATATGCAA ACACTAGCGC GACAATACCA TAGACTGCCC ATGCATGAAT 1980 

CCCCCAATGG AAAAATGAAA ACTGCATTGC ATCATTAATT GCAGATTGCG TGCCAGCTTT 204 0 

ATGAATAGGC GTTAATTTGA AGGCATGACT GATTGGTTCT GCCGTTGTCC AGAACACAAG 2100 

TCCTATTCCC ATACCAGCAC TAAATAACAT AGCAAACCAA GACGGCAATG AGAATTCAGG 2160 

ATCTTCGCCT TCTTCACCTA ATGTAATGTT TGCGTATCTC GAAAATAAAA TATACACACA 222 0 

GACAAATAAA ATAACTAAAA CGAGCAATAA ATAATACCAA GAAAAATGTA GCGCAATAAA 2280 

TGTAGTAATG TTTTGCGTGA GTTTTTCTAA CTGTTTCGGA AATATTGCTC CAAAAGCAAC 234 0 

AAATATCGTA CATATCACTA AAGATACCCA AAACACTAGA CTTACTGATT TATTTTTCAT 2400 

AAATACAAAC CCTTTCTGTG TAATGGTAAG TTCATACCCA TAACTGCAAC ATTTTAATCA 2460 

2Q TTTGTAATTT TATATAGACA CAATTAATAA TGCCTCATCT TTTAAAAATG ATATATAAAA 2520 

CACACTCAAA TTATTTATCA TTGAGCAACA AAGTATTTTA TTGTATTTAA GTAATGCCTT 2580 

TCTAGTGCAT TATTGATTTG ATACCTGCAA AGTTGCCATA TTTCCGTTTA GAATCAATAG 264 0 

25 TCGCTAGACA CAAAAAATAA GTCGCCTATA CAGTATTTTC TGCATAAGGC GACTTTACTT 2700 

ACTAATCTAT ATATTAATTA CTAATTTTCC AATCATTGAT TGTTTTTCCA ACAATTGATG 2760 

TGCTTGATAT AAGTTTTCAG GTGATAAACC TTCAAAAACT TGTGTCGTTG TTGGTTGGTA 282 0 

ATGCCCTGAT TCTATATTTT TCGTAATATC TTCTAAATAC TCATGTTGTT TAATCATATC 2880 

AGGCGTTCGA TGAATTGGAC GCG CAAACAT AAATTCATGT GTAAATGTTA TACTTTTTAA 2 94 0 

TTTTAATGCA TTTAAATCTT GATCTTCATT AAAAGCTACG ATAGTCGTAA TATGCCCTAA 3000 

TGGTTTTATC AGTTCAATCA TAGTATTGTA ATACAAGTCT GTATTATAGG TGCAAAATAT 3060 

ATAATCTACT_ AATGGAATTT-CTTTAAATTG ACGCACTAAA TCCTCTTTAT- GATTCAATAC 3120" 

GATATCTGCG CCCATCTTTT CACACCACTC TGTTGTTTCT TGTCGTGATG CTGTTGTAAT 3180 

GACAGTTAAA CCATACCGTT TAGCAATTTG AGTGGCTATA CTGCCTACAC CACCGGCACC 324 0 

ATTAATGATT AAGACAGACT TCCCTTCGTT TTCAGCAGGA TTCGTAGAAA TTTTAAATGT 3300 

4$ ATCAAAAAAC GTTTCATATG CCGTAATACC AGTTAGCGGT AGACTAACCG CTTCATTAGC 3360 

ACTTATGTTG TGTGGTGCTT TTGCAACTAT AGCTTCTGAC ACCAATTGAT ATGTCGCATT 3420 

TGATCCTTGT CTATTTGGCG ATCCAGCATA AAATACAACG TCACCCGGAC TAAATAATGT 34 80 

SO AACGTCTGGT CCGATAGCTT CAACAGTACC AATAGCATCA AACCCAAGTA CACGAGGTGC 3540 

TTGAGTGACT TCCATTTGTC GTTGCTTTGT ATCTACAGGA TTTACACTAA TGCTATTTAC 3600 
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ATTTCCTTCT 


TCCAATTTAA 


AGGGCTTCTC 


AAATCCTATC 


ATTTT CAT AT 


CGTTTCACCT 


3720 




CATTTATGAA 


CTTATTTCTT 


ATTATACAAA 


ATAGAAGCCA 


TGTGTGCTTA 




3780 


5 


TCATGACTCC 


TTTTTCATTT 


GAATATATAA 


ATAATTACAG 


ACGACTTTCG 


TATTAAATTT 


3840 




TAGACTTATT 


TCTACCATGT 


TGCTGAACAA 


ATTTACTTTA 


GATAAAAAAT 


TATTAAATTT 


3900 


10 


TGGTCAATTA 


ACAAAGTTAG 


TTTGTTAAAA 


CGTgATACTT 


TATTATTCCG 


TTACTTTAAT 


3960 


AACTTGTTTA 


CCAAAGTTAT 


CGCCAGTaAA 


TAAA\rI-rTTA 


AATGCATGTG 


GCGCATTTTC 


4020 




AAAACCATCT 


TCAATGGTTA 


CTTGTGACTG 


AATTTTACCT 


TCTTGAACCC 


ATGTTGCAAG 


4080 


IS 


CTGTTCACTA 


GCTTCTTTAA 


AAG CATTAGC 


GAATTCACTT 


ACCAAGAAGC 


CTCT CAT CAT 


414 0 


TACTTGCTTC 


TTAATAAGCG 


TACCTTGAAT 


ACGTGGTCCG 


ATATCGGCTT 


CAGGATGATT 


4200 




ATATGACGAA 


ATTGCGCCAC 


ATACTGGTAC 


ACGTGCAAAA 


CGATTTAAAT 


GCTTAAATAC 


4260 


20 


TTCATCGCCA 


ACTGTTCCAC 


CAACATTTTC 


AAAATAAACA 


TCAATACCAT 


CTGGTACTGC 


4320 




TTGTGCTAAC GCTTCTGCAA AATCCTCTTT CTTATAATCA ATACCAGCGT CAAAGCCCAG 4380

TGTCTCTGTT AAATAATTTA CTTTTTTGTC GCCACCCGCA ATACCTACTA CACGGCAACC 4440

TTTAATCTTA GCAATTTGAC CTACAACTGA ACCTACAGCA CCAGATGCAG CTGAAACCAC 4500

AACAGTATCA CCGGCTTTAG GTTGTCCAAT ATCAAGCAGA CCATGATATG CTGTTTGTCC 4560

TGGCATTCCT AAAACACTTA AATATAAATC AAGTGGTACA TCTGTCGTTG GAACTTTAGT 4620

AATTTGATCC GCTTGGACAT GATTAATGAT TCGCCAAGGC AACATACCTA CAACGACATC 4680

tcctttttta TAATCTGCGA GTGTCGAATC AATTACTTTT GCAACGACAT GGCTAACAAT 4740

CGGTTTACCA ATTTCAAAAG GCTGTACATA CGAATCTGCC TTAGTCATAC GTCCTCTCAT 4800

ATATGGATCC ACTGAAATAT ACAGCGTTTG TACAAGTACA CCATCGCTCT CAAGTTTaGG 4860




CGTGTCAATC TCTTCaATTT TGAATGTATC CTCTTGAGGC ATGCCkTCTG GTATTTTGTT 4920

AAAAAGAATT TGTTTATTTT GCATCATTAA TCACCTTTCT TTATTTGAAA ci'i i rACTTA 4980

TTTGTTACTT AAGCGTTAAG TTTGAATTGT GTCtTCGTGA TGTCTGTATG CAAATACATT 5040

CTTAGTTGTT ATATTTTGAC TTAAGCACTG ATTCATTCAT GTAACTTCAA CCACATTATA 5100

TTTGCTATAA TCATAAATTT AAAATGTTAC GACTTAGACA TTTTATGGAA ACTCTCAAAC 5160

AATAGATAAT TTTTGAAAAG CTCTAATATT ACAAGCTTTT TTGCCCCAGA AAAACTAGCA 5220

GTTGCTTTAT TTTTTCCATA AGAAGTCGAT TAACTCATTA GCAACATTTT CATTCTCATG 5280

AAGCTGACTA TGTTGTGCAG GCTCACCTTC ATATTTAGAT TCTCGATAAC TTTTCGGACT 5340

ATTTCCCAGT AAATATTTTA ATGATTTCGA AGAACTATTA GACACTCTGC CGTCTGAATG 5400
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ATCTTTTAGC 


APGPGT AATT PP'TV* an a/y 


TTGATTCATT 


CGACTTGGTT 


TAC CATCTTT 


5520 




ATCAACTGTA 


ATTTPATTGA PATPTTPATT 


LATATTTAAA 


ACAC CATT AA 


ATGTCCCTGC 


5580 


5 


AATATTCACT 


TGTTTRTTTA APTGTGGPAP 
* w x x iui iln /\v_ io 1 \Jf\JV»«M.Lj 


x UACxTGTCG 


i 1ACCATATG 


TCATCATATA 


5640 




TTGTGCAAAT 


GTTAAGTTPP rrtTTYia^Tfi 


a f^f^r* TV /"'•TV TV TV /""« 


TTGAATTTAT 


CGAAATTGTA 


5700 




TTCAG ATTG T 


*»*vv— . x x nu x v__fk \j X /"Yv_/A X X X X X 




GCATTCTTAT 


CCAAATAGCC 


5760 


10 


TTGTCTGTTA 


TTTTCAAGTT PAATTTTPAP 

x x x x v»-»v^w x x V^nn X X X X V^AL 


a a t a /"v* a tt^* 
AM X AvjO A X i L 


ACTGCATCTT 


TTCTTAGTTT 


5820 




CCCTTTAAAT 


GTCAPTGPAP PATPPTTTGA 


a afVT'a Aooh 
AALo 1 AALjL_A 


G TvjATGATAT 


CTTT AG TT A C 


5880 


15 


CCCTCTTTTT 


TCTGPTTGPT TPAPPATAAA 


L- ilTl'UiUAA 


TTUGCACTAC 


CACCAAATCC 


5940 


ATGTAAGAAC 


AATGTTGGAA TTfinPTTTTT 
iui l VJvWVl X X uUV. X X X X X 


a»p»» aiwppn 
AAL-AAAX ioL 


TGTTGTTGTA 


TTTTAAATGT 


6000 




TTGTGCPTG T 


PGTTGAPTAA art r*f^ a ct* a t 1 

vol iVAriLlAA AL-Av_l— Al_VJAX 


a a T" a a T*a^ t\ /~» 
AA X AATAGAG 


^^^>nTa Ilk m m 

CCTATAATAA 


TAGCGACCGC 


6060 


20 


TAACAATGTC 


GTAATAIVTTli PA & IV A RTTTT 


CTTCACACTT 


TTAACTC CCA 


TTCATGTCTT 


6120 




TTATATAAGT 


a t A A a rzn a t 1 /^ *ty~* a tt a a a a tv 

X.MAAtj\jAi.\j luAl 1AAAAA 


TGTCCTTTAG 


TTGATTTTGA 


ATACATCATT 


6180 




AACTTTTAAG 


AXVmvi 1 IWJ nnnvj X X vj X \~ L~ 


GTTAACGTTT 


GTTAATTGAT 


TGCTTCTTTA 


6240 


25 


GCTTTCAATG 


GTGTGTPAPP PaTTYiaTTR a 


T»a*panr«a a tv t»tv. 
lAlAlAAAIA 


TGTATATG CA 


TGTTTAATTT 


6300 




ATCTCTTTCT 


ATAAATAAAP aprTaprarr' 


AL 1 CCsACTGA 


TAGGCCITIT 


AATATCTATA 


6360 




ATT ATTT AAT 


XXL-XXXXVjXX ILuuLlnALi 


CTTTGTAC CA 


ATAAG CA CTT 


TTCTTAGGAT 


6420 


30 


AAPGTTPTTG 


*va 1L1 LAAAA X V_VjA(JA X AbA 


TV «T» T\ TA TS Z"'/^' Tv *w * 

ATAAACCATA 


TCGTTTTTCA 


TAACCATTTG 


6480 




A C CAAGAG AA 


PAPATPPATT Xi aTParra a a 


«r> i\ ti a^»T*a x /^i /-» 
I AAAbTAAC C 


TTT AA C ATTT 


GCACCATCTA 


6540 




T AAT AG CAT C 


TG P A A T A A PG TTP AAA TfiTT 


ulLl 1AUAIA 


Tv ^^^^ 7v tv ^ r^i *< i * 

ATCAATACGT 


GCATCATCAT 


6600 


35 


GAACTGTTTT 


TTP AG A TTP A ATA AATTPAT 
* x vnvn X X V^rV j% X Ann X X \_A X 


L-X 1 lAiAlLL 


TAAACCATTT 


TCAGTGATAT 


6660 




AAATPTTATG 


at AP'PT'AfSP A Th fiTPTTTi R 
a. x.n,o 1 lrtuurt -lnftl-L-1'i Jl/iA 




GaTTTGATCA 


TATAAAGCTT 


6720 


40 


GAGGATAGAT 


PATPPAGTPP pagtptptpp 


o apyrarpTP 
vjA*jv» X AL-vj X L. 


T\ TV T"/ - * TV TV TV T» 

GACATCAAAT 


TCACGTTGTC 


6780 


CGACACCTTT 


AAGTTGGTAT TT AG A A PPC2P 
■t^-rwj x x VJ\J 1 1\ X XI /-Vji-»At-HLSL- 


Lll 1AILALL 


ill/ 'in/v^/^TtTTT. 

1 GTCGCZATTA 


TG CGTG ATTT 


6840 




CAGATTPTP P 


ATPGTA APPT fTPATrva AT 




GTAATTGATA 


CCTAAGAAGT 


6900 


45 


CGTTTAAATP 


T* v pf2/"2 / "t/"*/"* a TPTaaaxTPr 
1 1 IVjVjLIuLA ILlAAAAXoHj 


^htv*tp7v tv np/*»<T»n '/-< 
CATAATCTTC 


ATCTGTAATG 


TTTAATTTAC 


6960 




CGCCATTAAC AGATAAGATA TGTTGCACAC CTTCCATCGT TTCACGAGAA TACTTACCTA 7020

AATATGTTGC ATCTAAGATG AATTTATTAT GGATGATATC TTCTAATTCT GCTGCACGAA 7080

CATCTTCAGG ATTTGATGGA TCGAACGGAT ATTTTGTTGG CAATGCGTGT ACAACACCAA 7140

TTTCTCCTTT GTATCCGCCA TCTTTAAATA ATTTTACTGC TCTAGCATGA GCCACCATCA 7200
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CTACTAAATA TTGACCATCA CCAATAGGTC CAATTTCATT GAATGTAGTC CAATATTTTA 7320 

CTTCTGGGAA TTCTTTAAAA CAATATTCAG CATAATCTAC AAAGTAGTCA ATCGTTTTAC 7380 

GATTTAGAAA ATCGCCATCT TTGTGTAaCA CTTCTGGTGT ATCAAAATGA TGCAATGTTA 7440

CAAATGGTTC AACATGACGT TTATGACACT CTGCAAATAA CTTATGGTAA TACTCAACAC 7500 

CTTTAGGGTT AACTTCGCCA TATCCATTTG GGAAGATACG AGACCATGCA ATTGAAATTC 7560 

GGATACCATT AACACCGAAT TTTTCACTTA ATTCTAAATC CACTGGATAT CTGTTATAAA 7620 

AATCACTCGC TGGTTCTGCA GTGTACCAAT AGTTTTCTTC TAAATACGTA TCCCATGCTA 7680 

CGCGACCTTT ACCATCCGTA TTTGTCGCAC CTTCTGCTTG ATATGCTGCT GTTGCTCCAC 7740 

CAAAAATAAA ATCTTCAGGT AATGTTTTAG TCATATGAAA AACTCCTATT CTTAATTTTC 7800 

AAATTGTTGT TGAACGAAAT CAAGGGCTGC TTGGCCATCT CGTGTCAATT TGATATATTC 7860 

AGCACCTTGA GTCTTCGCTA ATTTAATACC TAATCTATCT GTATCTTGCT TAATATCTTC 7920

ATAGTTAGAC GCAACTTGTG GCGCTAAAAT GATTAATTGG TACTCTTTCA TAATGTCCAT 7980 

ATGTGCGCCA TATCCGCCAG cTGCCGCTTT CACTGGCACA TGATATTCTT CAGCTGCTTT 8040 

ATTAAGTGCA TTGGCTAATA ATCCACTTGT ACCACCACCG GCACAAAGTA CTAAGACATT 8100

TGTTTGTTCT GTGATATTTG AAGCTTTAGC TGCATCGTCT GATACACCAC TTGCCGCTAA 8160 

AATTGAATCA GCTTTTTTCG TATCAAAGTT TGCTGCAACT TTTTCTTTTA AATCTGAATT 8220 

ACTTTCTTTA CGTCCTTCTT CTTCATCAAG AATTTCACTA TCATAAACTT TTAGGAATGG 8280

GTAGTAAATA ATAATATCTA CAACAATCAA AGTAATAGCT AGTACGAATG ACCATAAACC 8340

AAAACCTGTA CCCATGATAA TGCCCAATGG ACCTGGTGTT GTCCAAGGTA AATTCACACT 8400

AAAACTATTC ATTCCTAACA CTTCAACGAA AAGTTTGAAA ATCCATACGT TAACAATTGG 8460

TGCTAATACA AATGGAATAA AGAACACAGG ATTCAATACT AGTGGTGCAC CAAATAAAAT 8520 

TGGTTCGTTT ACACCAAAGA ATGTTGGTAC AACTGATGCA CGTCCAATCG CTTTGTTTCG 8580 

TTTAGATTTC GTCATCCACA TAAACATGAA CGGGACGACC AATGTTGCAC CCGTACCTCC 8640

AAATGTAACG ATAAACATTT GTGTACCTGA TGTAATAATT TTATCTGCGT GTTCTCCAGC 8700

TTGAAGCAAC TTGAAGTTCG CTTCGATATT CGCATATGTA ATGGCTGCAA TTGCTGGCTC 8760

TACAATTGAC GGACCATGAA TACCTACAAA CCAGAATAAT GCAAAGGCAC CAAAGATAAT 8820

TGTGACACCA ATCCATCCAT CTGCTGCTGT AAATAATGGT TCGAATAATT TTAAAATACC 8880

TTCCGCTACA TTTGATTTAA AGCTGTTGCG AATGACTAAA TCTAATGCAT AAAGAATGAT 8940

GATTACCGCT GAAAATGGAA TTAAGTCCTT AAATACTTGT GAAATATTCG GCGGTACTTC 9000 
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AAATGCTGAT AAGAATGCTG TTAATAAACC TTTAGTTCCC ATAAATGCAC TTAAGAATCC 
ACCATCTTTG GCTGGATCAG AAGCTAAGAA CAAGAATCCA CACATCGCTG CTAGCATTGT 9180
AGAAATAAAG TTAATTTGAT TTGTACTTTC TAGCTTACGG TTAAATGAAT CTGTTAAAGA 9240
TTTCGCTGTC GTTCCTGCTA CTAAAAATGC TACAAGCCCC ATCGTATAGT TATATGGTTT 9300
CATTAAAATG GCTTCCATGC CTTTATCCCA TTTAAAACCA AATATATTTG GCACATATGC 9360
AATTAATAGA AAGATACTTG AGAATAAGAT GACAGGCATT GCAGAAATAA ATCCATCACG 9420
GATGGCTCTT AAATATATGT TACGTGATAA TTTCTCGAAA AATGGCTTCC CTTTTTCAAT 9480
TTGTGCGATC AATTTTTGCA TCATTGTCAT CACCCTCTTT TATAAAATTC TAATAAATGC 9540
TTCATTAAAT CTTTCAGTAA AATTGTTGTC ATTAAATGAT CTTGACCATG CATCATCGTT 9600
ACACTGTATG CAATATCATC ACCTTGCGCT TCTTTAGCCA ACAGGCTTGT TTGTGCTCTA 9660
TGCGCTTCCG CAATGCAATT GTTTCCTTCT TCAATCAGTG CATCTGCTTT TGCAAAATCT 9720
CCAGCTTGAG CTGCTGTTAA TGCTTCTAAA AACTTAGAAC GTGCATCCCC TGCAAATGCA 9780
ACAATTTCAA AACCTAATAA TTGGACTTCT TCTCTATTCA TAGCATTAAT CCCCTTTTAA 9840
ACTTATTTTC TTTGTTTCCA AGATGTCGCA GTATCTTTTA ATACTTTATT TAAGTCATCA 9900
ATATTTTTGA AACCAGTTGT ACGTAACCAT TCACGAGCAG CATCTTCACC TTGTTCAATG 9960
TATACTTGAA CAGCACCAGA CCATGTAGCA CGGCCACAAA GTACCCCGTT GAATTTAGCA 10020
CCAGCTTCGT GAGCAAATTT TAAAGTTTCT TGGAATAATT CCGCAGAAAC ACCAGCACTT 10080
AAGTAAATGT ATGGTAAATG AGTTGCTGCA TCTTGATCTT TAAAGTGTTG TGCCGCTTCC 10140
TCTTTTGTAT AAACCACTTC ACCTTCAGCA AAGCCTTCTA CATATTTCAT GTTTACTGGT 10200
ACTTCAACTT TCAATACATC AACGTTAAAG CGTGGTTCTG AGAATAATTT CATTGCTTCG 10260
TTA«rCTTTC ^AGGCTTAAC TTTTGCGAAT TcAACAcTAC CGTTATcAGG AATGTTGTCA 10320
TCGTATGTTA ATACTTCTAA AAAGAATGGA ATATCTTCTG CAACACATTC TGAACCGATT 10380
CTTTCAATGT ATGCTTTCTT TTGAATGTTA ATTTCTTCAG CATCATCAAC ATCATAGTAA 10440
AGTAAGAATT TAACAGCATT TGCGCCTTGT TCTTTTAAAC GTTTTGCAGA CCACTCTACT 10500
AAACAGTCAG GTAAACGACC TTTAGCGTTT ACGTCATATC CAGTTTTTTC ATAAGCAAGT 10560
AATAATCCAC AATCTTTGTT ACGTGCATCT GAAGCTGGTA AACCATATTC AGGATCTAAT 10620
AAAATTGAAG ATGCATATTG TGTTAATTCT TCCGCAACTA ATACTTTTAA TTGTTCAATT 10680
TGAGCTACAG TTGGTTCTTC AGTTTGATGT TTTGCCATCA TGCGTTTTAA AGCACCACGT 10740
TGGTCAAATG CTAATGCAGA AATGATACCT TCGTTGTTAC TTAATTGTTC AATTGATGCG 10800
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TCATCATAAT TATTTAAATT GACATAACCT GTTTGTGCTT CTTGTGCATT CAGCATGCCT 10920 

AAAGTATTGG CTTTTTTTAG TAAATCGTGG TCGTTTTCAT GATTAAGAAT TGCTGAAGTA 10980 

ATTCCAGCAA CTGTAGAATC ACCTGAACCA ACCGGATTTA ATACACTTAT TGTCGGAATA 11040 

TTCACTCTAT AGAATGTATG ATTGTGCTTA GCGAATGCAC CTTGTGCACC TAAAGACACA 11100 

ATAATCCACT CAATCCCTTC GAATAAGGGT TGTGACACTG CCTGTTTCAA ACTTTCTAAA 11160 

CTTTCATCAA GTGGCTGGTT AAGCAATTGA TATAGTTCAG AAATGTTTGG TTTAATGACT 11220 

GTAGGTTTGT ATGGATTTTC CAAAACTGTT TGCAAAGTtG CACCCGAGCA ATCTAATATC 11280 

ACAGGCACAC CTTTGTTTTG GCATCGTTCA ATGATTTGTG CATAATAATC TTGATTTAAT 11340

CCTTTAGGTA AGCTACCTGA AATAGCAACT GCTTCAACTT TTTCTAATAA TTGTTCAAAA 11400

TGTTTAATAA ATCCTGCAGC CTCTTGATTA TCAATCTCCG GTCCCTGCTC TAAAATTTCT 11460

GTTTGTTGCC CTTCATGTAA AATTGCAATG CAGTTTCGTG TTTCACCCTT AATGTTATAA 11520

AATGCATGCT TGATGTCGGC ATGATCTAAT TTTTTAGCAA TAAATTGACC TAATTCACCG 11580

CCAATAAAAC CACTCGCAAG GACTGGCTCA CCTACTTGCG CAAGTACTCT TGTTACATTT 11640 

AAACCTTTAC CACCAGCTGT TTTACTTACT TCTTGAACAC GATTAACATC ATCTAATTTC 11700 

AATGCTGTTA ATGGGTATGA AATATCAACG GATGGATTTA ATGTTAAAGT TAAAATCATA 11760 

TGTGTCGTCC CTTAATCGTG GTATTCGCCT CTGTCCCATT TTTCTAAGAA TTCATCAAAG 11820 

AAATGTGGAT CAGCTTGATC TGCATTGCTT GTTTCTAAAT GTTTAATTTT AGCGATTAAT 11880

TTTTTGTTCT CTTCAGTTGG TTTATATTCA GCATTAATAA ATGCATCGAT AATATCGCAC 11940 

ATTAATAACT CACCTATAAT ACGTCCACCG AAGCCAATAA CGTTCGCATT TAATTCTTCT 12000 

TTAGCGTATA ACGCTGATGT CATATCACGT ACTAGTGCTG AACGAACGCC AGGTACTTTA 12060 

TTTACAGCAT TGTTAATACC AACACCTGTT CCACAAATAC AAACACCTAA GTCTGCATTA 12120 

CCGCTAACAA CTTGTTCGCC AACTTTTTTA CCAAAAATTG GATAATGTGT TCTTGTGAAA 12180 

TCGTATGTTC CTACGTCAAT GACTTCATGT CCTTTTGATT TTAAAAATTC AGATACACGC 12240

ATTTTTGTAT CTGTAACAAT ATGGTCGCAT CCTAATGCAA TCTTCATAGT AATTTTTCCT 12300

CCTTAGCACA TTTTATTAAG CATATCTACG CGGATTTGGT GTCTACCACC ATCGTATTTA 12360 

CCTTCAACAA AACCTTTAAC GACATTTTTC GCTAATGTGT CTCCAACAAT TTCAGATCCC 12420 

ATAGTGATCA TTCTTGAATT GTTATGGCCT CTAGTCATAT ATCCAGAGCG TTCATCTGAT 12480

ACTTCAGCAG CAATCATGCC TTTGATTTTT GTAGCAACCA TAAAGCTACC TGCACCAAAT 12540

GCATCGATAA CAATACCTAA GTTACCTTCT TGACTTTGAA GATCTTTTGC TACAGCCAAA 12600 
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TCTAATAAGT ATGATTTGAT GACTTCTTTT AATCGTTTGC CAGCTTCATC TGAACCAATA 12720 

ATAATCGCCA TAATAAGACT CCTTTTTACT TTAATTTTGA AATACCTTTC TTAAAATGTG 12780

ACATATTTAT TTGTAGGTTA TGAAAATCTT GAGAAAAGGC TTTCAATTTG ATTACGTTTA 12840

AATTATAAAC ATAAACAAAC AATAAATCAA CATAATATGT TTATAATATG TTTGTTTATG 12900 

ACGTATTTTC AAACAATAAG TGAACATTCA TATTGTGGTG TTGTTTTAAT TAGGTATTCG 12960 

TCTGAAATTG TAGTAAAACT TTGTCGAGGT TCCCGTTGaC ATAAATTTGC ATAAAAAAtA 13020

GCCCATAAAT GAATGCAAAT TCACATTCAC TTATGAGCAT ATAGATACAT ATTTTAACAA 13080 

TGCAGTTATA CTTTTAATTT AGTCGACTAC TTCAATATAT GTTTTAATCG TTTCTACTTT 13140

TTCTTCATCT TCATAGTCCA TGACCACTGC AGTCAATTCG TTTAACTGAC AAAATGATGT 13200 

AAAATCTTCT TTGCCAACTT TCGTATGATC GATTAACAAG TATTTTTCAA TTGAATTACT 13260

TAGTGCCAGT TGTTGCGTAT AGGCTTCATC TAATGTAGAT GTCATCACAG CACCTTTATT 13320

TACTGCGTTA CTACTAAAGA ACATCTTGCT AAATCTTAGT TTTTCCAACA TGGCGTTCGC 13380

cATTTCACCT ACAAATGCTT CTGTAATATG GCGCATTTCA CCACCAATTA AATAGACACG 13440

AAAATGTGCT GTTTGTTTTT CTAACAAAAT TTTATACACC GGCAAACAAT TCGTAATAAT 13500

TGTGAGCGTA TGATGATTGA CTTCTTCTGC TAATAGTTCC ACTGTTGTTC CTGGTCCGAA 13560

AAACAAAGTA TCCCCATCTT CAATTAATGA TGCAGCTTTT TTAGCTATAA ATCGTTTTTC 13620

TGCAATTTGA CGGGTATGTT TTTCTTTATG CGATATTTCT TTATACTGAA ATGTTGAATT 13680

ACTGCGTGCA CCACCATGAA TCTTCGTTAA AATCCCTTTA TTTTCCAATT CAATTAAATC 13740

TCTTCGAACT GTCATATCAG ACACATTTAA ACCTTCGACG ATTTCATTCG TTCTTATCGT 13800 

GCCCTTTTTA TTCACTAGTT TAGCAATTTC GTCCAAACGT TCATGTTTAT TCAATGTAAA 13860 

ATTGCfcTC

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4549 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
TTAAGTCAAC TTTGTCTATA CGGTTTGGAT CtTCTaCCCA ATGTCTTATA AAAGACAATC 60
CCGCACCTGA AACATAACTC ATGAAATAAG AAAATGGTAT ACCATTAATT TGATCATTTT 120 
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AATCTTTACC 


CATACGAAAC ATCAATTGAT AAAATGCGAT GTCTTTTTCT ATCATTTCTA 240

TTAAAACGGT CATAATTTGA TGTATGTTAT CCGTGGATAA CTTAACTGCT CCATTTAACT 300

TCTCATCATG AATGAAGTCT CTTATTTCCT CCAACTGCTG GTCCTCTAAT TTTTCAAGCA 360

AATCATACTT ATCATAATAA TGCGTATAAA ATGTACTACG GTTAACATCA GCTAAATCTG 420

CAATTTGTTG CACAGTAATC TCTTCTAATT GGTGTTGATG TAAAAGTTCA ATAAATGCAT 480

TTCTCATTGC AACTTGTGAT TTTCTAATAC GTCGATCTAT AGTCATTTAT ATCAAGTCCT 540

CCCCAATGAT TATAAACGTT ATGTTCATTA TCCCACAAAT CTCCAACATT GATGATTGGC 600

ACACAATGTT TACCTGTTTA ATATAGGTGA TACAAACAAA CAGAAAAAGG TGATAACAAT 660

GAACCAACAT TTACTAGGAA ATCCAAAATT AACTGTAACT CATGTCAATG AAGTTAAAGC 720

CGGTATTAAC CACATCGTTG TCGACAGTGT TCAATATGGA AATCAAGAAA TGATTATGGA 780


AAAAGATGTC ACTGTGGAAA TGCGCGATGG CGAAAAATTA TATATTAATA TTTTCAGACC 840

AAATAAAGAT GGCAAATTCC CTGTAGTTAT GTCTGCAGAT ACTTACGGTA AAGATAATAA 900

GCCTAAAATC ACAAATATGG GTGCCCTTTG GCCAACATTA GGTACCATTC CGACATCTAG 960

TTTTACACCT GAAGAATCAC CAGACCCAGG ATTTTGGGTG CCAAATGATT ATGTTGTAGT 1020

TAAAGTTGCA TTACGCGGTA GTGACAAATC CAAAGGCGTC TTATCTCCAT GGTCAAAAAG 1080

AGAAGCGGAA GATTATTACG ArTGATTGAA TGGGCAGCAA ATCAGTCATG GAGTAATGGA 1140

AATATCGGGA CAAATGGTGT TTCTTATCTT GCGGTGACTC AATGGTGGGT CGCATCATTA 1200

AATCCACCAC ATTTAAAAgC AAtGATTCCT TGGGAAGGCT TAAATGATAT GTATAgAGAA 1260

GTAGCCTTTC ACGGAGGTAT mCCAGATACT GGCTTTTATC GTTTCTGGAC TCAAGGTATT 1320


TTTGCGAGAT GGACAGATAA TCCAAATATC GAAGATTTGA TTCAAGCACA ACAAGAACAT 1380

CCTCTGTTCG ATGATTTTTG GAAACAGCGT CAAGTGCCAT TATCACAAAT TAAAACACCT 1440

CTACTAACAT GTGCTAGTTG GTCTACACAA GGTTTGCACA ACCGTGGCTC TTTTGAAGGA 1500

TTTAAACAAG CTGCATCTGA AGAAAAATGG CTATATGTGC ATGGACGTAA AGAGTGGGAA 1560

AGTTACTACG CTAGAGAAAA TCTCGAACGC CAAAAATCAT TCTTTGATTT TTACCTTAAA 1620

GAAGAAAATA ACGATTGGAA AGATACGCCT CATGTCATTT ATGAAGTTAG AGATCAATTT 1680

TATAAAGGCG AATTCAAATC AGCGTCACGT GTCCCTTTAC CTAACGCAGA ATATACACCA 1740

TTGTATTTGA ATGCTGAAAA TCACACATTG AATCATGCAA AGATTAGTAG CGCGCATGTC 1800

GCACAATATG ACTCTGAAGA TAAACAACAA GATGTAAGTT TTAAATATAC GTTTGACAAA 1860

GATACTGAGT TAGTTGGAAA CATGAACTTA AAACTATGGG TAAGCACTAA AGACTCAGAT 1920
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10 






CCTGATTTTA ATCATATTGA AAATGGTCAA GTAGCTACTG GTTGGTTACG CGTATCACAT 
CGTGAATTAG ATCAAGAAAA ATCCTCAATC GCGCAACCTT GGCATAAACA TGAAACAGAA 
TTAAAGTTGT CACAAGATGA GATTGTACCT GTTGAAATCG AATTGTTACC TTCAGGCACG 
CTATTTAAAC AAGGCGAAAC ATTGGAAGTT GTTGTAAAGG GTAGTGAAAT TGTAATTGGT 
AATAGTACTC CTGGCATGAA AACACGTTAT GAACATGAAG AAACCGTAAA TAAAGGCATG 
CACATGATTT ATACTGGTGG TAAATATGAT TCACAATTAA TCATTCCTAT CGTTAATTGA 
TATGCAGCAA TTACGGTCGC TTTTGATTAA AAGTGACATA GTGATAGGAC TGTATAACAA 
GAGAAAGCCA CACGCTTGGA ATCTTAAACC AAGGTGTGGC CCTTTTTATT ATTGATGGCT 
ATTTAATTTT ATAACACTAT CGTATTTTCT TTTTCATGAA TCATTTCAAT AATGACATTA 
TCTTCATTCA TTACTGCTAC TTTAGGTGCA TGGTTTTTAA TTTCTTCTTC ATTCAACTGT 
GCATAAGTCA TGATTATGAC TACATCGCCT ACTTCAACAA GTCTTGACGC TGCACCGTTT 
AAACAAATTT TACCACTACC TCTTTCACCA GCTATTACGT ATGTTTCAAA ACGTGCACCA 
TTATTATTAT TCACGATGGC TACTTTTTCA TTTGGCAAGA TGTCTACCGC TTCCAATATA 
TCTGAATCAA TCGTAATGCT ACCTACATAA TTTAAATTTG ACTCAGTCAC TCTTGCTCTA 
TGAATTTTAG CATTCATCAT TGTTCTTATC ACTTTATTCA GCTCCAATTA TTATATTATC
TATTAAACGC GCTTTTGAAA ATTTAACAGC TAAcGAGATA AATATGCGTC CAGTTATTTC 
GTGTTGTTCT ACTAATTGAG GATAACTATA AACAGCAACT TCTTCAATGC GTTCACTTAT 
ATGTGATTCA AGATATTCAG TAACCCTGTC TATAATTACT TTACTTTGAC GTTCACCGTC 
TTGATACAAC GCTTGTGCTA ATAGCAAACT TTTACTTAAA TGTACCGCTT CTTGTCGTTC 
TTGCTCCGTT AAATAAACAT TTCTTGAACT TTTCGCCAAA CCATCTGCTT CTCGAACGAT 
ATCA&TACCA ATAATTTCAA CGGCATGATT GAAGTCTTTT ACCmTTTGCT GGaGAATAGC
CAATTGCTGG GCATCTTTTT TACCAAAATA AGCATAATCC GGCATAACAA TATTAAATAG 
CTTATTAACT ACTGTTACCA CCCCATCAAA ATGCCCTGGr CCGtTCGCTC CTTCTAACAC 
ATCAGCTAAT GGGCCTACTT TGACATCAAT ACCTAATTCA CCTGGATACA TATCTTCTAC
TGCAGGATGA AAAACAATGT CCGCTCCTAC TTCTGATACT AATTCTAAAT CTTTATCAAT 
TTGTCTCGGA TAAGCATCGA AATCTTCGTT TGGACCAAAT TGTAATGGAT TAACAAATAC 
ACTCACAATT GTAATATCAT TTGTACTAAC TGATTCGCGT ACCATCGTTA AATGTCCATC
ATGTAAGGCA CCCATTGTTG GGATAAAACC AATCGTTGTG CCTGAGCGTT TGGCTGCTTT
AACAATGTGT TGCATCTCTT TTACCGTAGT AATCAGCTTA GTCATTGTTA TTAACCTCAT
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GATCGTATTG TTTTAAACCA TCCACACCAA CACTAAAATC AGCAAATTGC TTCACAAATT 
TCGCTTTATG TTCAACACCA TAATTTAACA TATCGTGATA AACCAATACT TGACCATCTG
TACCTTTTCC TGCACCAATA CCAATGACTG GAATTGTTAA GTGCTTGCTA ATTTCTTCTG 
CTAAATCATT TGGAATTGCT TCAAGTACTA ACGCAACTGC ACCAGCTTGT TCTACATTTT 
TCGCGTCTAA AATAAGTTGc TCCGCTGCTT CTTTCGTTGC ACCTTGTAAT TTATACCCCA 
TAACGCCAAC ACTTTGAGGT GTTAATCCTA AATGTGCAAC AACAGGAATA CCAATTGCCG 
TTGCTTTTTC AATAAATGGT GTAATATGCG CTCCTTCTGC TTTAATTGCA TTTGCATTCG 
TCTCCTGATA AAGCTTTAGA GCATGATTTA AGTCTTGTGT CATAGAGATG CCTACTGCAC 
CAATCGGCAT ATCAACAACT ACAAATGTAT TTGGTGCGCC TCTTCTTACT GCACGACCGT 
GATGAATCAT ATCTGCTAAC GTCACTTGTA CGGTACTTTC ATAACCTAAT ACAGTCATAC 
CAAGTGAATC CCCAACAAGA ATCATATCAA TACCCGCTGC TTCCACTTGT TTAGCACTTG 
GAAAATCATA AGCTGTTACC ATAGAAATTT TAGTTTGCTT TTGTTTCATA TCTATTAATT
GACTTACTGT TTTCAATGTT ATTCAACCTC TTTTTGCAGT ATnATTAGA 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
TTATCTTTTG TTGTTTCCTT AGACAAACGA CTAACCACAT TATAATGGAC TAATTTATTA 
ATTFTATTTA ATTCCATTAA GTTATCCGTA ACACTAAGTG AAGATGCGGA GTTCACTCTC 
GTTTGTACTC TTCGTTTTAA TAAAGCACCT CGTAATAATA CAATCATTCT TCTTATTAAT
GATGCTTGTC TATATACCTG TGTTCTTTCA GCATAACGCA TATAGTTTTC AAGTACACTA 
TTCGTTATTT GTCCTTCATC TACTAAAGAC TCTAATGTTT TkGTTTCTAC ATTAAAAGCA 
ATTTTTTGTA GACGTTCTAA TTCTTTAGAG TTTTCATCAT CTTTCTCTAC AGTTTTTAAA 
AATGCTAATT TATCATGATA TTCTTTAATC ACGTTACCAT ATTTAAAACT TGTTTCGAAA 
GTAGATTTTT GATTTAGATA ATCAATAACT TGTTCTAATA TATAAATTCT AGCAACTTTA 
AACGACATAT TGCCAATTAC TGTTTTAGGT GCAGGTTTCG TTAATAATGG CAATAATACT 
TGCGCAACTA CCAAACTAAT AATAACCATA CCAGATGCAA TAAATAATAA GTCGTTTCTA
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ATTGTTCCAT GCACACCACA TAACGTCATA ATTAAAGCGT ATAAACTTCG CTTTGGTGGT 72 0 

TTCTCAGTCG TTGGATTATC ATCATCATTT TTAGTCATCA TTTTTTGGAA TGGACTGATG 780 

GCTAAATAAA AATAAGGATA TAAGACATAA ACCCAAACAA ATCTAAATAG ATAGACAGCT 840

AAAGCAACAA CAATAGTGAT GCCTATTAAA AAGATTAAAT TGTGCGGTTC TGTTTTGATA 900

ATTTTAATAA TAACTTCAGG TACTAAAAAT CCTAATATTG AAAAAACAAA GCCATTTAAA 960

ACATAACCTA GTATATTCCA TGTATGATTG TAACTCATTT GCAGTTGTGT ACGTACTTGC 1020

ATAATTCTGT CACGTTCGAA ACCATGTACA AGTCCTGCAA CTACTGCTGC AATGATTCCT 1080

GATGCGTGaA ACAATTCAGC AATTAAATAC GTAACAAATG GTGTTAACAA TTGAATAATT 1140

GTAAACATAT TAATGTTTTC ATATCCTCGA CGCATCAATG TTAATCGGAA CCTTACTAAT 1200

GCCATACCTA TAAGTAAACC AACCACTGCG CCACCAATTG ATGCAATTAA AAACAACTGA 1260

ACAGCATCAA CAAGTGAAAA AGCACCTGTA ACTAATACTC CAACAGCTAT TTTAAATGAA 1320

ATAATACCAG CAGCATCATT CAATAATGAC TCACCTTCAA GAATTGTCAT TGCTCCTTTT 1380

GGCAAGACCT TTCCTTTAGT GATTGCTTGC ACTGCTACTG CATCAGTAGG ACAAAGAATG 1440

GCAGCAATTG CAAATGCTGC TCCAATAGGT AAATCTGGCC AAATCCAATG AATAAATAAA 1500

CCTACACCTA TCACAGTAGT AATGACTAAT CCTAATGCCA TCATCATCAC TGGCTTAATA 1560

TATTTCCTTA AATGGACTCT AGAAACATTA ACACCTTCTA CAAATAACAA AGGCGCAATC 1620

ATTGTTACCA TAAACAATTC AGAATCAAAA TTAAATTGAA CAGGGATTGG GGTAATAAAT 1680

AGTAACATGC CCAAGAAAAT TTGTATAAAT GCTAGGGGTA CTTTAGGTAT GAAAGTATGG 1740

ACAAACGAAC TTAGTATCAC AACAGCTATA AATATAAGAA TTGTTTCAAA TATTTCCAAA 1800

CTTTCACCTC TCTAAAAAGT ATTGTTTAAT TGAAAATTAA GTATCACATC TCGTTGTAAT 1860 

TATAdTTAG AGGATAAATT GAGTTAGCG/T CCACAAAAGC ACTTTTU^A'f AGATATATGT 1920 

CTACGATTGC AGTACTTAAA TTTGCAATTA TTTAATTTTA TTTTATCACT AATTGTTTGT 1980 

ATAAATAAAC AACTTGCTTT CACATAACAA CATTAACTTA TAATACAAAA AATGAGCACC 2040

TTAAAATCGA CTAACCAATT TCaAAGTACT CTTTTAATGA TTAATTTTGA AAACAGATTT 2100

TCaAAAGCAT TGTTATGCTT AACAATTTAG CCAACACTTC AATCGTTTTG ATACCATTTC 2160

TTACGATGCT CTTCTCGTTT TTCAGCACGT AATTGTAATG CTTCTGTAGA GTTTTGTTCA 2220

TTTGAACTTA ATAATATTGA TGCATGTGTG TGAGCATCAT TTTTTCGATA CATATAAGCG 2280

CCGTTGCGAT AAGCAGCGCG AGCGACTAAG TGCATGCCGA CTGGTGAAGT TAAATTAATA 2340

AAAACAAGTG ACAGTAATAA ACGCACACTG AAAAATCCTG TATTCACAAT AAAATAAATT 2400 
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CTTAAGAAAA CATCTTGGAA TTTCACGATA CCTATTGCAC TAATAAGAGC AATAAAACTA 2 520 

CCTAACAACA ACATCACAGC AGCAATAAGA CTAAAGATTT CTTTTGTTAT TTCCATTAAA 2580

CACATGCCCC CCACCAATAA AGCGTGATAT TGAAACAGAA CTTACAAAAG ATATAATGGC 2640

AATGAGCATG ATTGAATCTA AGAAAGAAAC GGTGCCCATA AGTACACTTA ACACACCCAC 2700

AATTGACATT ACGACAGCAC TTGTTGTATC AAATGTAACG ACACGATCTG CTGTTGTAGG 2760

TCCCTTGATT AATCTAAATA AACAGATGAT TAATGCAATT CCAAAAATAA TGAGTGAACT 2820

AATAATCATA ATATGTGTTA TTGTTTGTAT CATCGCGACA CCTCCAATAT TAAGTCTTCA 2880

TAATGCTTAA TACTTCTTAA CAAACTATCT TTTTCTTTTT CTGACACGTC GATACTATGA 2940

ATAAAAAACT TTTTAGAGTC TTGAGAAATT CGTATTACTG TAGACCCTGG AGTTATAATA 3000

ATTAAAATTG TTAAAAATGT TATTGACCAA TCACTTGTTA GTCTTGTTTC ATATGAAAGT 3060 

AATCCAGGGT TCATATCTTT TGTTTTAAAA AGAATATAAT TAATCGTGCT AATGCTAGAT 3120

GTTATTAATT GATATAAATA AACACCTAAA AATTTAATAG CTACCCATAT TTTTCTAACA 3180

TAAAAATCAT CGCTGAAAAA CCTGTGTAAT ATATAAATGA CAATTAAACC AATTAGATAT 3240

CCAGAAAAGA AAGTCGAGAA TTTAAAATGA TCTTCATCTT GAAATAATAC CCATAAGAAT 3300

GCAATGATAA TATTTAAAAC TATTTGATTC ATTTAGTCCT CTCCTTTCAA ATGCGGATTT 3360

ACAAGTTTTT GATATAATTG ATCACTCGTG TTCAACTCAG TTGCATCACT TGTAACATTT 3420

AACACAACAG GTGCAGCAAT TCCGATTGCG ATAACCACAA CTACTAAAAT ACTTAAAATT 3480

CTTTTTCGAT ATAGCGGGAT TTTCTTAAAA TTAACTTCCT CCCCATCTTT ATCTCCAAAA 3540

TACATATAAA AAAGTATCCT AAATAAACTG TACATTGCAA TTAGACTAGT AATAATCATT 3600

AACGCTAGTC CAATATAATT GCCATTTTGC AATGCACCTT GGAAAATAAG TACTTTCCCC 3660

GGAAAGCCAC TAAATGGAGG CACGCCGCCA ATAGCAAAAA TCATTATAAT AAACGCAACT 3720

CCAAATAAAG GTTCTTTTTT AGCTAAGCCA TTCAAATATT GATATTGTCG ATAGCCTGTA 3780

ATGTAAACTA AACTACCAAT AATAAAAAAT AGCAATGTTT TTACAACAAT GTCATTTACC 3840

AAATAAAATA TTGCACCATT AATACCTGCA AACGTGTTTG TTCCTAAACC TAAAATGATA 3900

AATCCTATTG AGATTATGAC TTGGTAAGCT GCAATCTTTT TAATATCTTT ATAAGCAATG 3960

ACACCTATAG CGCCGATGAC CATAGTTATA GCAGCCATAG TTGCTAGCAA TGGATGTATG 4020

AGATCATTAT GTTGATCAAA TAGTAAAGTG AAGAATCGAA TTAATGCATA GGCCCCTACT 4080

TTGGTCATTA ACGCTGCAAA TAATGCTGCA AGCTcAGTAT TTAACACAGC GTAGGCTTTG 4140

GGTAGCCACA TAAAAAGGAC CAGCGCTGCT TTCGCACTAA ATGCGACTAA GAAGATTAAT 4200 
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AAGTTTAATG TACCTACTGT TTTATAAAGT AAACCTATAC CTAATAAGAA TAGCCATGAA 4 320 

CCAATAATAT TCAAGACAAC ATAAATAATT GCAGCACGTA ATTGTTCTAC AGATTGTCCA 4380

AGTGTAATGA GTACAAATGA CGCTAGTAAC ATAATTTCAA ACATGACGTA TAAATTAAAT 4440

AAATCTGATG TTAGAAAAGA GCCTATCACG CCAACACTTA AAAATAATAT GAACGATGGC 4500

AAGTGATAAC GATTTGCTTT ATGTTCGCCA CGCCCAAATC CGTATGCCAT AATTAAAGTA 4560

ATCACAAACG AAGCGGTTGT AACCATAATT AAACTTAAAG AATCTCCTAA AAACTGTATA 4620

CCAAAGGGCG CTGACCATCC TCCAAAGTCT AGCGTAATTG GACGGTGACG CTGAACATAA 4680

ATTAATAGCA TTAATGAAAT AATTGTGGTG ATAGTCATTG TACCTAAGTA TAAATATTTA 4740

GAAATACGAT CATTATTTTT TAAAAATACA AGGATTAAGG CACAAAGGAA TGGTAATAAC 4800

ATTGGTAAAA TCAATAAGTT ACTTAGCATC ATCTTCCCCC CTTAGGCCTT CAATTTCATC 4860

TTCTTTTGTT ACTTTATAAG TTCTATAAAC AAGTACAAGT AAAAACGCAG TCATCCCAAA 4920

CCCTATAACT ATTGCAGTTA GTACAATAGC TTGTAACAAG GGATCAACAA ACAATTGGTT 4980

TCCACCAGTT ATTAGTGGTT CTGATCTACT AGAACCATAC GTTCCCATAC TCATAATAAT 5040

GAGATTACCA GCATGAGTAT ATATTGAAAT TCCGATTACA ATACGAATTA AATTGATTGA 5100

TAAAATCATA TATGTTCCTA TAAACACTAA AAATCCTATA ACTAGTAATA ATATTAAATT 5160 

CATGATCGAC CTCCGCTAAG CGACAACATC ACTGTGACAA TAACACCAAC AACTGAGAAT 5220 

AAAATACCTA ATTCAAAAAG TGTTATTGTA CTTACATGAA TTTGTCCTAA AATTGGAAGT 5280

ATCCAAGTTG TTTCATATTG AGACAAAAAT GGTTTTCCAA AAAACATAGG TATTATCGCA 5340

GTAATAGATG ATACCAATGC TCCAATAATC ATTAAAATTC TAAAATCAAT CGGTAAACTT 5400

TCTAAAACCT CTTCAACATT AAAAGCCAGA AACATTAAAA TAAACGCTGA ACTAAATATT 5460

AAACCACCAA TAAAGGGAGG AGGAGGATTA TTATGACGTG GGAAGAAGAG ATAGAATCCG 5520

AAAGTCAATA AAATAAATAC AACAAGTTTC GTGACCGTTC TTAACACGAC ATCATTCTCT 5580

TTCATCTTGT CCCCTCCGAT CTTGATAATT TAATAATGtg TAAATACCTA GCCCAGTAAT 5640

AATTAACACT AATCCTTCAA ATAATGTATC TAATGCTCTA AAGTCACCAA GTATCGCATT 5700

TACAATATTT TTAGCACCTG TTAGTTTGTC AGCTTTTAAA TAAAAGTCTG ATATTGATGA 5760

TAAACCATCT GTTTGTTGTG TAATAAAAAT TAATGATACA ACAATAAGTG CCATCAAGAG 5820

TGATACAGAA ATTTTAATTA TTTCTCTTTT TTTGTTAGCG TTAGATCTTG GCACGTTTGG 5880

TAATCTTGAA AAACTGACAA TAAATAGTAT CGTCGTTATT GTTTCAACTA CTAGCTGAGT 5940

CAATGCTAGA TCAGGGGCTT TCATTGCTAT AAAGAATAAG GTCACAACAA ATCCGATGAC 6000 
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GACAGTTACG ATTGCTAATA TAATTTCTAA TGCCCCAAAT TCAGAAACAT GTAACTGATG 6120 

TACTTTAGGA AGTCCaATTC GAATATAACC ATATCCAATG ATAATCATAA ATATGCCTAA 6180

GGTCATAATA ATGTACTGGT TTAAACGATC TTGCATAACA CGTTTAAATC GCTTCGTAGC 6240

AAACTTTTCA AAATGTCGAT ATACCATCTC ATAGCTTTTT GAAACTGAAA TCTGTCTAAT 6300 

TTTACCTGTG AACACTTTTT TCCAATCTAC TTTGATTGCT AGTACACTAC CCAATAAAAT 6360 

AATGATGATG GTTAAAAGAA GCGGTATGTT AAATCCATGC CATTGCGAAA CATGTGGTGC 6420

CAATTGATCA ATTTGATGAT TACCACCTGA TACAGCTCTT AATGCnAGAA CGATAATCCC 6480

CTTCCCAAAT ATATnTGGTA CAAAAAAGAT TACAGGTACT AGCACCATTA aTATAAGAGA 6540

TGGTAAACTA aACAACCATG GTTCGTGGAT ATTTTTTTTA GTAAAAACCT TAGAATCATA 6600 

TTTTGtCCAA AATACTTCTT TTACCATGTA TAGTGCATAT GTGAATGTAA AAACACTCGC 6660 

AATAACACCA ACAAACACGA TAGCTATCAT TGAAATCAAA CTAAATTGGG ATAATTGTCC 6720 

AGTTTGTGTT AATGCATCTA AAAACATTTC TTTACTTAAA AATCCATTTA AAAATGGTAC 6780 

TCCAGCCATA GATAGAGCCG CTATCGTCAT GACTAGATTC ATTTTAGGAA ATAGTTGACG 6840

CATTCCACTT AAAATTCGTA TATCCCTTGA ACCTGCTTCA TGATCTAAAA TACCTACTCC 6900

CATGAAAAGC GCACATTTAA AGATGGCATG ATTCATTAGa TGAAATAGcG CACCArATAA 6960

TACmAATACA TAAATaGATG CTATTGCGTC TTGTTGGTGT TGAGCATATC CGCCACCTAT 7020

ACCCACCATA GCCATAATCA TCCCAAGTTG ACTGATTGTA GAGTACGCTA GGATACCTTT 7080

TAAATCCCAT TGTTTTAAAG CTGTAATTGA ACCAAATAAC ATTGTTATTA AACCAACAAA 7140

CGTAACGATA TATACGTACA TATTGCTAnG ACCTAATAAT GGTGTAAATC GAAGTAATAG 7200

AAnGATACCA GCTTTTACCA TCGTGGCTGA ATGTAAATAA GCACTTACAG GTGTAGGTGC 7260

AGCCATTGCT CTAGGTAGCC AGTATGAAAT GGAraTTGTG CTGATTTTGT AAATGCACCT 7320

AATAAAAACA TAAAAATCAT AGGGATAAAC AATCCATGAT TCTTAATATG ATCTGCTTGT 7380

CCTAATATCT CTGTGATGTT ATTCGTTCCT GTCATGATAT ACAGCATAAT AAAACCAACT 7440 

AATAACGCCA ATCCACCAAA TACTGTAATC ATAAATGATT GAATCGCACC AAATTGACTG 7500 

TCACCATTGT TATACCAATA TGAnATCAAT AAAAATGATG ATmCACTCGT TAATTCCCAA 7560 

AAaATGTACA TCmATATCGT ATTGTCTGAT AATACaaTAC CAATCATACT GAACATAAAT 7620

AACGTTAAAT AAAAATAAAA CCTTGGTAAA TTGTCTTTTC GAGAGGATAA ATATTGAGTT 7680

GCATAGAAGA ATACTGCAAT TCCAATAAGT GAAATAATAA GAGAAAACAT TAAACTTAAA 7740

CCATCTAAAC GTAAATCTAA ATTAATATCT AATGTCTTAA TCCATGGAAT AGAGGTAGAA 7800 
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GGTGCAACCA ACGCTATGTA CCCGGCATAT TTAGCCAATG CTCTACGTTT AGACATTAGA 
AGTATCATCG CCATAATCAC AAGTATAGCA ATTAATAAAT AAACCAAACT CATTATTAGC 
CTCCTTTGTT TCTATAATTG TAATGAAATA TAAATACTAT GTTCACACTC ATTTTCTAAA 
CCGATAAAAT TTAGTGTTTC AATAGCAGAT TGATGCCCTA AATACTTTTG AATGACTGGT 
ATAAGTATAC CTTTTTGATA AGCATGATAT GCAAATGTCT TACGCAATGT CGTTAGTCCT 
ACATTATCTA TACCAGCTTC AATTGATGCT TGGTGAATTA TTCGATATGC TTGCTGTCTA 
GATAATACTT GATTTGTTCG TAGTGATTGA AAAAGAACGT CTTCATTCGA AAGACTCCTG 
TCCTCTATAT ATTGAAGTAG TTCTTTCGAT AATGTTTCTG GTAACCTAAT TTTAATCAA 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:
CCCGATTTTT TTACGTAATC TAATACATAC GGCAAAATCA ACTTTAATCA AAAAAGACTC 
ATACACAATG CCTTTAAAGC ACATGTATGA GTCCTTTTTA GTAGTTTATA TCAAAAAATA 
GTTTAATGTA TAAATTAGTT TTTGTTTACA GATGCGTCGT AGATTGATTC TACAGCATCA 
CCTAAAGCTT TATCGAATTC TTCTTTAGAT TGATCAGCTC TTAAATCACT AGCTAATGCA
CGTGAGAAAC TTGCGATAAG TTCAGCGTTA TCTTTAAGTA ATTCATTTGC TTTTTCTCTG 
CTGTAACCAC CTGATAATAC AACGACACGA ACAACATTAG GATGTTCAGC TAACTCTTTG 
TATAAGTTTG GTTCAGTAGG AATTGTTAAT TTCAACATTA CTAATTGATC AGCATTTAAG
CTATCTAAAC CTTTTTTAAG TTCAGCTTTT AATACTTTTT CAATTTCAGC TTTGTCTTTT 
GCATTAATAT TAACTTCTGG TTCGATAATT GGAACTAAAC CTTTAGCAAT AATTTGTTTA 
GCAACTTCAA ATTGTTGTTC AACAACGTCT TTGATACCTT GCTCATTT 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






EP 0 786 519 A2 





ATAGAGTnCT GGnACTTACT ATGACATATG GCGCTAGAAT GGCTGAGCCA GGTGAATTTA 60

CAAAACGTGC CTTTTTAAAT GGTCGTATTG ATTTATCTCA AGCTGAAGCA GTTATGGACT 120

TTATTCGCTC GAAGACAGAT AGAGCTTCTA AAGTTGCGAT GAATCAAATT GAAGGTCGTC 180

TAAGTGACTT AATCAAAAAA CAACGTCAAT CTATATTAGA GATACTCGCT CAAGTGGAAG 240

TGAATATTGA TTATCCTGAA TACGATGATG TTGAAGATGC GACTACTGAA TTTCTTTTAG 300

AGCAGTCTAA AGAAATCAAA CAGGAAATTA ATCGTTTATT AGATACCGGT GCGCAGGGTA 360

AAATTATGCG TGAAGGTTTA TCTACAGTTA TTGTTGGTAA ACCAAACGTA GGTAAATCAT 420

CGATGTTAAA TAATTTAATA CAAGATAATA AAGCGATTGT AACTGAGGTA GCAGGTACTA 480

CTAGAGATGT CTTAGAAGAG TACGTCAATG TTCGTGGCGT GCCATTAAGA TTAGTTGATA 540




CTGCTGGTAT ACGTGAGACA GAAGATATAG TAGAGAAGAT TGGTGTTGAA CGCTCTAGAA 600

AGGCTCTTAG CCAAGCAGAC TTAATTTTAT TTGTATTAAA CAATAACGAA GCATTGACwC 660

AAGAAGATTA CACATTATAT GAAGTGGTTA AAAATGAAGA TGTAATCGTA ATTGTTAATA 720

AAATGGATTT AGAGCAAAAC ATAGATATTA ATGAAGTTAA AGATATGATA GGTGATACGC 780

CATTAATTCA AACTTCAATG TTAAAACAAG AAGGTATTGA TGaATTAGAA ATACAAATTC 840

gAGATTTGTT CTTTGGTGGa GAAGTACAAA ATCAAGATAT GACTTATGTT TCTAATTCAA 900

GACATATTTC ATTATTAAAA CAAGCAAGAC AAACGATACA AGATGCGATT GATGCAGCAG 960

AATCTGGTGT GCCTATGGaT ATGGTACAAA TTGATTTAAC TAGAACTTGG GAAATATTAG 1020

GAGAAATTAT TGGTGAGACT GCAAgTGATG AACTCATCGA TCAGTTATTC AGTCAATTCT 1080

GCTTAGGTAA ATAGTAATTG AAATAGACGG AATACCGTCT TAAGAAGGCT AGTAAGATAT 1140


35 


CAAATAAGGA 


GGTTTATATT 


GTGGTTCAAG 


AATATGATGT 


AATCGTTATA 


GGTGCGGGAC 


1200 




ATGCAGGTGT 


AGAAGCAGGT 


TT AG CATCTG 


CAAGACGTGG 


TGCTAAAACA 


TTAATGCTAA 


1260 




CAATAAATTT 


AGATAATATT 


GCATTTATGC 


CATGTAACCC 


ATCTGTAGGT 


GG AC CAG CT A 


1320 


40 


AAGGTATCGT 


TGTTCGTGAA 


ATTGATGCTT 


TAGGTGGACA 


AATGGCAAAA 


ACAATCGATA 


1380 




AAACACACAT 


TCAAATGAGA 


ATGTTAAATA 


CAGGTAAAGG 


ACCTGCTGTA 


AGAGCACTAA 


1440 




GAGCGCAAgc AGaTAAAGTA 


CTTTATCAAC 


AAGAAATGAA 


ACGCGTGATT 


GAAGATGAAG 


1500 


45 


AAAATTTGCA 


TATAATGCAA 


GGTATGGTAG 


ACGAACTTAT 


TATAGAAGAT 


AATGAAGTTA 


1560 




AAGGTGTACG 


TACAAATATT 


GGTACAGAGT 


ATTTATCTAA 


. AG CAGTAATT 


ATTACAACGG 


1620 




GAACATTTTT 


ACGTGGTGAA 


ATCATTTTAG 


GTAATATGAA 


> GTATTCAAGT 


GGACCAAATC 


1680 


50 


ACCAATTACC 


ATCAATCACA 


> TTATCAGACA 


t ATTTAAGAGA ACTTGGTTTT 


' GATATTGTTC 


1740 
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AAATACAACC AGGTGACGAT GTAGGTCGTG CATTCAGCTT TGAAACAACA GAATATATAT "i860 

TAGATCAATT GCCATGTTGG CTAACGTATA CTAATGCTGA AACACACAAA GTTATCGATG 1920

ATAATTTACA TCTATCTGCA ATGTATTCAG GGATGATTAA AGGAACCGGG CCACGTTATT 1980

GCCCTTCAAT TGAAGATAAA TTTGTTCGAT TTAATGATAA GCCGCGACAT CAACTTTTCT 2040

TAGAGCCTGA AGGTCGTAAT ACAAATGAAG TATATGTGCA AGGATTGTCT ACAAGTCTTC 2100

CTGAACATGT GCAcGTCAAA TGTTAGAGAC GATACCAGGT CTTGAAAAAG CAGATATGAT 2160

GCGTGCCGGC TACGCAATTG AATATGATGC GATTGTGCCA ACGCAGTTAT GGCCTACACT 2220

TGAAACGAAA ATGATTAAAA ACTTATATAC TGCAGGTCAA ATTAATGGTA CATCTGGTTA 2280

TGAAGAAGCA GCAGGACAAG GATTGATGGC AGGTATTAAC GCTGCAGGTA AAGTGTTAAA 2340

CACAGGCGAA AAGATATTAA GTCGTTCAGA TGCATATATT GGTGTCTTAA TCGATGATCT 2400

TGTAACTAAA GGTACTAATG AACCTTATCG TTTACTAACA TCACGTGCAG AATATCGTTT 2460

GTTAcTACGT CATGATAATG CTGATTTGAG ATTGACGGAT ATGGGATATG AACTTGGTAT 2520

GATTTCTGAA GAAAGATATG CACGTTTTAA TGAAAAACGT CAGCAAATTG ATGCGGAAAT 2580

TAAGCGTTTA TCAGATATTC GTATTAAACC AAACGAACAT ACGCAAGCGA TTATTGAACA 2640

ACATGGTGGT TCTCGCTTAA AAGATGGTAT TTTAGCTATC GATTTATTAC GCAGACCTGA 2700

AATGACTTAC GATATAATTT TAGAACTTTT AGAAGAAGAA CATCAATTGA ATGCAGATGT 2760

TGAAGAACAA GTAGAAATAC AAACAAAATA TGAAGGTTAT ATCAATAAAT CACTACAACA 2820
AGTTGAGAAA GTTAAGCGTA T 2841

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3025 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

ATCTAATTTC AAACCCGGTG ATAAATTGCC AAGCGTGACG CAATTAAAAG AACGTTATCA 60

AGTAAGTAAG AGTACTATCA TTAAAGCATT AGGCTTATTG GAACAAGATG GTTTGATCTA 120

TCAAGCACAA GGCAGTGGTA TTTATGTGAG AAATATTGCT GATGCCAATC GTATCAACGT 180

CTTTAAGACT AATGGTTTCT CTAAAAGTTT AGGTGAACAC CGAATGACAA GTAAGGTACT 240

TGTTTTTAAG GAGATTGCAA CGCCACCTAA ATCTGTACAA GATGAGCTCC AATTAAATGC 300
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CGAATATTCT TATTATCATA AAGAAATCGT 
GAAATATTTA AATGATGATA TTGCTAAGGG 420

CTCTATCTTC GACTATTTAG AATCAAACAT GAAACTTCGT ATTGGTTTTT CAGATATTTT 480

CTTTAATGTA GATCAACTCA CTTCAAGTGA AGCTTCATTA CTACAATTGT CTACAGGTGA 540

ACCATGTTTA CGTTACCACC AGACTTTTTA TACAATGACT GGCAAACCCT TTGATTCATC 600

TGACATCGTA TTTCATTATC GTCATGCACA GTTTTATATT CCTAGTAAAA AGTAATAAAT 660

ACATAAAAAC GTCTATATCC CAGTTATAAA CTGGAGTATA GACGTTTTTT TACGATAATA 720

ACAATGGCTC AAATTGCTAT TATCTTGCTT AGGTTTTTCG TTTTAGAAGA ATATTGCTAC 780

AAAGACAGGC ACAACTGCTA CAACAACTAC ACCAACTAAC ACTAAAGCTA TACTTGCCAT 840

TGATTCTTCT ACAGGTCCTA ATTCTTTGGC TGGTGCTACA CCTAATGTGT GACCACTTGT 900

TCCAAGTGCT AATCCTCGGG CAATAGGGTT AGTAATTCGG AAAAGCTTTA AGAATTTATT 960

ACCTAGGGCA TAAATAATGA CACCATTTAA AATAACTGCT AATGATGTTA ATTCTTTTAT 1020

ACCACCGATA CCAGCTGATA CTGGTAACGC AATCGCTGTA GTTGCTGCTT GAGGTAACAT 1080

TGATAAAATA ACATCATTGG CAAATTGTGC TAACTTCGCA AAAGTTAAAA TAATTAATAA 1140

CGCTACAACT GTACCGATAC CAATACCTCC GATGATACGA TGCCAATGTT TAACAAGCAC 1200

TTCACGCTTT TTATATAACG GAATCGCAAA ACAGATTGTT GCCGGTTCTA AGAAGAAGTA 1260

AATAATGTCT CCACCTATTT TGTAAGTCTT ATACGGAATG CCTGTTAAAT AGAGGAAGGC 1320

CACACCAAAT ACCATACTGA CAAATAGCGG TGCGAATAAG AAGAAACGAT TAGTTTTTTC 1380

AAATAATATG GTCGCTAAGA AAAATGGTAT AACGGATAAC AGTATTCCGA AGTAAGGTGT 1440

GTTTaGTGCT AAGTGGTTAA TCaTGAGCTT GTGCCTCCTC TATTTTGATC TTTTTTGTGA 1500

CTTTGTCACC TTTAGATCTC GAAGTAACTT TCATAATAAT TTgTGTGACA TAGCCAGTAC 1560

AAAT&AGTAA TAGTATTGTT GAGACGATTA TTAGTCCAAT GATTAAAAAT GGTGCTTGGC 1620

TAATGACACC TAAAGAGTTA ACAACTGAGA TACCGGCTGG TACGAAGAGT AAGCCAATGT 1680

TATTTGTTAG TGTCGTTCCT ACTTTTTCGA CTTCGCCTAA CTTAACAGCA CCAGTACATA 1740

ATAATACAAA TAATAATACT AAACCGATTA CTGATGCAGG CATAGGAATT GGCATAAATG 1800

ATTCAATTAT TTTCGATACA AAGAGTACTA AAGCAATTAC AATGACTTGG TGAAAAAAGT 1860

GTGCTGGTTT TGATGCGTCT TTTTGTTGTT TCACGACCAT TGCCTCCTAC GTTTGATTTA 1920

ACTAAAGTAT AGATGGCTCA CTTCGATTTG CGTGATTTTT AGTCCGAAAT ACAAAATATC 1980

ATAGGTAAAA TGCATAAAAA AAAGGATTAC TGTTAAAGTA ATCCTATCGA CGCTTTAAAA 2040

TCTTTCATAA ATGAACGTCC AACTTGCATC TTGACACCAT TTGTCAATAT TACCATATAA 2100
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TGAATACGTA TAAAATAAGT GGGATTCAAT CGTTTTTCAT AACGATTCAA TGGCTCTGTT 2220 

GTTTCGTATT TATGATTCGT TGTATGTATG GTTGTAATAC CATTATGTGT GCCAATCCCA 2280

ATAATATTTT GTTGCTTTAA CATGTGAATT TTATCGTCAA TTTCAACAGG TAAGCTTTGA 2340

TCAAAATTCG CCGACATATC ATTCGCAATT GCACTTGCGT TATTATCATC TTTGGCTTTA 2400

GTCGCACGCA CTTTATTGAC TGCTTGTTCA ATACGTTTTT GACCAAACGG TTTCAAAATA 2460

TAGTCTGTCG CATTTAATTC AAATGCCTGT ACTGCGTATT GGTCATGTGC AGTTGCAAAA 2520

ATAATCGCAG GTGGCTCTTT CATCTTTTGA ATCTTAGCTC CTAATTCGAT CCCATTTTCA 2580

TCCATTAAAT TGACATCTAA AAATATAATG TCATATTGAT TGATCAGTAG TGCTTCCAAT 2640

GTTTCTTTTA CATTTTCTGC CTCATTAATT TCTTCAAAAC CACCAATTTC ATTTAATAAA 2700

TATGTTAATT CATTACGTGC TAATGGCTCA TCATCTATGA TTAATGCTTT CATATTTATT 2760

CCTCCTCTTG TCTTTCATAA GGAAGTACAC ACCAAAAAGT GGTACCGCTC GATGTCGATT 2820

CAAATTGTAA TGCTGCGGAT TTTCCAAATA ATCCTTTTAG GCGTAAGTTT AAATTTTCTA 2880

AAGCACTACC AGTTCCAGAC TCTGATTCTA CAGATGTnTC TCCCaACAAA TGCATTTTAT 2940

CTTTAGAAAT ACCCTGACCA TTATCTTGTA CAATAATACG TACATGTGTT GCAGTTTCTT 3000

TAATCACTGA CACGTCAATA TCGTT 3025
(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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-Jxl) -SEQUENCE^ DESCRIPTION:- SEQ-ID- NO: - 1-79 : - - - 

ACAGAATTTC ACAGCATTTT TAGATGAAAA AATAAGCCAG TCATAGCGTT GATTTAACAA 60

ATGAATATCA AAATTTAGTG GCTTTATATC AATAAAGGGT TTGTGAATAA TTGATACTAA 120

ATCACTTTGC ATGTCATTTG TTTGTGTCAT AACTACAACT GGCTTCATAT TTAAACGTCA 180

CTCCATTATT TAATGTTGTT CATTTAAGCG TTTTATAATT TCATAAGCAC CTTGCTCTTT 240

TAATTTGTTA CTCACTGTTT TGCCTAACTC AACCGGATCT GTTCCGTTCA TTGTATATTC 300

AAATCGTTCT TTACCATCTG GGGTCATAAT TAAACCTGTA AATTCGATTT CGTTTTGATC 360

TGAGATTGTA GCATATCCTG CAATTGGCAC CTGACAACTA CCATCCATTT CTGCTAAAAA 420

CGTTCGTTCA GCAGTCACAC ATTTTGCAAC CTCATCATTA TGTACTTTGC TTAATAATGT 480
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TAACAATGTA 


TCTCTATCAA GATAAGATGT TnCAATATCA TCTGACCAGC CCATTCTTCT 600

TAAACCAGCT GCAGCTAAAA TAATCGCATC ATAATCTTCA GTTTGTAACT TTTCTAATCG 660

TGTATCTATA TTACCTCTAA TCCATTTAAT CTCTAAATTA GGATACTTAG ATAATATTTG 720

TGCACCACGA CGTAATGAAC TAGTACCAAT AATACTGCCT TCTGGCAATT GGGATAGTGG 780

TGTATGTGTT TTAGAAATAT ACGCATCAAA AGGTAATTCT CTATCAGGGA TACAACCTAA 840

TGTTAAACCT TCCGGAATTA CACTTGGTAC GTCTTTAAGC GAGTGTATTG CCATATCGAT 900

ATi-i-rn-rcA AAAAGTTCAT GTTGTATTTC TTTAACAAAT AAGCCTTTGC CTCCGACTTT 960

AGACAATTGT TTATCTACTA TACGATCGCC TTTCGTGaCA ATTTCTTTAA TTTCAATTTC 1020

TAGATTTGGC TCGACAGCTT TTAATTTATC AATAAATTGC TGGCTTTGTG TTAAAGCTAA 1080

TTTACyTCTT CTGGAGCCAA CGACTrATTT ACGCATGTTC AATTCCTCCT AGGAACGGAT 1140

TGCTCTAGAT TATTTTCTCA ATTCACAAAA TGTGTTGCAA AAAATAAATT AATCATATTT 1200

AAGCAAAATA AAATAATGTT ATAGTATATT AAATATCTTG AATTCAACCA TTTGTTGATT 1260

CTAAGTAAAA TATAACTTCC ATATAATACT GTAATAATTG AAGAGAGTAT TACCTTCGGG 1320

TCAATGAATA TACGTTCACC AACTGAAATT ACACCCCACT GTGTACCTAA AATAATACTA 1380

AATATGAGAA TTATCCACCC ACTTAACGTT GAGTAAAACA CAATTGATTC AAGTGTAGCA 1440

ACGCTACCAA TTCTAAAGTA TTTTTGATCA AAACGTTTTT CCTTCAAATT ACGGTATTGC 1500

ATGATATACA GTAATGCATT GACAAAAGCT AAGGCAAAGA AGACATAACT TAACACAGCT 1560

AGACCGATAT GGACTAACAG TAACTCGTCT ACAACAGCAA TTTTCTGAAC CTTATTAGTA 1620

TAATGTGTCG GTTGAAATGT ATTCATCCCT AAnAGTGTTA ACCCTATTAA ATTCCAAGGA 1680

AAAACACAG 1689


(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

nTGGnTGGCT TTTCCTATTG GACCAAATGG ACCnTTTACC TGGCCnTTCC CAGGACACCC 60 

CGCTTGTGCC CACATTCCAA TCGGAAAAGG TGTATGTGGT ACAGCCGTTT CAGAACGTCG 120 

TACACAAATT GTAGCTGATG TTCATCAATT CGAAGGACAT ATCGCTTGTG ATGCTAATAG 180 
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CGATGCCCCT ATAACGGATC GATTTGATGA CAATGACAAa GAaCATCTTG AaGCAATTGT 
TAAAATTATT GAAAaGCAAC TCGCATAAAA GGACATCAGC ATTTTCAATA AAGTGTTGAC 
AGTTAGCAGG AAAATGTTAC AATAATCTTT GTGTGAATTA ACGAAAGTAG CAGTTGTATA 
TTATTGAGCG CTATGTTGTT CCCAATGCGG ACGTGTCACG TAACTGTCGC TATAAGGTGA
AGACACATAA AACAATATAT CTTAGTAAGC ATGCAACACT CTTTTTTGTT TATTCATAAC 
AACAAAAAAG AATTAAAGGA GGAGTCTTAT TATGGCTCGA TTCAGAGGTT CAAACTGGAA 
AAAATCTCGT CGTTTAGGTA TCTCTTTAAG CGGTACTGGT AAAGAATTAG AAAAACGTCC 
TTACGCACCA GGACAACATG GTCCAAACCA ACGTAAAAAA TTATCAGAAT ATGGTTTACA 
ATTACGTGAA AAACAAAAAT TACGTTACTT ATATGGAATG ACTGAAAGAC AATTCCGTAA 
CACATTTGAC ATCGCTGGTA AAAAATTCGG TGTACACGGT GAAAACTTCA TGATCTTATT 
AGCAAGTCGT TTAGACGCTG TTGTTTATTC ATTAGGTTTA GCTCGTACTC GTCGTCAAGC 
ACGTCAATTA GTTAACCACG GTCATATCTT AGTAGATGGT AAACGTGTTG ATATTCCATC 
TTATTCTGTT AAACCTGGTC AAACAATTTC AGTTCGTGAA AAATCTCAAA AATTAAACAT 
CATCGTTGAA TCAGTTGAAA TCAACAATTT CGTACCTGAG TACTTAAACT TTGATGCTGA 
CAGCTTAACT GGTACTTTCG TACGTTTACC AGAACGTAGC GAATTACCTG CTGAAATTAA 
CGAACAATTA ATCCGTTGAG TACTACTCAA GATAATACGG TCAATACCAA CACCCACAAT 
TGTGGGTGT 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 698 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:
AAATCCCTTt GTtaAAgTsC AAAtTTTTCc AACrgCTTTA AtArGACCCA TATTACCtTC 
TTGGATTAAA tCmAGGaATG AcATACCACG ACCaCGTATC TTTTAGCAAT ACTTACAACT 
AAACGTAAGT TCGCTTCTGC AAGTCTTGAT TTTGCTACTT CATCACCTTG TTCAATACGT 
TTGGCTAATT CGATTTCTTC TTGTGCACTT AATAAGTTAA CACGCCCAAT TTCTTTAAGG 
TACATACGAA CTGGGTCATT TATTTTAACA CCTGGAGGGG CACTAAGATC ACTTGGATTC 
AGTTTCTCGT CAGTATCTGA ACTATCTTTT TCATTAACTA GTGAAATATC ATTATCATTT 



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GCAATTTCTT CATGACTTAA ATGACCCTCT TTTTTACCTT TTTCAATTAA TTGCTTCTTA 4 BO 

ACATCTTCTA ATGTTAATGT CGGATCAATT U Tri Xyl TI T T TAATTTTAAC TGTGTTATCA 54 0 

GACATGAAAC GGCCTCCCGA TTTTAAATAT GAACATTCGA AATTTATTCA ATATTGCTAT 600 

TTTAAACGAA ATTCTTAATT AATTCCATCC ATATTTTnAA TTTTATTTTA CAAATTGGGA 660 

ACTAAATCCC CAATATTTAT TTTTCAATAG TGGTGGTT 698 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

ACTTGATGAT GTATACAATG TATTTCAAGA ATATTATCAA AAAACATCTA ACATTAAGTT 60

TTGTAGAATT CACAATTCTA GCTATTATCA CTTCTCAAAA TAAAAACATC GTTCTTCTTA 120

AAGATTTAAT TGAAACAATC CACCATAAAT ACCCTCAAAC TGTTAGAGCT CTCAATAATT 180

TAAAAAAGCA AGGCTATCTA ATAAAAGAAC GCTCAACTGA AGATGAAAGA AAAATTTTAA 240

TTCATATGGA TGACGCGCAG CAAGACCATG CTGAACAATT ATTAGCTCAA GTGAATCAAT 300

TATTAGCAGA TAAAGATCAT TTACATCTTG TTTTTGAATA ATATCTCTAT TACGCAAGTG 360

TGCTGTATTC TAAAGTGCAC TTGTGTTTTC TATTTTTTAA TAAAACCTCA GCACATAATG 420

AACAACTTTC TATTTTCTAT ATCACTTAAA ACCATTTCCG AAATTAAACC TCAGCACATT 480

CAAAGCCCCA CTTTATTCTT AAAAATATTT TTTAACTCAT ATGTATTAAA CCGCTTTCAT 540

TATAAAAAAT ATCTCTATAT TtTATCTGtT TtTATTAATC GAAATAGCGT GATTTTGCGG 600

TTTTAAGCCT TTTACTTCCT GAATAAATCT TTCAGCAAAA TATTTATTTT ATAAGTTGTA 660

AAACTTACCT TTAAATTTAA TTATAAATAT AGATTTTAGT ATTGCAATAC ATAATTCGTT 720

ATATTATGAT GACTTTACAA ATACATACAG GGGGTATTAA TkTGAAAAAG AAAAACATtT 780

ATTCAATTCG TAAACTAGGT GTAGGTATtG CATCTGTAAC TTTAGGTACA TTACTTATAT 840

CTGGTGGCGT AACACCTGCT GCAAAtgctG CGCAACACGA TGAAGCTCAA CAAAATGCTT 900

TTTATCAAGT CTTAAATATG CCTAACTTAA ATGCTGATCA ACGCAATGGT TTTATCCAAA 960

GCCTTAAAGA TGATCCAAGC CAAAGTGCTA ACGTTTTAGG TGAAGCTCAA AAACTTAATG 1020

ACTCTCAAGC TCCAAAAGCT GATGCGCAAC AAAATAACTT CAACAAAGAT CAACAAAGCG 1080
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AAAGTCTTAA AGACGACCCA AGCCAAAGCA CTAACGTTTT AGGTGAAGCT AAAAAATTAA 
ACGAATCTCA AGCACCGAAA GCTGATAACA ATTTCAACAA AGAACAACAA AATGCTTTCT 1260
ATGAAATCTT GAATATGCCT AACTTAAACG AAGAACAACG CAATGGTTTC ATCCAAAGCT 1320
TAAAAGATGA CCCAAGCCAA AGTGCTAACC TATTGTCAGA AGCTAAAAAG TTAAATGAAT 1380
CTCAAGCACC GAAAGCGGAT AACAAATTCA ACAAAGAACA ACAAAATGCT TTCTATGAAA 1440
TCTTACATTT ACCTAACTTA AACGAAGAAC AACGCAATGG TTTCATCCAA AGCCTAAAAG 1500
ATGACCCAAG CCAAAGCGCT AACCTTTTAG CAGAAGCTAA AAAGCTAAAT GATGCTCAAG 1560
CACCAAAAGC TGACAACAAA TTCAACAAAG AACAACAAAA TGCTTTCTAT GAAATTTTAC 1620
ATTTACCTAA CTTAACTGAA GAAGAACGTA ACGGCTTCAT CCAAAGCCTT AAAGACGATC 1680
CTTCAGTGAG CAAAGAAATT TTAGCAGAAG CTAAAAAGCT AAACGATGCT CAAGCACCAA 1740
AAGAGGAAGA CAATAACAAG CCTGGCAAAG AAGACAATAA CAAGCCTGGC AAAGAAGACA 1800
ACAACAAGCC TGGTAAAGAA GACAACAACA AGCCTGGTAA AGAAGACAAC AACAAGCCTG 1860
GCAAAGAAGA CGGCAACAAG CCTGGTAAAG AAGACAACAA AAAACCTGGT AAAGAAGATG 1920
GCAACAAGCC TGGTAAAGAA GACAACAAAA AACCTGGTAA AGAAGACGGC AACAAGCCTG 1980
GCAAAGAAGA TGGCAACAAA CCTGGTAAAG AAGATGGTAA CGGAGTACAT GTCGTTAAAC 2040
CTGGTGATAC AGTAAATGAC ATTGCAAAAG CAAACGGCAC TACTGCTGAC AAAATTGCTG 2100
CAGATAACAA ATTAGCTGAT AAAAACATGA TCAAACCTGG TCAAGAACTT GTTGTTGATA 2160
AGAAGCAACC AGCAAACCAT GCAGATGCTA ACAAAGCTCA AGCATTACCA GAAACTGGTG 2220
AAGAAAATCC ATTCATCGGT ACAACTGTAT TTGGTGGATT ATCATTAGCC TTAGGTGCAG 2280
CGTTATTAGC TGGACGTCGT CGCGAACTAT AAAAACAAAC AATACACAAC GATAGATATC 2340
ATTTTATCO^ ^ACCAATTTT AACTTATATA CGTTCATTAA CACATTCTTA TTTGAAATGA 2400
TAAGAATCAT CTAAATGCAC GAGCAACATC TTTTGTTGCT CAGTGCATTT TTTATTTTAC 2460
TTACTTTTCT AAACAACTTC TGAAACGCCT CAACACTTTC TACTCTGATT ACATATATGA 2520
CATTTTTAGG CATTAAAAAA TCGAACTAGA CAAGATGCTC ATTGCATTTC GTACTAGTTC 2580
GATTCATGAA TAATTAGATT TAAAATGTCA TTTGAATCCA AGTGACAACA TTATTTATAT 2640
TTAGAATATT AACGTTAGTA TAAACGTCCA AACACAAATA AAAGCAACAA ATATAATACT 2700
GTATTTTAAC GTCATTTTTA ATAATGCAGA TTCTTCACCA ACTTTTTTAA CAGCTGCAGT 2760
CGCAATGGCA ATTGATTGTG GTGAAATAAG TTTCGCTGCT ACACCACCTG CAGTGTTAGC 2820
TGCCACAAGT AATGAACCGC TTGTTGAAAT TTGTTGTGCC ACTGTCGCTT GAATAGGTGC 2880
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TGGAGAGAAT AATGGGAAAA TTGCTCCCGC TTTAGCAATA CCTTGTCCAA TTGCTACAGT 3000 

CAAACCACCG TATGTCATAA CTTTAGCAAT AGCTAGGATA GCTGAAATTG TAAGGATCGG 3060

TAACCATAAT TCTTTAATTG CTTCGACCAA TAAAGCACCT GCACTTTTCC ATTTTAACTT 3120

CGTAATTAAA ATTGTAATAA TTACTGTTAA TAAAATCGCT GTCCCAGTTG CACCAATTAA 3180

ATCGAGACGC AACGCAATTC CTTTAGGCGA TAAATCACTC ACAGTATTTG GAATTGGCAA 3240

TTTTATTACT AAACTTTCAA GTGCACCTCC AGGTTGGAAT AATTTTTTGA AGAATGGTGC 3300

ACTCCATACT AATACAAAGG CAGTTAAAAT TACGAACGGA CTCCAAGCAA AGACAATTTC 3360

TTTAGGCGTT CGTTTTTGAA TTTTATGTTC AGACGCTTCC AATCTGAAAA TGTTTTTCGG 3420

TTTAAATTTA CGACAAACAA ATGCTAACAC CACCATTGTT GCTAGTGATG GAATAATGTC 3480

TGCTAGTTCT GGACCATGGA ATATTGTTAA TAATAATTGT AATCCAGTAT ATGTACCACT 3540

CACTGTTAAA ATGACAGGTA AAATTTCTTT AATACCTTTC ATACCATCTA CAATGAATAC 3600

TAAAACAAAT GGAATAATAA AGTTTAAAAT TGGAAGTGTT AATGCTGAGT ATCTCGCAAC 3660

ATCTAATGTT GTAACGCCTC CACTTAAGTT AAACGTATCA ATAATACTAA CTGGTAAACC 3720

AATTGCACCA AAGGCACCCG CCGCACCATT AGCAATTAAA CATAACATCG CTGCTTTTAA 3780

TGGTTCAAAT CCAAGTTGAA TTAATAATAC TGCACAAATC GCAATTGGCA CACCAAATCC 3840

TGCTGCACCT TCTAAAAATG CGTTGAAACA AAATCCAATT AATAATAGTT GGATTCTTTG 3900

GTCCACTGAA ATACTTGCAA TACTATCTTG AATAATAGAA AATTGTCCTG TTTTAATAGA 3960

AACTTTATAT AACCAAACTG CCATTAAAAC GATATATCCT ATTGGGAAAA TACCGGCAAC 4020

AACGCCTTCT GTAATCGCAC CTGCTGATAC ACGCGCTGGT AATTCAAATA CAAATAAAGC 4080

CACAATCAAT GTAACAACCA AAGTTGTCAA TGCTGCATAA ATGCCTTTCA TTTTAAAAAC 4140

GGTT&AGCAT AATAAAAATA AAATAATAGG TACTGCTGCA ACTAAGGCTG ATAATCCGAC 4200

ATTATCGAAT GGATTTACAG TAAGTAGTGT CATAATGACT CCCTCTCTTT ATATAAAATA 4260

TTTATCATTC TGATTAATCT ACAACCTATT TCAACTTATA TTTTGCGATG ATCACATATT 4320

TAAAATGTAA CACTCCTATA TGTGACAGGC AATCGAATTT TTACAAAAAG TTCACAAAAT 4380

ATACACAATA TTTAACTATA ATAmATAATA TATCaTntTA ATTATAAATA CTAGATATTA 4440

TTTATAATAA TCTCAGGAAT TCGCTTCAAA ACTGCATCAT GAGAGTTTAT ATTTTTATTG 4500

AGAATCTCTC ATTTTATGAA TTGTAGGAAG TAAACAAAAT ATGACAAGCG TCAAACCAAT 4560

GATAATGATA AATATCATAT TAAACCATAG TAAATTGAAT TGATGATGGT GTTGTATTTG 4620

CCAAATTTCT AATACTGTGA AGATAGACAT ATAGCTCATA ATCTCTAAAT TTAACGTACT 4680
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AAATCGTTCA TAGTATCTAC CTGCAATGAA AAATATAAGC CAAATCACTA TAAATGCGCT " 48 00 
ATTAATCAAA AGCAGCACCC ATTTATCAGC AAAATTATCA GCATCCCCTG CTAAATTATA 4860

ATGAATAGGC ACTTTGGTTG GTAATTTTGG ATAGGTCACT ACTGTATAGC ACATCATAGC 4920

TAAGTAAATA AGTAGACTTA ATATTGTAAA AGACCTGATT TTAGACATTC TATCGCCTcT 4980

TcTTTACATT TTATGTATAA CACTCTGCCT ATTTTACCTT TTAATaCATT ACCCCAAcGA 5040

TtAAaCAATA tGTAaTGATA CTATAATTGC GTCAGGAGTA TCCGCTTGTT AAATGTGCAT 5100

AGCTTATATT TAGCTGTTTA ACATGCCACA TAATGATTCG AATTATT 5147

(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CACTTACTTC CACCATTATC ATAACTTTAA AATGGATATA nTTCATCAAA CATTATCTAA 60

AGGOGTCGCA CCTACACCAA CACCATCCAA CAATTAACTT ACAACTCTGC GATTACTTCT 120

TCAGCAGCAA CTTTCACnTG CGTAATACAA TCAGGTAGTC CAACCGCTTC AAAAGATGCA 180

CCAGTTACTC TAAGTCGTGG ATATGTTTGT TTAATATGTG CTTGAATCTG TCTAATTTGT 240

TGAATATGAC CGACATGGTA CTGTGGCATA CTTTTCGGCA AACGATTGAC AATTGTAAAT 300

TCAGGATCAC CTTTAAATGT CATCATTTGA CTTAAATCTC TACGTACAAT CGATACTAAT 360

TCATTATCTG TATGATCATC AACCACAGTA TCACCTGGTT TACCTACATA CGCACGAATC 420

AAAACCTTAC CTTCCGGTGT AGTAAATGGC CATTTTTTCG ATGTCCAAGT ACATGCGGTA 480

ATGTCTGTAT CACTCGTTCT CGCAATTACG AAGCCAGTAC CATCATGGGT ATTTTCAATG 540

TCTTTTTCAT CAAATGCCAA TACAACAGTT GCAACAGTCG TACTATCCAT CGTTTTAAAG 600

TAATCAAATG CTGGATCTTG TCCGAACCAA TTTAAAAACA CTTGATGTGG TGTCGTTACT 660

AATACGCCAT CATACACTTC TTCTAGTTGA TCATTGTAAA CAATTTTATA TTGTTTTTGA 720

GATGTAATTA TATCATCCAC TGACGTATTG TAGCGTATTG TCACACCTTT ATTTTTAACA 780

TCTTGTTCTA ATGCTTCAAT AAATGAGCTT AAACCATGCT TAAATTGTTT GAATTGTCCT 840

TTCGGTGCGC CAGGATATAA TTGTCTTTGT TTCAGACGCT TATTTTTCTC ATCCTTCATA 900

CCTTTTATCA GACTTCCGAA TGCCTCTTCT TTTTCTTTAA AATTAGGAAA CGTACTCATC 960
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TCAAGTACCT CATTACCTAA TCTTGCTCTG AAAAATGCAC CAACAGAAAT GTCACCATCC 
TGCATTTGAG TAGGTTTTTT TAATAAATCA AACCCTGCTC TTAATTTACC AAGTGGCGAT 
ATTAATTTTG TAGTAACAAA TGGTTTAATA TCTGTTGGAA TACCCATAAT TGAACCACCT
GGAATCGGAT ATAATTTATT TTTCGCAAAA ATATATGATT GTCCAGTCGT ATTTGTAACA 
ATATCTTGTT CTAATCCAAT ATCTTTCGCT AATTCTGTCA TAATCGTTTT TC 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



1080 
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1260 
1312 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
TTTTACAATA AAAATATGAT ATACTACTTG TCGTATATAA GGAACGGAGG ACAATTTATG 60
CATACATTTT TAATCGTATT ATTAATCATT GATTGTATTG CATTAATAAC TGTTGTACTA 120
CTCCAAGAAG GTAAAAGCAG TGGACTTTCA GGTGCCATCA GTGGTGGTGC TGAGCAGTTA 180
TTCGGTAAAC AAAAACAACG TGGCGTCGAT TTATTCTTAA ATAGATTAAC AATTATTTTA 240
TCAATATTAT TTTTTGTACT TATGATTTGC ATAAGTTATC TTGGTATGTA AGGTCCGGCG 300
ATGTAAATGT CGGGCTTTTT TATTTATAAT TAAGAATGTA ATAGTTTAAC AATAAGCTAT 360
GTAAAATATA TAGCCTAGTT AAGTATGCAA AGGGAGCGTT AGATTTATGC AGATAAAATT 420
ACCAAAACCT TTCTTTTTTG AGGAAGGTAA ACGTGCCGTG TTATTACTAC ATGGTTTTAC 480
AGGCAATTCG TCTGATGTTC GTCAATTAGG TCGATTTTTA CAAAAGAAAG GTTATACATC 540
ATATDCACCG CAATATGAAG GCCACGCGGC ACCACCAGAT GAAATACTGA AATCTAGTCC 600
TTTCGTTTGG TTTAAAGATG CGTTAGATGG TTATGATTAT CTTGTTGAAC AAGGTTATGA 660
TGAAATTGTT GTTGCTGGTC TATCATTAGG TGGGGATTTT GCTTTAAAAT TAAGCTTAAA 720
TAGAGATGTA AAGGGTATTG TAACGATGTG TGCTCCTATG GGTGGCAAAA CTGAAGGTGC 780
CATTTATGAA GGCTTTTTAG AATATGCACG CAATTTTAAA AAGTATGAAG GTAAAGATCA 840
AGAGACTATT GATAATGAAA TGGATCATTT TAAACCAACT GAAACTTTAA AAGAACTAAG 900
TGAAGCATTA GATACGATTA AAGAGCAAGT TGATGAAGTG TTGGATCCTA TTTTAGTGAT 960
TCAAGCAGAA AACGACAATA TGATTGATCC ACAATCCGCA AATTATATAT ATGACCATGT 1020
AGATTCTGAT GACAAAAATA TCAAGTGGTA CAGTGAATCT GGACATGTTA TTACGATTGA 1080
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VjALjAx 1 1 i AA 


CATTAGAAAG 


GAGGGGCATA 


ATGAATTTAA 


AGCAATCTAT 


1200 






A 'I'M' & R TP 7\ TV /-» 

Al lAAlLAAu 


CTGAATATGA 


ACCTATGTCA 


GTGTCAGATT 


TTCAAGATGC 


1260 


5 




a pp ja ptp ppp 


ACTCGTTTAG 


AGATTTAATT 


AAGGTGCTTG 


TGGAGTTAGA 


1320 




APAATPAfmA 


TT R & TPP A AT 
X 1 An X LbAAL 


P^ TV TV IV TV TV TV /"» 

GTACAAAAAC 


AGACAGATAC 


^> ft H K A Ik ft k ^^^^ 

CAAAAAAAGC 


ATAGTTATAG 


1380 


10 


Ann Tr a a a 


A A ATTP&T& TV 
AAA1 IvjAlAA 


7V TV TV TV /"^/^^T W|» 

aaGGAACGTT 


iv tv^tns^n iv iv * m 

AAGTCAAAAT 


tv iv iv iv iv ft t 

AAAAAAGGCT 


TTGCATTCTT 


1440 


AAfiAfTTflA A 
nnun^* V« X vvnn 


V»A 1 vj/\o<jA 1 A 


TPP TV TV TV «T» TV 

I GGAAGATAT 


ATTTATTCCC 


TV /"Vi - • TV TV TV ft 

CCGACGAAAA 


TTAATCGTGC 


1500 




v.1 1 Vrj/\ X Vj\jn 


/SfiTR otptt n 
v>A 1 At- i\j I 1 A 


1 1 " 1 '/"»*T^ TV /""^ 7V TV TV 

in gtagaaat 


ft tv iv rr*/ - ^ fv 

CCATCAATCA 


ft ft ft rv^y"^ iv m 

AAAGGTGAAC 


ATAAAGGTAA 


1560 


IS 


AATfYlA nfi/V« 
nHl ^vjrAMurv7Vj 


GAAGTTAAGT 


CGATTGAGAA 


G CATTCTGT A 


ACTCAAGTTG 


TTGGTACGTA 


1620 




X AVVj x Vjnnuv. X 


AGACATTTTG 


GCTTTGTTAT 


TCCGGATGAT 


Jh 4« n n ■■■■■■ m 

AAACGTATTA 


TGCAAGATAT 


1680 




X X 1V_A1 


*. JA 7APPTPTA TV TV 

AAA(j\jfLAAA 


GTTTAGGCGC 


AGTCGATGGT 


CATAAGGTAC 


TTGTACAAAT 


1740 


20 


TT*A ptta apt tat 
1 rVw X AAvj 1AX 


PPTP TV T>/~i/*~«n^< 1 ' 


CAGATAATCC 


AGAAGGACAT 


ATTTCTGCTA 


TTTTAGGACA 


1800 




*T* A A A A A TP A <-p 
X AAAAA X VjA X 


CCTGGCGTAG 


ATATTTTATC 


TATTATCTAT 


CAACATGGCA 


TAGAAATTGA 


1860 


2S 


Al X 1 V_ 1 Vj A X 


f* TV TV /■*T|V^ > r > *»»TV /""» 

gaagtgttac 


AAGAAGCTGA 


AGCAGTACCT 


GATCATATTG 


AAAATACTGA 


1920 


Ml X AAALtXjL. 


CX»TCATGATT 


TACGTGATGA 


ATTGACAATC 


ACAATTGATG 


GTGCTGATGC 


1980 




*pa A A/^ A n*ra 
XMAAVjAC HA 


P Ti TP TA P^ /""» TV TV 

viA 1GAL.GCAA 


TTAGTGTTAA 


AAAGTTAGCG 


AACGGTAATA 


CGCAATTAAC 


2040" 


30 


^Pl^^^^ A 7A ft 
IVjlAnurlAl 1 


PTP TA TP T f+ TA 

GCTGATGTCA 


GCTATTATGT 


AACAGAAGGT 


TCTGCATTGG 


ATAAAGAGGC 


2100 






PP A V"^ TV TV / '»fftf» 

VjCGACAAGTG 


TATATCTTGT 


TGACCGTGTA 


ATTCCAATGA 


TTCCACATCG 


2160 




ATTAAGTAAT GGTATTTGTT CATTGAATCC TAATGTTGAT CGTTTAACTC TAAGCTGTCG 2220

CATGGAAATC GATGCTAGTG GTCGCGTTGT TAAACATGAA ATTTTTGATA GTGTTATACA 2280

TTCTGATTAT CGAATGACGT ATGATGCGGT AAATCAGATT ATTACTGAAA AGGATCCTAA 2340

CATTCGCGAA CAATATAATG AAATTACGCC TATGCTAGAT TTAGCACAAG ATTTATCTAA 2400

TCGTTTGATT CAAATGAGAA AACGACGTGG TGAAATCGAT TTTGATATTA GTGAAGCAAA 2460

AGTATTAGTT AACGAAGACG GTATACCAAC AGATGTTCAA TTAAGACAAC GTGGCGAGGG 2520

TGAACGTCTA ATTGAATCAT TTATGTTAAT TGCAAATGAA ACAGTTGCTG AACATTTTAG 2580

TAAGTTAGAT GTACCTTTTA TTTACCGAGT GCATGAGCAA CCTAAATCAG ATCGCTTAAG 2640

ACAATTCTTT GATTTTATTA CAAACTTTGG CATCATGATT AAGGGTACTG GCGAAGATAT 2700

TCATCCAACA ACACTTCAAA AGGTTCAAGA AGAAGTAGAA GGTCGACCTG AACAAATGGT 2760

CATTTCAACA ATGATGTTGC GTTCAATGCA ACAAGCGCAT TATGATGATG TGAACTTGGG 2820

ACATTTTGGC TTATCAGCTG AATATTATAC GCATTTTACA TCACCAATTA GACGTTATCC 2880
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A G A AGTG AAH 


V_Vj X 1 V7V7VJ/V\VJ 


APAAATTGPP 


TPAP.TTAGCT 
x uxivj x x i*ivj tn» x 


GAACATACTT 


CTAAACGTGA 


3000 




a pg tpgtgpt 


ATTG A GG PAG 


A A pgtg a t a p 


TPATGaATTG 


AAAAAAGCAG 


AATATATGAT 


3060 




TP A ar ATATT 


ggtgatg a at 

V-FVJ X VJi"V X UAA X 


TTG A AGGTAT 


TGTPAGCTCA 


GTAGCTAACT 


TCGGTATGTT 


3120 




rATTf; A ATTG 
\^r\ X iunnl IVj 


PPAAATAPGA 


TAGAAGGTAT 


PrGTTCATATT 


GCGAATATGA 


CTGATGATTA 


3180 




i lALLo 1111 


pa afiAfirriTP 

unnVJAUUVJ X V_ 


AAATGGPATT 


AATTPrGTGAG 


CGTCAAGCTA 


AAGTATTTAG 


3240 


10 


a a r v*mn r m.'h. c 

J\r\ 1 1 v7w luAL 


APAGTTAAGG 


TTAAAGTGAP. 

X 1 /WW X VJn- 


GPATGTTGAT 

njv^-^^ x w x x \jn 


GTAGATGAAC 


GATTAATTGA 


3300 




1111 V — t\J-\J-\. X 1 


<^TannT a tgp 


PTTTAPPGAA 


AAA TG ATCG A 


TCACAGCGCC 


CAGCGCGAGG 


3360 


15 


Xiviununnl 1 


PAUGPPA AA A 


PGPGTGGT AA 


ATPATTAGAT 

r\ X \— X X ^\\Jr \ X 


AAATCAAAAT 


CTGATGATAA 


3420 


nnn r T , {~ , rz w v a a g 

ovjvj- 1 1 nAu 


aaaaaannTa 


AGP A A PGTAA 


AGGTAAAAAP 


CAACGTAATA 


ATGATAAATC 


3480 




Auulivil nVjr 1 


A 2&.PPAT A AGP 


P A TTTT ATA A 
V— r\ X x X ininn 


AGATAAAAGT 


GTGAAAAAGA 


AAGCACGTCG 


3540 


20 


Taafaaaaaa 


TaaPP APP A A 


TP APPTG AGT 


ATGAATGGPT 

/A X VJ/Vl X VJVJv. X 


AAGAAGAAAT 


CACCAGGTAC 


3600 




ill IAvjwVjvjAA 


Art 1 V_VJ 1 AAwVJ 


PA AGAP ATP. A 


TT ATA ATATT 


GAAGATAPGA 


TTGAAGCGGG 


3660 




AH I ib 1A1 Hj 


r» a a anr* a p a n 

l w AAw<-TV»A\»-ftV_' 


A A ATA A A ATP 


AATTPGPPGA 

i^V^X X V^VJV— V-.VJUTV 


GGTAGTGCTA 


APPTTAAAGA 


3720 


25 


TAGTTATVj L.vj 


iwAAvj l 1 AAAtt. 


a PPPTP A A AT 


GT ATTTG A AT 
VJ X n. XXX VJ/\n X 


AATATGCATA 


TAGCACCATA 


3780 




/■V" a jv /** 7v a nnr* 
\_wAAwi\Aw\jrO 


AAlCb 111 1A 


ATPaPP ATPP 


TPTTPGTTPT 

X V— X X VwVJ X X \— X 


PGAJVAATTAT 


TATTGCACAA 


3840 




r/VTPS A AH/"* 

vjV-Qj 1 uAAA 1 v. 


Al 1AAA1 IwVJ 


GTGATPAA AP 

w 1 V_fA 1 VvXVV.V-. 


APGTGAGATT 


GGTTATTPGA 

VJVJ X X X X >— -VJ«k 


TTGTG PPGTT 


3900 


30 


AnAVjL 1 1 lrVl 


TTT a Rnr , aT^ 

1 lvjAAV_rV-A 1 


GAPATTGTA A 


AGTATTAPTT 

n\J X i~V X X i~LV_ X X 


GGTGT t G CAC 


GAGGTAAGAA 


3960 




AAA ATaTPlT 


aaa rrTra an 


PTTTP a a an A 


AAAAGPAGTP 


AAAPGAGATG 


TTGCGCGCGA 


4020 


35 


TaT^aa appp 


v_-wl 1/\1 1/V-V-J 


PP A TTT A GTT 


GPTTAATPGG 


G CTATATTTG 


ATATAGTTAT 


4080 




A lu 1 1111 


gta a aTTara 


A A APT ATP- AT 
MnnVJ X 1 Von X 


TTGTTTGATT 

X X w XXX X X 


TATTATTTCG 

XI X X mm, — > A N— i 


GGGACGTTCA 


4140 




TGGA2PTPG ap 


Art/^TinTPPPP 


PGAGPTPATT 


AAGPGTGTCG 


GAGGGTTGTC 


TTCGTCATCA 


4200 




apa ap An 


TTraTa a^p a a 


PTGGPAAATP 


AAACAATAAT 


TTCGCAGTAG 


CTG C CTAATC 


4260 




GCACTCTGCA 


TCGCCTAACA 


GCATTTCCTA 


TGTGCTGTTA 


ACGCGATTCA 


ACCTTAATAG 


4320 




raTaT^rraa 


A p a PTPPPGT 


TTG A AGTPTG 
1 X wnnu X v— x vj 


TTTAGAAGAA 


ACTTAATCAA 

X x^b^— — v n— rtn 


ACTAGCATCA 


4380 


45 


iT-rl 1\jVj1 1 Vj 1 


1 1 A 1 W\v- 111 


TPATGATGPG 
1 V—A 1 vxM. 1 VjA-vj 


AAAPPTATPG 


ATAAAPTACA 


CACGTAGAAA 


4440 




GATGTGTATC 


AGGACCTTTG 


GACGCGGGTT 


CAAATCCCGC 


CGTCTCCATA 


TTTGTAGCCT 
TGCATTGCAG 


4500 
4560 


50 


ACAGCCTTTG 


TGGTTGTGGG 


CTTTTTTATT 


TTGTGTTTTT 


CAGGGGATAA 


AATTTGTTGT 


GAGTATTGAT 


ATAGCAGTGT 


TTGTATAGGT 


GTTTATTTGA 


TGGAGGAAAG 


4620 




AGTAATAAGT 


GATTATGAAT 


TAGTTTTTGA 


GATATAAGGG 


GACAGTGATG 


TGTGTCAAAT 


4680 
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TTATACGCAA AAAATTCTCC ATGTTATATA TGTCAATATA AAAATGTGAA TCGTCTACAC 4 800 

TTAATTGGAT AAATGGCTAC TGAAAAAGAA CTTTTCATTT TTGTTACGTC ACTAAGTGGG 4860

TGTAGTTATA AAGAGATGAG CCGAGTTTTG ATATTTTCAT TAGAATCAAT ATGCCTATTA 4920

ACACAATCAG CAATAGTTGA CGAGACGGAA ATAAAAGAAG TCGTAGTTAA GAAATGCATT 4980

TCACAACATA CCATTGTAGC CATTTTTATT GTTTTGGATG ATAAACTCTT TTTGGAATTT 5040

TTAGTTTTTA TAATTTGCAA CTACACTACT TCTTTTACTA ATATTAATGT CTAAGTAATC 5100

GATAAAAAAT TTTCCATTGA ATAAATGAGA AGTTAAAAAC TTTACTTAAC CTTTCycATT 5160

GCATTTTCCT ATTCACGATT TTAAGAACCC AACATACTAC AAACGAATTT TAAAAGGCGA 5220

GAGTAAAGCT TACTTGTTTA TTATACATAT TTAAAATCCA AGAGTCAGAA CAGACTACTC 5280

CTCTTTATAA CTATAAAAAA TAGCTATGAA AAAATCTATC GTCATAGATT CCTTCATAGC 5340

TAATCTTAGT ATGTTTATTT TTATTTTAGG ATGCTATTTA TCAACTCAAC ATATAACTCA 5400

CTATTTTTAT AACCTTCTAA TATATCATTA ACTTGTCTAA TAGGTATTTC TGGTACTTCT 5460

CTAATGTTTT CCAATTTTGT TTTAAATTGT TTTTTTGTTA TTTGCTCTTT ATTTGTAGCC 5520

AATTGGAACA AGTAAGAATC TAGCATATTA ATTTCTTTAT ATGAATACAT ATATCTTAAT 5580

AACACTAAAT CTCTAGTTTT TAAGTTAGGC GCTAGTTCTT CTTGTAATTG TTCTATTGAT 5640

TGTyTCATTA ATAACAATCT CATTTCTAAT TCTTCATTAT TCATTTTATC ACACTCTTtT 5700

TATATTAATG CTTGACCAAC TTGGGAAACC CAAAACCCTA TGCTTCTTGC AGTAGAATCT 5760

TTAATACCAG TTCCCATCAA TGCTTGTGAA ACTTGACCTT GTACATTTCC CCATGTAGCC 5820

TCTTCTTGTT TTAATGCATT ATTCAATGCG GGATTTACAA ATTTATCCCA TCTTTTTTTT 5880

ATGATTTTCC GGCACGGGGA CTGATTTCTT TAACACCATT AAACACAGAT TTTTTATTTT 5940

TAATPATAGC TTTATAGTAT CATGTTGGCT AAGCTATAAA TAAGTCAGTT TCTCTAAAAA 6000

TTAAATAACT GAATGTAAGA CAATCAACAA wCCAAATTTA TACTTCATCT AAACCACTGT 6060

GGTCGTCATC TTTTTGCTTT TCTTTTTCTT TCTCTCGTTC TTGTTCTTTT TTGTACTCTT 6120

CTTCAAATTC TTTTTCTTTC TTTTCTACTT CTTCTCT 6157

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 884 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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CATTTGTTAT TCTGAGTAGC CAATTTGGCA AAGATGAACA AACGTCTGAA CAAACGTATC 60 

AAGTTGCAGT CGCATTAGAG TTAATTCATA TGGCAACACT TGTTCATGAT GACGTTATTG 120

ATAAAAGCGA CAAGCGTCGA GGCAAGTTAA CCATATCAAA GAAATGGGAT CAGACAACTG 180

CTATTTTAAC TGGGAATTTT TTATTGGCAT TAGGACTTGA ACACTTAATG GCCGTTAAAG 240

ATAATCGTGT ACATCAATTG ATATCTGAAT CTATCGTTGA TGTTTGTAGA GGGGAACTTT 300

TCCAATTTCA AGACCAATTT AACAGTCAAC AGACAATTAT TAATTATTTA CGACGTATCA 360

ATCGCAAAAC AGCACTGTTA ATTCAAATAT CAACTGAAGT TGGTGCAATT ACTTCTCAAT 420

CTGATAAAGA GACTGTACGA AAATTGAAAA TGATTGGTCA TTATATAGGT ATGAGCTTCC 480

AAATCATTGA TGATGTATTA GACTTCACAA GTACCGAAAA GAAATTAGGT AAGCCGGTCG 540

GAAGTGATTT GCTTAATGGT CATATTACGT TACCGATtTT ATTAGAAATG CGTAAAAATC 600

CAGACTTCAA ATTGAAAATC GAACAGTTAC GTCGTGATAG TGAACGCAAA GAATTTGAAG 660

AATGTATCCA AATCATTAGA AAATCTGACA GCATCGATGA GGCTAAGGCA GTAAGTTCGA 720

AGTATTTAAG TAAAGCyTTG AATTTGATTT CyGaGTTACC aGATGGACaT CCGaGAtCAC 780

TACyTTTAAG TTTGACGAAA AAAATGGGTT CAAnAAACAC GTAGTATTTA TGnAAAAGTA 840

TTGAAAGCGC TTTACCAACC TGTTAATATA TAATAGTAAT ATAC 884
(2) INFORMATION FOR SEQ ID NO: 186: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

AATTTCATCT GCTCGTGCAA AATCTTTGTT TTTCCTTGCT TCATTACGCT CTTCGATTAA 60

TTTTTCAACA TCTTCATCCA ATAATTCATC TGCATTTTTA GATTTTAACG GTACACCTAA 120 

AACATCGCTG AAAATTTGAT AAACTGCTTT AAATTTATCA ATTACTTCTG TTGATGTTGT 180 

GTTCTCTAGT ACATATTTAT TCGCAAGTkT TGCTAAATCA TACCAAGCTG TAATTGCATT 240 

AGCTGTATTA AAATCATCAT TCATAACTGT TTCAAAACGA TTTAAAATCG CATCAATTTG 300

ATCAATATAT GTCTGTTGAT TTTCAATATT AGTAGCAATT TGTGCGCGCT CTTCAATTAA 360

TTGATAACTA TTGCGAATAC GCTCTAGTcC aCTACGTGCT GATTCTACCA ATTCTAGATT 420

ATAGTTAATT GGGCTTCTAT AATGTACGCT AATCATAAAG AATCTTAGTA CATCTGGATC 480
ATTATCAATA TTAATGAAAC CATTATGCAT CCAATAATTA GCAAATGGCG CATGATTATG 600

TGCTTCTGAT TGTGCTATTT CATTTTCATG ATGTGGAAAT TGTAAATCTG AACCACCCGC 660

ATGTATATCA ATTGTAGGTC CTAGCTCATG AAATGCCATT ACAGAACATT CTATATGCCA 720

TCCTGGTCTA CCTTCACCAA ATGGGCTATC CCAACTAATC TCGCCAGGTt CGCTTTTTTC 780

CACAATGTAA AATCAAGTGC ATCTTCTTTA TGCTCTCCTG CATCTATACG AGCACCCACT 840

TTTAAGTCAT CTATGGATTG ATGACTTAAT TTACCATAAC CTTCAAATTT ACGTGTTCTA 900

AAGTAAACAT CGCCACCACT TTCATATGCA TAACCTTGAT CCACCAAATC TTTAATAAAT 960

TGAATAATGT CATCCATATG GTCCATTACC CTTGGATTTG AAGTCGCTTT TCTAACATTT 1020

AACGCACCAA CATCTTCATG AAAAGCAGCG ATATATTTTT CTGCAATTTC GGGAACAGAC 1080

TGATTTAATT CTTGAGAACG TTTAATTAAT TTATCATCTA CGTCTGTAAA ATTTGATACA 1140

TATTCTACAT TATATCCTTG GTATTCAAAG TAACGTCTCA CTACGTCATA ATTAATTGCw 1200

GGTCTTGCGT TACCAATATG AATGTAGTTA TATACAGTAG GACCACATAC ATACATTTTT 1260

ACTTTCCCTG GTTCTATAGG CTTGAACACT TCTTTTTGAC GTGTAAGCGT ATTATATAAT 1320

GTAATCATCT TGAATCTCTC CATTCCTAGT CTTTTCAAGT TGTCGTTCTA AATGCTTAAT 1380

TTGTTCATAA ATTGGATCAG GTAGATGGCG ATGATCAAAT GTTTTTCCAA CTCGAACACC 1440

ATCTTGCTTA ACAATATGTC CTGGTATACC AACAACCGTT GAATAACTTG GAACTGATTG 1500

TAAAACAACT GAATTTGCAC CAATATTTAC ATTTGAATTT ATTTTAATAT TTCCTAAAAC 1560

TTTCGCACCG GCTGCTATTA AAACATTGTC TCCTATATCT GGGTGTCTTT TCCCTCTTTC 1620

TTTCCCTGTC CCACCAAGTG TCACGCCTTG ATAGATTGTC ACATTATCAC CAATTGTACA 1680

TGTTTCTCCT ATTACAACGC CCATACCATG ATCTATAAAT AGACGCTTTC CAATTTTAGC 1740

ACCTCGATGG ATTTCTATAC CTGTGAAAAA TCTTGAAATT TGAGATATCG CGCGTGCTGC 1800

AACATATTTT TTTTGGTTGT ATAACTTATG TGCAATCAAA TGACTCCAAA CTGCATGTAA 1860

ACCTGCATAC GTTGTAATGA CTTCTAATGT TGAACGTGCC GCTGGATCCT GCTCAAATAC 1920

CATTTTTATA TCGTCTCTCA TTCTTTTTAA CAAGATCATT TCCTCCTCAA TGATTGAACT 1980

ACGTAAATAC ATAATTGAAG TACCTGCGAA ATTAAATATC AAAAAAGCAC CACTAACATA 2040

CAAATTGTAT TGTTAGAGGC GCTTCCGCAC GGTTCCACTC TGAATTTAGC GAATAACATT 2100

AATAATATTG CGGGCGCTTC CAAATTATCA AGGAAACTAA GTCAACTTAA TGCTCATCAC 2160

TCTCATTATA TATTTAATTC ATTTTACGAA GGTGCATTCA TTAATTTCTA CGTTGTACTC 2220

ACAGCAACCG TACACTCTCT GCATCGTATA AATTTAATTA CTAATCCTTC GTTTTATATA 2280
ATAAAATTCA AGTATATACT ACCTTGATCT TGTCTATTTC ATTACTTATA TTGTTTTAAA 2400

CGGTTTAGCA ClUTrTCTTT ACCAAGTACT TCAATTGTAT TTGGTAATTC AGGACCATGC 2460 

ATTTGGCCTG TTACAGCAAC ACGAATAGGC ATAAATAATT GCTTGCCTTT TATTCCTGTT 2520

TCTTTTTGAA CTTCTTTAAT TGTCTTTTTA ATTTCAGCCG CTTCAAATGG TTCAAGTGCT 2580 

TCTAATTTAC TGAATAAGTG CGTCATTAAC TCTGGTACTT GCTCTCCATT AATCACTTGT 2640 

TGTTCTTCTT CACCAAGAGC TGGCATTTCT TTAAAGAACA TTTCTGATAA AGGTACAATT 2700 

TCACCGGCAT AACTCATTTC TTTTTGATAA AGCGCAATTA ATTTGCGTCC CCAAGATAAA 2760 

TCCTCTTCTG ACGGCACCTC AGGAATCAAA TTTGCTTTAA TTAAATGAGG TAATGCTAAT 2820 

TGGAATACTG TTTCAGTATC TTTTTGTTTC ATATATTGGT TATTAACCCA TGCTAATTTT 2880 

TGCTTATCGA AAAATGCTGG TGATTTTGAC AAACGCTTTT CATCAAAGAT TTTGATAAAT 2940 

TCTTCTTTAG AAAAGATTTC TTCTTCACCT TCAGGAGACC AACCTAATAA CGCAATAAAA 3000

TTAAATAACG CTTCAGGTAA ATAACCTAAG TCACGATATT GCTCAATAAA TTGTAAAATT 3060 

TGCCCATCAC GTTTACTTAA CTTTTTACGT TCTTCATTAA CAATTAATGA CATATGACCA 3120 

AAACGAGGTG GCTCCCAGCC AAATGCTTCA TAAATCATAA TTTGTTTAGG CGTGTTTGAA 3180

ATATGATCAT CACCACGAAT TACATCTGAA ATTTGCATGT AATGATCATC TATAGCTACT 3240

GCAAAATTGT ACGTTGGAAT GCCATCTTTT TTTACGATAA CCCAGTCACC AATACCATTT 3300

GAATCAAATG AAATATTTCC TTTTACCATA TCATCAAATG AATACGTTTG GTTTTGAGGT 3360

ACTCGGAAAC GAATTGATGG TTGGCGTCCT TCTGCTTCAA ATTGTTGACG TTGTTCTTCA 3420 

GTCAAATGCG CATGTTGACC ACCATAGCGA GGCATTTCAC CACGAGCGAT TTGCGCTTCA 3480

CGTTCAGCTT CTAATTCTTC TTCTGTCATA TAGCATTTAT ATGCTTTATC TTCTGCTAGT 3540 

AACTGATCTA TTAATGGTTG GTAGATATGT TGACGTTCAG ATTGACGATA TGGTCCGTAG 3600

CCATTGTCTT TATCTACAGA CTCATCCCAA TCTAATCCTA ACCATTTAAG ATTATCAAAT 3660

TGTGATGtTT CTCCATCTTC TAAATTACGT TTTTTATCAG TATCTTCAAT TCGAATCACA 3720 
AAATCTCCGT TGTAATGTTT AGCATACAAG TAATTGAATA ATGCTGTTCT TGCATTACCA 3780 
ATATGAAGAT ACCCAGTTGG ACTTGGTGCA TATCTTACTC TTATACGATC GCTCATTTTT 3840

TTCACTCCTA AATTAAATAT CAGATTTTCA AGTTAGTTCA TATAAATTGT TCATTTGCTA 3900

TCTTCGACCG TCATAACAAA TGTCTAACTC GTCTTATTGT TAAAACGAAA CAATGCTTTT 3960

TAACATGACC TTAAAATAAT TTCATTGTTT AATCATAACA TAATTCCCTG GGTAATATGC 4020 
TTAAATTTTA AATAGAAAGC TGTTGTTTTT TCAACACTTT AAAAAAGCTA TCCCTAAGAA 4080 
TTAAACTTCA AATTAACTAT TCAAATACGT TAAAATTGAT TCTAATTTTG TATGTCTTGA
TTGCTATAAG AATAACTTTA TTAATATCTA AAATTTAACA CTTAATGAAC TTGTTTCAAT 
GATATATTAG CACTATTTGT ATTTTTTGAT AACTAATATG TTTTGCATTT ATTTATAGTT 
ATACTTCAAA TTACAAACTt CGCCATTTCA TATACCTTTT AATATCTATT TTGTTTTCGT 
CAACTACAGT TTTTATAATG ATACTGTATC TTCGATTTTT TTAGCAAAAA CAATTCTTCC 
TGAAGATGTT TGCAATAAGC TGACTACTTC TAAATTGACA TGACTGCCAA TAAGATTTTT
AGCATTATCA ACAACTACCA TCGTACCATC ATCTAGATAT CCTACTGCCT GACCAGGCtC 
CTTACCCATT TTTGTCAGTA AAATATGCAG TTGATCACCT TGATGTACAT TAGGTTTGAT 
TGCTTCTGAT AAATCATTAA CATTTAATGC TTTGATACCA TGTACATGAC AAACTTTATT 
TAGGTTGAAA TCTGTCGTTA TAATACTTGC ATGATATTGT TTTGCTAATT TTAATAACAT 
CGTATCAATA TCACTATGTG TTTTAGTTGG ATGTATAACC TTTGTAGGAT AGTCTAAATC 
ATACAATTCA TTTAAAATAT CTAAGCCTCT TTTACCCTTT TCaCGTTTAA CACTGTCATT 
TGAATCTGCA ACAATTTGTA ATTCATTAAT AACACCTTGT GGAATTAAAA TATTGCCATC 
GATAAAACCG CAACGAATGA CTTCTAAAAT ACGACCATCA ATAATTGCGC TTGTGTCGAT 
AATTTTTGGC GTAgcaCTTT TaGTATGTTG TGACATGGAA CGCGCTATAT TCTCAGGTAA
AAACATTAAC ATTTCATCTC GTTTTTTAAG GCCAAATTGG AAACCGAAAT AACATAGTAA 
TATCGTAATT ATGACAGGAA TGAAATGATT AAAAATAGAG TTGCCAATTG ATTCTAATAT 
AAACGACACC ATAACAGAAA TAAGTAATCC GATTATTAAA CCTATTGTTG CGAATAGTAT 
TTCAACAGCA CTTCTACGCA TAATAAAATG TTCTAAACCT TTTATAGCGT TAGTAACTCG 
TCTAATAAAT ACACCAAAAA TTAAGAACAT AAAAATACTA CCGATAATGC CATCTACATA 
GTGATTTTTT AAAAAGCTGG AGTTTTGTAA TCCAAGATCA TTTGCAATTT CAGGAATAAT 
AATTATTCCT AATGCGCTCC CAATAATTAA GTAAATAATA ATAACCATTA GTTTAACGAT 
ATTCACACAA TGTCCTCCTT TCTTGATGTT TTATGAATGA AGAGCAAATG ACAATACTTC 
ATGTACAGTA GTTACACCTA TTACTTGTAT ACCTTCAGGA TATGTCCATC CGCCTATATT 
ATTTTTAGGA ATAATTACAC GTTTGAAACC TAGTTTTGCA GCCTCTTGCA CGCGTTGTTC 
TATCCGAGAT ACACGACGTA CCTCACCCGT TAAACCAACT TCTCCAATAT AGCAATCTAA 
TCCGTCGACA GCTTTATCTT TAAAGCTAGA TGCAGTTGCT ACAATTACAC TTAAATCAAC 
TGCTGGCTCC GTTAACTTTA CACCGCCAGC TACTTTGATA TAAGCATCTT GTTGTTGTAA 
TAGATAATTT TCTTTCTTTT CCAAAACAGC CATCAACAAA CTTAATCGAT TATGATCAAT 
TATTAAAAGT GGTCTGGTTC CCTCCATGGT TGCAACAATT GTTGAACCTG GAACATTTGT 6000

TGAACGTTCT TCTAAAAACA TTTCAGATGG ATTATTTACA CCTTTTAATC CACTTTGCTT 6060 

CATTTCGAAG ATTCcCATTT CATTCGTTGA ACCAAAACGG TTTTTAACAG CTCGCAAAAT 6120 

TCGATATGCG TGGTGTTCAT CGCCTTCAAA ATAAAGCACA GTATCaACCA TGTGTTCTAG 6180 

CAATCTTGGG cCCAGCAATT TGACCTTCTT TCGTTACATG ACCCACTATA AAAGTTGCaA 6240

TGTTCATTTG TTTAGCAATA TTCATTAAAC TTTGTGTACT TTCACGAACT TGTGAAACAG 6300

AACCTGGCGC AGAGCTGATT TCAGGATGAT ATATTGTTTG AATCGAATCC ACTACTAATA 6360

AATCAGGTTG TTCTTCTTTT ACTGTTTGAT AAATAACTTC AAGATCTGTT TCAGCTAATA 6420

CTTGCAATTC ACTTGAATCT TCATCTAATC GCTCTGCACG TAATTTAGTC TGACTAAGCG 6480

ATTCTTCTCC AGTAATATAT AGTACTTTTT TCTTTTGAGA TAACGATGCA CAAATTTGTA 6540

AAAGTAACGT TGACTTACCA ATACCTGGAT CCCCACCAAT AAGTACTAAC GATCCGCTCA 6600 

CAATACCTCC ACCTAATACA CGGTTGAATT CTGCTGAATC TGTTAACACT CTCGGCGTTG 6660 

TTTCATGTTT AATACTATTT AATTTTTGTA CTTTACCTGC TAATTCCTTG GTTTTAACTC 6720 

CATGTTTAGG ATTGGCTGCT TTTTCAACAA TTTCCTCCAT TTGATTCCAA GCGCCACAAT 6780 

TAGGACATTT CCCCATCCAT TTAGGAGATT GATAACCACA AGCCATACAT TCAAAAATCA 6840

CTTTTTTCTT GGCCArAATT GCAcCTCCAC TTTCTT 6876 
(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:
CAACTCAAAC AGCAGAACAA CGTCGTGAGT TGATTAATGG TGTATTTACT GACATTAATC 60

CCATACATTA AAAATATGAT GTACGTGTTA GCAGATAATA GACATATCTC ATTAATAGCT 120

GACGTATTCA AGGCGTTCCA AAGCTTATAT AACGGACACT ACAATCAAGA TTTTGCAACA 180

ATTGAGTCAA CATATGAATT GAGTCAAGAA GAGTTAGATA AGATTGTCAA ACTAGTAACT 240

CAACAAACGA AGTTATCTAA AGTTATTGTA GATACAAAAA TTAATCCAGA TTTAATTGGT 300

GGATTTAGAG TTAAAGTCGG CACAACTGTA TTAGATGGTA GTGTTAGAAA TGATCTTGTC 360

CAATTACAAA GAAAATTTAG AAGAGTTAAT TAATTATAAA GAGGAGTGAC ATAGATGGCC 420
ATGTCCGTAA CTGATGTAGG TACTGTATTA CAAATTGGTG ATGGTATTGC ATTAATTCAC 540

GGATTAAATG ACGTTATGGC TGGTGAGCTA GTAGAATTCC ATAACGGCGT ACTTGGTTTA 600 

GCCCAAAACC TTGAAGAGTC AAACGTGGGT GTGGTTATTT TAGGACCATA CACAGGTATT 660 

ACTGAAGGTG ACGAAGTTAA ACGTACTGGT CGTATCATGG AAGTACCAGT AGGTGAAGAA 720 

CTAATCGGAA GAGTTGTTAA TCCATTAGGA CAACCTATTG ATGGAcAAGG ACCGATTAAC 780

ACAACTAAAA CACGTCCaGT AGAGAAAAAA GCTACTGGTG TAATGGATCg TAAATCAGTA 840

GATGAGCCAT TACAAACAGG TATCaAAGCA ATTGATGCTT TAGTACCAAT TGGTAGAGGT 900

CAACGTGAGT TAATCATCGG TGACCGTCAA ACAGGTAAAA CAACAATTGC AATTGACACA 960

ATTTTGAACC AAAAAGATCA AGGTACGATT TGTATCTATG TTGCTATTGG TCAAAAAGAT 1020 

TCAACAGTAA GAGCAAATGT TGAAAAGTTA AGACAAGCAG GCGCTTTAGA CTACACTATT 1080 

GTTGTAGCAG CATCAGCTTC TGAACCTTCT CCATTATTAT ATATTGCACC ATATTCAGGT 1140 

GTAACAATGG GTGAAGAATT CATGTTTAAC GGTAAACATG TTTTAATCGT TTA 1193 
(2) INFORMATION FOR SEQ ID NO: 188: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5549 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 
TGCTAAGAAG TCAAAATAAA CTAACTATnA AACATCTAGT ACGATTATTA AAGTGACAGA 60

TnATAAAATT GAATTATTnA GAGAAGGAGA TATAAAGTTT GAAGAAATAA AAGAAAGACT 120 

AGGTACAGGT ATTATTTATG AATAAGTTAA TACTTGGGAT TTATTTATAC CGAATTTTTT 180 

CACGAGCATA CTTTTATTTA CCGTTTTTAT TAATTTACTT TTTGATTCAA GGTTATTCCA 240 

TAATACAATT AGAAATATTA ATGGCGTCTT ATGGCATTGC AGCATTTTTA TTCTCTCTAT 300

ACAAAGAGAA GTGTTTTAAA ATTTGTAACT TAAAAGATTC TAATAAATTA GTTGTTAGTG 360 

AAATATTCAA AATCATCGGT TTATTGTTGT TATTATATCA AAATCAATAT TTAATTTTAG 420 

TAGTGGCACA AATATTATTA GGGTTAAGTT ACTCAATGAT GGCGGGTGTT GATACCGCAA 480

TAATTAAAAG AAATATAACA AATGAGAAAT ACGTACAAAA TAAGTCAAAT AGCTATATGT 540

TCCTATCATT ATTAATTTCA GGGATTATAG GTAGTTATCT TTATGGAATA AATATTAAAT 600 

GGCCTATAAT AATGACTGGT ATATTTTCAA TTCTAACAAT TATAATTATT CGATGCACAT 660 
TACCAGAAGA GAAGTTTTGG ATATTGCATT ATTCTTTTTT AAGAGCGTTA ATATTAGGAT 780

TTTTTATAGG ATTTATTCCA ATTAATATAT ATAATGATTT AAAACTGAAT AATTTACAAT 840

TTATTTCAGT ATTAACTTGT TACACAGTTA TGGGTTTTGT ATCTTCACGT TATTTAACTA 900

AATACTTGAA TTATAAGTTT GTGTCAGAAA TTTGTTTAGT AATATTTTTA ATAATATATA 960

CATATCAAAG TTTCATAGCA GTTACTATTT CTATGATATT TTTAGGTATT TCTTCAGGGT 1020

TAACTCGTCC ACAAACTATA AATAAACTTT CTAGCAGTAG TAACTTAAGA GTGATGCTTA 1080

ATTATGCAGA AACGTTATAT TTTATTTTTA ATATCGCATT TTTACTTATG GGTGGTTACT 1140

TATATACAAT AGGAACTATT CAATACTTAA TATTATTTAT TTCGTTATTA ATTTTTATAT 1200

ATTTAATAAT AATATTTyAT TTTACAAGGA GAGAGCAACA TGAAAATAAA AACTGAATTT 1260

AAAGGGAACA ATATACCATA TGAATACGCA GCAGGTGCAG ATGTGAGTGA TTCTATTAAC 1320

GGGAATCCAA TTAAGTCATT TCCATTTGAA GTAATTGAAT TACCGGAAGG gACTAAATAT 1380

CTTGCTTGGT CTTTAATTGA CTATGATGCA ATTCCTGTAT GTGGCTTTGC TTGGATTCAT 1440

TGGAGTGTAG CTAATGTAAG TGTTAGTGGC AATTCAATTT CTATAAAAGC AGATTTATCA 1500

AGAACAAAGG GCGACTATGT ACAAGGTAAA AATAGCTTTA CTAGTGGGTT GTTGGCTGAA 1560

GATTTTTCAG AAATAGAAAA TCACTATGTA GGACCTACAC CACCTGATCA AGATCATCAA 1620

TATGAATTAA CAGTTTATGC GTTAGATCAT TCTTTAAATT TGAAGAATGG GTTCTACTTG 1680

AATGAATTTT TAAAAGAAGT AAATCAACAT AAAATTGATC AAACAAGTAT TAACCTTATA 1740

GGAAGAAAAA TTTAATACTA AATATCTCAT CAATATAAAA TTGTTCAATT AAAAGTACAA 1800

AGAAACAAAG GTTTTAATTT ATATATTAGG TACGGCGTTC GCTATAATGC AAAGAAGTAA 1860

TTAAATTTAA GAAATGTAAA CTTAGTTATT GTAATGTGAA TTTATTTGAA AAAATAGAAA 1920

GTATTAACAA TTATAGCTTT TACATTAATT AAAATTTATT TTTAAAAACA AGTAAACAAT 1980

TTACATACTT ATAATTTTTG AAAATTTTCA ATTTGTGTTA TATTGATTTT GTAAGATACT 2040

TTAACTCACA AAGGAGAGAG AGTATATGAA ATTAAAATCA TTTATAACTG TAACTTTGGC 2100

ACTGGGCATG ATCGCAACGA CTGGCGCTAC TGTGGCAGGT AATGAGGTAT CTGCAGCAGA 2160

AAAGGACAAA CTACCGGCAA CTCAAAAAGC TAAAGAAATG CAAAATGTTC CATATACAAT 2220

TGCAGTAGAT GGCATTATGG CTTTCAATCA ATCTTACTTA AATTTACCAA AAGATAGCCA 2280

ATTATCATAT TTAGATTTAG GAAATAAAGT TAAAGCTTTG TTATATGATG AACGCGGTGT 2340

AACACCTGAG AAGATTCGAA ATGCAAAATC TGCCGTTTAC ACGATTACTT GGAAAGATGG 2400

TAGTAAAAAA GAAGTGGATC TTAAGAAAGA TAGCTACACA GCAAACTTGT TTGATTCAAA 2460
CAACATGAAG CATTTAATTT TACAGTGATG ATTATAAAAT AATTGCCTTG ATACAAAGAT 2580

TACTCGTAAA TGACATCTTT GTATTAAGGC TTTTTCTAAA TTTAAAAGTG ATGGGTTAGA 2640

GGTCATTGAG CTTTAAAATA TTCAAAATAC AAAACATTAA TGGCCAAAAA TAAAAGCCGC 2700

CTTTATCTGG GCAGCTTCAA TAATAAGAAA GACATATTTC ATTTTATACT AAATAGTTAT 2760

TGTGATGAAT CTTTCGGCGG TTTAATTACT GCAGCAAAAA TTGCTGTGAA AATCGTGAAC 2820

AATACTGCCA TGATAATTGG ATTCACTACA TTTAAGCTGT CTCCACCTAC TAGGCTATTA 2880

AGTACAAAGT TAACCATTTG CATTAATAAT AATGCCCAAA AGAATGTTAC GAGGTGTTTC 2940

ATGTCATTCT ACCTCCACTT TAATTATATA TATTTTATTT TAAGTGAAAG TTAGAAATTT 3000

GTATAGTAAC ATCTCATATA TTTTGACCAT ATTATACAGT TTAAATAAAT GATTTTATCT 3060

GAATGGCTAT TCTAAATTAA GCGCATTAAA ACCAATTTCA TACTGAAATT TGACGATAAT 3120

AAAGCATTAA AATTTTATTA ACTAGTCAAT ATTCCTACCT CTGACTTGAG TTTAAAAAGT 3180

AATCTATGTT AAATTAATAC CTGGTATTAA AAATTTTATT AAGAAGGTGT TCAACTATGA 3240

ACGTGGGTAT TAAAGGTTTT GGTGCATATG CGCCAGAAAA GATTATTGAC AATGCCTATT 3300

TTGAGCAATT TTTAGATACA TCTGATGAAT GGATTTCTAA GATGACTGGA ATTAAAGAAA 3360

GACATTGGGC AGATGATGAT CAAGATACTT CAGATTTAGC ATATGAAGCA AGTTTAAAAG 3420

CAATCGCTGA CGCTGGTATT CAGCCCGAAG ATATAGATAT GATAATTGTT GCCACAGCAa 3480

CTGGaGATAT GCCATTTCCA ACTGTCGCAA ATATGTTGCA AGAACGTTTA GGGACGGGCA 3540

AAGTTGCCTC TATGGATCAA CTTGCAGCAT GTTCTGGATT TATGTATTCA ATGATTACAG 3600

CTAAACAATA TGTTCAATCT GGAGATTATC ATAACATTTT AGTTGTCGGT GCAGATAAAT 3660

TATCTAAAAT AACAGATTTA ACTGACCGTT CTACTGCAGT TCTATTTGGA GATGGTGCAG 3720

GTGCGGTTAT O^CGGTGAA^TTTCAGATG GCACSAGC^AT^ATAAGITAT GAAATGGGTT 3780

CT6ATGGCAC AGGTGGTAAA CATTTATATT TAGATAAAGA TACTGGTAAA CTGAAAATGA 3840

ATGGTCGAGA AGTATTTAAA TTTGCTGTTA GAATTATGGG TGATGCATCA ACACGTGTAG 3900

TTGAAAAAGC GAATTTAACA TCAGATGATA TAGATTTATT TATTCCTCAT CAAGCTAATA 3960

TTAGAATTAT GGAATCAGCT AGAGAACGCT TAGGTATTTC AAAAGACAAA ATGAGTGTTT 4020

CTGTAAATAA ATATGGAAAT ACTTCAGCTG CGTCAATACC TTTAAGTATC GATCAAGAAT 4080

TAAAAAATGG TAAAATCAAA GATGATGATA CAATTGTTCT TGTCGGATTC GGTGGCGGCC 4140

TAACTTGGGG CGCAATGACA ATAAAATGGG GAAAATAGGA GGATAACGAA TGAGTCAAAA 4200

TAAAAGAGTA GTTATTACAG GTATGGGAGC CCTTTCTCCA ATCGGTAATG ATGTCAAAAC 4260
TGAACCTTAT AGCGTTCACT TAGCAGGAGA ACTTAAAAAC TTTAATATTG AAGATCATAT 4380

CGACAAAAAA GAAGCGCGTC GTATGGATAG ATTTACTCAA TATGCAATTG TAGCAGCTAG 4440

AGAGGCTGTT AAAGATGCGC AATTAGATAT CAATGAAAAT ACTGCAGATC GAATCGGTGT 4500

ATGGATTGGT TCTGGTATCG GTGGTATGGA AACATTTGAA ATTGCACATA AACAATTAAT 4560

GGATAAAGGC CCAAGACGTG TGAGTCCATT TTTCGTACCA ATGTTAATTC CTGATATGGC 4620

AACTGGGCAA GTATCAATTG ACTTAGGTGC AAAAGGACCA AATGGTGCAA CAGTTACAGC 4680

ATGTGCAACA GGTACAAATT CAATCGGAGA AGCATTTAAA ATTGTGCAAC GCGGTGATGC 4740

AGATGCAATG ATTACTGGTG GTACAGAAGC ACCAATTACT CATATGGCAA TTGCTGGTTT 4800

CAGTGCAAGT CGAGCGCTTT CTACAAATGA TGACATTGAA ACAGCATGTC GTCCATTCCA 4860

AGAAGGTAGA GATGGTTTTG TTATGGGTGA AGGTGCTGGT ATTTTAGTAA TTGAATCTTT 4920

AGAATCAGCA CAAGCTCGAG GTGCCAATAT TTATGCTGAG ATAGTTGGCT ATGGTACTAC 4980

AGGTGATGCT TATCATATTA CAGCGCCAGC TCCAGAAGGT GAAGGTGGTT CTAGAGCAAT 5040

GCAAGCAGCT ATGGATGATG CTGGTATTGA ACCTAAAGAT GTACAATACT TAAATGCCCA 5100

TGGTACAAGT ACTCCTGTTG GTGACTTAAA TGAAGTTAAA GCTATTAAAA ATACATTTGG 5160

TGAAGCAGCT AAACACTTAA AAGTTAGCTC AACAAAATCA ATGACTGGTC ACTTACTTGG 5220

TGCAACAGGT GGAATTGAAG CAATCTTCTC AGCGCTTTCA ATTAAAGACT CTAAAGTCGC 5280

ACCGACAATT CATGCGGTAA CACCAGATCC AGAATGTGAT TTGGATATTG TTCCAAATGA 5340

AGCGCAAGAC CTTGATATTA CTTATGCAAT GAGTAATAGC TTAGGATTCG GTGGACATAA 5400

CGCAGTATTA GTATTCAAGA AATTTGAAGC ATAACTATAA nAATCTTCAG TAACGTTGTT 5460

TTAGTTACTG aagatttttt CaGTTTCTTT ATACTAAGAT GAGCGACAcA CAATCGTCAT 5520

AATAAAATAT GAATATTTAT TAATAATAA 5549



(2) INFORMATION FOR SEQ ID NO: 189:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 


AGATTATAGT AAGATTGATA GTTTGGCGAC TGaAGCgCGa GaAAAATTAT CAGaAGTAAA 
mCCTTTAAAT ATTGCACAAG CTTCTAGAAT ATCAGGGGTA AATCCAGCAG ACATATCTAT 
TGGTTAGCAG AACAATTAAA AGAACATAAT ATTCAATTAA CTGAGACTCA AAAACAACAG 240

TTTCAAACAT ATTATCGTTT ACTTGTTGAA TGGAATGAAA AGATGAATTT GACAAGTATT 300 

ACAGATGAAC ACGATGTATA TTTGAAACAT TTTTATGATT CCATTGCACC TAGTTTTTAT 360 

TTTGATTTTA ATCAGCCTAT AAGTATATGT GATGTAGGCG CTGGAGCTGG TTTTCCAAGT 420 

ATTCCGTTAA AAATAATGTT TCCGCAGTTA AAAGTGACGA TTGTTGATTC ATTAAATAAG 480 

CGTATTCAAT TTTTAAACCA TTTAGCGTCA GAATTACAAT TACAGGATGT CAGCTTTATA 540 

CACGATAGAG CAGAAACATT TGGTAAGGGT GTCTACAGGG AGTCTTATGA TGTTGTTACT 600 

GCAAGAGCAg TAGCTAGATT ATCCGTGTTA AGTGAATTGT GTTTACCGCT AGTTAAAAAA 660

GGTGGACAGT TTGTTGCATT AAAATCTTCA AAAGGTGAAG AAGAATTAGA AGAAGCAAAA 720 

TTTGCAATTA GTGTGTTAGG TGGTAATGTT ACAGAAACAC ATACCTTTGA ATTGCCAGAA 780 

GATGCTGGAG AGCGCCAGAT GTTCATTATT GATAAAAAAA GACAGACGCC GAAAAAGTAT 840

CCAAGAAAAC CAGGGACGCC TAATAAGACT CCTTTACTTG AAAAATAATG CATAATCCTT 900 

TACAACTAAC ATAAAAGGAG CGAATGGATA ATGAAAAAAC CTTTTTCAAA ATTATTTGGT 960 

TTGAAAAACA AAGATGACAT CATTGGACAT ATTGAAGAAG ATCGCAATAG TAATGTTGAA 1020 

TCCATTCAAA TTGAACGTAT CGTTCCCAAC CGTTATCAAC CAAGACAGGT GTTTGAACCA 1080 

AATAAAATTA AAGAACTTGC TGAATCAATA CATGAACATG GTTTACTACA ACCTATTGTT 1140

GTAAGACCGA TTGAAGAAGA TATGTTTGAA ATTATTGCTG GAGAGCGCCG ATTTAGAGCA 1200

ATACAATCAC TAAATTTACC TCAAGCAGAC GTTATTATTC GTGATATGGA TGATGAAGAG 1260

ACGGCTGTTG TTGCATTAAT TGAGAATATT CAAAGAGAAA ATTTGTCTGT TGTTGAAGAA 1320

GCGGAAGCCT ATAAGAAATT ATTGGAAATT GGTGATACAA CGCAAAGTGA ATTGGCAAAA 1380 






AGTTTAGGTA AAAGTCAAAG CTTTATTGCA AATAAGTTGC GTTTATTGAA GTTGGCGCCG 1440

AAAGTACTAC TTCGCTTAAG AGAAGGTAAA ATTACTGAAC GTCATGCGAG AgcGGtATTA 1500 

TCATTGTCTG ATAGCGAACA AGAAGCGTTG ATTGAGCAAG TCATTGCACA AAAGCTAAAT 1560

GTGAAcAGAc TGAAGATAGA GTACGCCAAA AAACGGGGCC CGAAAAAGTC AAAGCACAAA 1620 

ACCTTCGCTT TGCACAAGAT GTCACTCAAG CACGAGATGA GGTAGGCAAA AGTATCCAAG 1680 

CGATTCAACA AACAGGATTA CATGTTGAGC ATAAAGACAA AGATCATGAA GATTATTATG 1740 

AAATAAAAAT TCGAATATAT AAACGTTaGT AGTAGGATGT CGTATACATG ATGACTAACA 1800 

CATAAAAGAC AAAGCTAAGA TCATAACAGC TTTGTCTTTT TTTTTTGTTT TACGTGAAAC 1860 

ATAAAAATTT ATATTTATAT GTTGATCAGG CTGGTACATA AATCAATGTT CTATGCTCTA 1920 
TTCTAGTCAA CCTTGCTGGG GTGGGACGAC GAAATAAATT TTGCGAAAAT ATCATTTCTG 2040

TCCCACTCCC TAATTTGAGC TGGATATACT TTCATTTGAA CCCTTTATTG CTAGTTTATG 2100


5 


A AAHTATCAT 


GAAAGCTTTA 

\jrwivj v*» x x x *» 


TGAACATCGC 


TTGAGTTGCC 


TTTACAGTAG 


AAAATTTAAG 


2160 




TTTTACACTT TGTGTGAATG ATACGTTTTG TATTGAATTA ATTATAGAAA GGTACGTTGA 2220




/Awrt, X V-J X X X X V_ 


AATTGGAAGT 


GCAATTCTTC 


ATTTTGTCAT 


TGGTGGTATC 


GCTGTTGCAT 


2280 


TAGCTTCAAT TATTGCTGAT AAGGTAGGTG GTAAGTTAGG AGGTATTATA GCTACTATGC 2340




UVJvI^AU X W X X 


TfTTCi CGGCT 

X V* X X VJV* NJ>J>» X 


ATTATCGCAT 


TAG CTTT AG A 


TCATCGTGGT 


ACGCAATTAG 


2400 


TGGAGATGTC GATGAATCTT AGTACTGGAG CAATTGTCGG TATTCTGTCT TGTATATTAA 2460


Uluinl xxx X 


vjnun x v_ x v— x v— 


T ACATT AAGC 


ATAAAGGTTA 


TCGGAAAGGC 


G CAAT ATTCA 


2520 




Var/Vj X X X -X X VJ 


X X VJVJ XXX VJ X V, 


ATTTCPCTOG 

XXX V- V— V— X w 


CAAT ATT CAG 


TATTAGACAT 


TT AT AG TTTG 


2580 


20 


GAAAATGPGT 


GATAATTAGT 

VIA X X X AO X 


TGTATT CAGT 

x v_j x x x vnvj x 


TATTAAGTAA 


TAAATTATTG 


GAGGCAGAAC 


2640 




IxTPATGAAAT 


TAAPATTAAT 

X e\n\*~t\ x x nn x 


GAAATTTTTT 

wnnn x x x x x x 


GTGGGGGGAT 


TTG CAGT ATT 


ATTAAGTTAT 


2700 




/\ X i<j 1 /\ X v_ 1 Vj 


TAArAfTArr 

X nnvnV. X nLL 


TTGGAAAGAA 


TTTGGCGGTA 


TATTTGCaAC 


GTTTCCGGCA 


2760 


25 




lYITY ,f l , ATf5 J l u l" 
IVJlUlAlvl X 


TATTACAGGT 


ATGCAATATG 


GTGATAAAGT 


CGCTGTGCAT 


2820 








THfiTATflA PA 


GGGGTATTAG 

VjVJVJ\J X «W X X «xVJ 


TTTGTATTTT 


AGTTACATGG 


2880 




/\ X X v>7 X X M\» 


ATATfiArGPA 


CATGTGGTTG 

Vrf \ X VJ X VJ%J X X NJ 


ATT AG CATTG 


TTGTTGGTTT 


CCTAAGCTGG 


2940 


30 


X X V— /A X V_JAvJ X O 


V — /-\vJ X /A lulnl 


TTTTGAAGCG 

x x X x vjrirt\Jv»vj 


GTAGAATTTA 


TAGCACAAAA 


AAGATTAGAA 


3000 




AAGCATAGTT GGAAAGCTGG AAAATCGAAT AGTAAATAGT GTGAACGTAA TCTCTTAACT 3060

AGGACTAACT TTGCAAGCAT TGAATAGCAT GGAAAAGTTG CATCATTAAT AAGTGAAATT 3120

CAAGTTGGCA TTGAGAAAAT TACAAGCGCG TAATCATACa GGTCTGTCTT AAGGGAGTCT 3180

TCGAACCCCG ATGTTGTCGT ATGTCAAAAC ATTTAGTCAA TCATAAAGGT GACTTGATTT 3240

AACTTTATCT GATAGTCTGA TTGTAATGAT TGTACTAATT GACTGGAGGC GTATGTAATT 3300

GAATCTGAGT AAACAAATTA AAAAGTATAG GGAACGAGAT GGTTATTCAC AAGAATATCT 3360

TGCTGAAAAG TTATATGTAT CTAGGCAGAG TATTTCTAAT TGGGAAAATG ACAAAAGCTT 3420

ACCAGACATA CATAACTTAT TAATGAyGTG TGAATTGTTC AATGTAACTT TAGATGATTT 3480

AGTAAAAGGG ACCATTCCAT TTGTACCTGA TATTAAAGCG CAACGAAGTC TTAACTTATG 3540

GACATATGTG ATGCTTATTT TCATGACATT AGCTGCAATT TTAATGGGAC CTTTAGTTGT 3600

TTATTGGAAT TGGACTTGGG GTGTAACGGT GGCAATCATT TTGGGAATAG GTTTTTATGC 3660

ATCTATGAAA ATAGAAGATT TAAAAAAAGT GCATAAAATG GACAACTACG ATCGAATTGT 3720
GACAAATGCG CTTTCTATTA TATCAGTAAT TGGTATACTC AGCCTCATAA TTTTCCTTAG 3840

TGTGTATTTG GCAAATAAGT TTTTATAAAT CATCGTGGTA TCGTCTCATA TTATTTATAT 3900

TATCCAAAAT AGCATAAAAA AATACCAACA AGATTTAGAA CCTTGTTGGT AATCAAAGCG 3960

aTTCATTTAT AATGAGTCGT TTTATGTTGT AAGATTAAAC AGTTTGTACG TTAACTGCTT 4020

GGTCTCCACG TTGACCTTCA GTGATTTCGA AAGTAACTTT TTGACCTTCT TCTAAAGTTT 4080

TGTAGCCATC GCTAGCGATA CCTGAGAAAT GTACGAATAC GTCTCCGCCA TTTTCTTGTT 4140

CGATGAAACC AAAACCTTTT TCTGCrTTAA ACCATTTwAC TGTACCGTTA TTCATATwGA 4200

AurACCTCCGT gTGCTTTTGC ACTTAATATT TGTAACAAAT TCATAACTAA AAAAGAGGAT 4260

ATTCTAAACA AATACACTAC AATTTAATTC ACGAGCTTTT ATTACGTAAG ACCAACTATA 4320

CGCTCATATT GGCATAATGT ACAGTGTTTT TTGAAAATAA ATTAAAAAAG ATTTTTAAAA 4380

ACCTTAGAAA CGTTGATTTA AAGGGGTTTA TAAAAATwAw AAAATTGTAG TCTTTTATGG 4440

TGTTTGCTAG TTTTCAAAGT GACATATCGT TTAAACATGA TGATTTTATA AGCAATCCAT 4500

AAAAAACAAG CAGCGATAAA CGCTACTTGT TGATATTAAA ATCTGACTTG AAAGGTCATA 4560

GCAATGTTCT ATACCGATGG AATGTGCTTA CTTGCCTTTT TCTTCACGAC GTTTTAAATA 4620

ATAAGAGCCA CCTAATAAAC CAGCTGGAAT GCCTATCATT GGTGTTGTGA ATGAGCTTAA 4680

TACAATAACA AGTATTGTTA AAGCAATGAC GTTATACCAA GTTACAGTCA AATTTTTCAA 4740

ATCCTCATAT GATTGTTTTA CTAATTCTCT AAATTTCATG ATTCAATCTC TCCTTTTTTA 4800

TAAATCTTTA GATTGTCAAA TTAAGCTGGA CA 4832
(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5727 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

CAAAGCTGTT CAAAAGGCTT ATAATTTAAA TTTAGATAAC ATACGTACAA TGGAACCTAA 60 

GTTGAGATAT CAAGCGATCA ATAAAGGTAA TATTAATTTA ATAGATGCAT ATTCAACTGA 120 

CGCTGAATTA AAACAATATG ATATGGTTGT GTTAAAAGAT GATAAGCACG TATTTCCACC 180 

ATATCAAGGA GCACCATTAT TTAAAGAAAG CTTTTTAAAG AAACATCCAG AAATTAAGAA 240 

ACCGTTAAAC AAACTAGAAA ACAAAATATC TGATGAAGAT ATGCAAATGA TGAACTATAA 300 



GTTAATCAAA TAACGACCAA CGCCACATAA GATGCGTAAC ACCAAATTAT ATCTTATGTG 420

GCGTTGTTAT ATTTAAATCT ATAATTATGT TCAATTTAAA CATGCAATAA TGATTAAAAA 480

ATATGACATG TTAAACACAA TGTAAGCTAT TATGATGTGA AAATAGTAGC ATTGCATTTT 540

AGAAACATAG AGCGATATAA TGAATATAAG TTTTTTGAAA TTTCAGTTAA TTCTAAGGAG 600

GTTGTTTTTA TTATGAAAGA ACAACTTAAT CAACTATCAG CATATCAGCC TGGTTTATCT 660 

CCAAGGgCAT TGAAAGAAAA GTATGGCATT GAAGGAGATT TATATAAACT TGCATCAAAT 720 

GAAAATTTGT ATGGACCATC GCCTAAAGTT AAAGAAGCGA TATCAGCACA CTTAGATGAG 780

TTATATTATT ATCCTGAAAC AGGATCACCG ACATTAAAAG CGGCGATTAG TAAACATTTA 840

AATGTAGATC AATCACGCAT TTTATTTGGT GCGGGATTAG ATGAAGTTAT ATTAATGATT 900

TCTAGAGCTG TATTAACGCC AGGGGATACT ATTGTTACAA GTGAAGCGAC ATTCGGTCAA 960 

TATTATCACA ATGCGATTGT TGAATCAGCT AATGTGATAC AAGTACCTTT AAAAGATGGT 1020

GGCTTCGATT TAGAAGGTAT TTTAAAAGAA GTTAATGAAG ATACGTCATT GGTATGGTTA 1080

TGTAATCCAA ATAATCCTAC AGGTACATAT TTTAATCATG AGAGCTTAGA TTCGTTTTTA 1140

TCTCAAGTAC CTCCACATGT ACCAGTAATT ATAGATGAAG CTTATTTTGA ATTTGTGACA 1200

GCAGAGGACT ACCCGGATAC ACTTGCTTTG CAACAAAAAT ATGACAATGC TTTCTTATTA 1260

CGTACATTTT CAAAGGCGTA TGGATTAGCG GGTTTACGTG TAGGATATGT GGTAGCAAGT 1320

GAACATGCGA TTGAAAAATG GAACATCATT AGACCACCAT TTAATGTGAC ACGTATATCT 1380

GAATACGCAG CAGTTGCAGC ACTTGAAGAT CAACAATATT TAAAAGAGGT AACACATAAA 1440 

AATAGTGTTG AACGCGAAAG ATTTTATCAA TTACCTCAAA GTGAGTATTT CTTGCCAAGT 1500 

CAAACGAATT TTATATTTGT AAAAACmAAG CGGGTAAATG AACTTTATGA AGCACTTTTA 1560

AATGTAGGGT GTATTACGCG ACCATTTCCA ACTGGTGTTA GAATTACAAT TGGTTTTAAA 1620

GAACAAAATG ATAAAATGTT AGAAGTTTTA TCAAACTTTA AATACGAATA GTAAGTGGGG 1680

AGTGGGACAG AAATGATATT TTCGCAAAAT TTATTTCGtC GTCCCACCCC AACTTGcATT 1740

GTCTGTAGAA ATTGGGAATC CAATTTCtCT TTGTTGGGGC CCCGCCGGCA AGGTTGACTA 1800 

GAATTGAAAA AAGCTTGTTA CAAGCGCATT TTCGTTCAGT CAACTACTGC CAATATAACT 1860 

TTGTAGAGCA TTGAACATTG ATTTATGTCT CAAGCTCAAT GCAGTGTGAA TGATGAGGTG 1920 

AGAGTATTCA GTGTAAAAAG CAACAATAGA TGATATTGTT TTGTATCAAT TGCTTTTTTG 1980 

CTATACTGAA TCAATACTGA TATTTTCAGG AGAAGATTAA AATGACCCGT AAATCAATCG 2040

CGATTGATAT GGATGAAGTA TTGGCAGATA CATTAGGAGA AATCATTGAT GCTGTCAATT 2100
TTCCTGAACA TGATGGATTA ATTACAGAAG TATTGAGAGA ACCAGGCTTC TTCAGACATC 2220

TTAAAGTGAT GCCGTATGCA CAAGAAGTTG TGAAAAAATT AACTGAACAT TATGATGTAT 2280

ATATTGCTAC AGCAGCAATG GATGTACCAA CATCATTTAG TGATAAATAT GAATGGTTAC 2340

TAGAGTTCTT TCCATTTTTA GATCCTCAGC ATTTTGTTTT TTGTGGTAGA AAAAACATCG 2400 

TTAAAGCTGA TTATTTAATA GATGACAATC CTAGACAGCT TGAAATTTTT ACTGGTACAC 2460 

CGATTATGTT TACAGCAGTG CATAATATTA ATGATGATCG ATTTGAACGC GTAAATAGCT 2520 

GGAAAGATGT AGAACAGTAT TTTTTAGATA ATATTGAGAA ATAAAATATA TCACTTGAAA 2580 

AATTTCATGT AGAAAAGATG ATGGATAGGC TATAAAGTAA TTGTGACTGA GATGAACTTT 2640

TATGTCTTAG ACACTACAAC ACTATATTGG CAGTAGTTGA CTGCGGGGCC CCAACATAGA 2700 

GAAATTGGAT TCCCAATTTC TACAGACAAT GCAAGTTGGG GTGGsCCCCA ACATAAAGAA 2760 

ATACTTTTTC TTTAGAAATT AGTATTTCTT ATGCATGAGT GTAACTCATG CATTCATATT 2820 

TTTAAGTACA CATTAGCTGT GACTAATGAT AAAGAATCGC TACATAATCA ATCATTAGTC 2880 

GTTCTTTATC ATTTCCGTCC CGCTCTCAAT AAATGTTAGT CTATCTTATT ATTATAAATC 2940

GGATGAATGT GTTAATCTAT GGCAGATTAC ACGTCATCCG ATTTTTTATA GAATTTGAAA 3000 

AAGACGCATA AACCACTATG ATTTAAAATA CAACATCAAT CATTTTAGTG gCATGCGCCA 3060

AAATTATATG TCTGTTTTTG AAACAGGGTA ATAGCTTAAA GCTAATAAAA ACGAATATAA 3120 

GGTGCGTTGA ATCTTATGAT TACACTCCAA ACCTAATATA ATATCGGGTT AAGATCATTC 3180 

CGGATGCTTA CAAATCATTG ACAGTAAGTA ACTGAATGGC ATTTGGTATA ACCTCAATAT 3240

CAATAGGTGT TTCTAATGAA ATTTCGCCAT CAATATCAAC TTTCATTGCT GGATCTGTTG 3300

TAAGTGAAAT CTTTTTACCA GGTATATGCT CAATACCTTG AGTAATTTCA TTCCaATTCA 3360

TGCTATCACG CTTTTTAAM ATATCATTTA AAATACTGAA ACTTTCTTCA TTAAAAATGA 3420

AAGTGTTCAG TTCACCATCT TGAGGAGACA AATCAGTCaA TGGTATACGA CTACCACCAA 3480

TGAATGGACC ATTTGCTGTT AGTATCATGG TCGTTTCGCC AGAATATGTC TTATCATCTA 3540

TTGATAATTG ATAATTAAAT TGTGTTGGAT TTAGCAGTGT TTTGACAGTT GATCCAATAT 3600 

AACTCAATTT ACCAAATATA TCTTTTGAAC CATCTTGTAC GTTTTCAGCG TTTTGAACAA 3660

TGAGACCTAA GCCAACAAAG TTGAGTGCAT ATTGATTATT TATTTTAATT ACATCGTATG 3720 

TACCAACTTG TGCAGAAATC ATTTGTTCAC TAGCTTGTTT ATGATTAGGT GCTATATTTA 3780 


GCGTTTTTGT AAAATCATTA AAAGTACCGC CTGGTAAAAT GCCAATAGGG AGTTGAAGGT 3840 

CATGTGTCAT AACACCGTTT ATAAGTTCGT TAACCGTGCC ATCACCGCCA AGAATAAATA 3900 
CACCTTCGTT TTCACTCAAT TGAATAGAAA GATGCTTACA AATTGAACTT AATGCTGTTG 4020

TAACTTCCCC AATACCTTGA TTAATATTTT TTAATCCACT GTGTTCATGG TAAAAGAGGA 4080

CACCATGTGT ATATTTATTT TCCATAGTTT AGCCTACTTT CTAAAAATTG GTTCATTAAA 4140

TATATATACC CACTTTTAAT TGTTAATACC AAAAATATGT TTTTAAATAG AGAAAATGGT 4200

AATAAATGAA ATTGATTTCT ATAGAGTGGG ACGAGAAAAT ATAGTTATAG CTGTCTATAA 4260

TGAGCATATT AAGTTTTTAT TTATACTGAT ATCTTGAATT TAATTAATAG AAACCTATAA 4320

AAAAACAGTA AGCCATTTAA ATGACTTACT GTTTTTTGAA TTAGGCCAAC AATATTAACG 4380

TATACCTTTC ATCGCTTTGA TGATTAAAGG TGAGAATGCT AATACAATTG TTGTAACAAT 4440

AATTGCAACA ACACCTAGGA AAATAAAGTA ATTTGTTTGA CCTAGTGGTT CTATTAACTT 4500

AACTAAAGTA CCATTGATTG CTTGTGCAGA AGCGTTAGTT AAGTACCAAA TACTCATCAT 4560

TTGGGCATTA AATGCTTTAG GTGCTAACTT AACAGCAGCA CTATTACCCG TTGGTGATAA 4620

GCATAGCTCA CCGATAACAC AAATAATGTA CGATAAAATA ACCCAGTTAA CTGAAAAGTT 4680

TGATGAACCT GATGCATAAC CTACAATACC AATTAGTATG TATGACGCAC CTGCTAAGAA 4740

CGTACCAATT GCAAATTTTA CTGGCAGGCT AGGTTGTTTA GTTCCAAGCT TTTGCCATAA 4800

AAGTGAAATA ATTGGAGCTA GTAATAAAAT AAATAATGGG TTAATTGATT GGAAGATCGC 4860

TTCACCAAAG TTTGTTTTCC AACCAAATAA GTTTAATTTC ATATCTGAAT GTTCAATTCC 4920

ATATATGTTT AATACATTAG ACCCTTGTTC TTGAATAGCC CAGAACACCA TTCCAAGAAT 4980

AAATAATGGA ATAAATGCTT TAACACGAGA ACGTTCAGTA TCAGTGACAT CTTTACTTCT 5040

AATAATTAAA GTGAAGTAAA TGAnTGGTAA TGCAATACCT AATACTAAAA CAGTATTACT 5100

AACTAAGTTA AATGATAATG AGTTAGTTAA TGCACCAATA ACGATAATTA ATACAATTGC 5160

TAAJSACAACA CTTCCGATAA TAAGACCATA CTTTTTCTTT TCAGCTGGTG TCAATGGGTT 5220

AGTAGGTTTC ATACCAACGC TACCTAAGTT TTTGCGGTTG AAAAGTACAT ACCATACTAA 5280

ACCTAATGCC ATACCAACTG CTGCAATCAA GAATCCGCCG TGGAAGTTTT TAACATTAAC 5340

AAAGTGTTGC AAAATAATAG GTGATAATAA TGCACCCATA TTAACTGACA TATAGAAAAT 5400

AACAAAACCT GCATCCATAC GTCTATCATT TTCAGGATAT AAACGGCCAA CGATATTTGA 5460

AATGTTTGGC TTCATTAAAC CTGAACCAAT AATGATGAAG AACATTGATG TGAATAAGCC 5520

GATTAATGCA AATGGTAAGC TTAAACAAAT ATGTCCGATA ATAATAAAGA CTGCACCTAA 5580

TAAAGTAGCG CCTCTAGTGC CTGTAATTCT GTCAGCAATC CATCCGCCTG GTATTGATGT 5640

CATATAGATT AATGAACCAT ArACTGACAT AATTGACATA GCTGTTGTTT TATCAATTCC 5700
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGGACTATTA ACGGCGaAGA AGATTTAACG AAATACTTAC AAACCAATGT TGATGGTATT 60

ATCACAGATG ACCCAGCATT AGCTGATCAG ATTAAAGAAG AAAAGAAAGA CGAAACATAC 120

TTOGATCGTT CTATAAGAAT TTTGTTTGAA TAATATAAAC AAAGACCTCT AAAGTTATCA 180

AGATGATACC TTCAGAGGTC TTTTTAATGT TGCCATCTAT GGGATAGGCA ATCGTTTCAT 240

TCGTTTATAT TCATATGACA AGTATTTGTA TGGCAATTTG GCGTCACAAA CACTTACATG 300

ATTTATTGGT GAATTATTAA TTGTTTTGTG AATGCAAAGG GTTAGAAATT GAATTGTAAA 360

TACTTTCTAA TCTTTGTTTC GCTTTAGTCA TTTGATCCAA ATTTTTAGTG CGTATAGCGG 420

ATTTTGCAAT ATAGTGCGCA CTAAAATATC GCGTTTTTGA AACGCATCTA AATTTAGGTA 480

CGATAATTTA TTTAAGTCAG TGTTTGCTAT TAATTCATGT AATTGATCTA CAAGCGCTTG 540

ATGTTGATAC GTATGTGATG TAGTTTCAGA TTTGCTTGCT AATTTAATAC CAGTCGTATC 600

AAGGAGCGCC GCTTTAATAC CAGCAACTAA ATATGTTTTG ATTTTCATTT GTGTTGTCAT 660

GCTTTGTTAC TCCTTTGATG TACATTAATC AAAAAAATTA TACACTATTG TATATTGCAA 720

AGCTAATTAA CTATAACAAA AAGATAGTTA ATGCTTTGTT TATTCTAGTT AATATATAGT 780

TAATGTCTTT TAATATTTTG TTTCTTTAAT GTAGATTGGG CAATTACATT TTGGAGGAAT 840

TAAAAATTTA TGAAAAAGCA AATAATTTCG CTAGGCGGAT TAGCAGTTGC ATCTAGCTTA 900

TTTACATGGG ATAACAAAGC AGATGCGATA GTAACAAAGG ATTATAGTGG GAAATCACAA 960

GTTAATGCTG GGAGTAAAAA TGGGACATTA ATAGATAGCA GATATTTAAA TTCAGCTCTA 1020

TATTATTTGG AAGACTATAT AATTTATGCT ATAGGATTAA CTAATAAATA TGAATATGGA 1080

GATAATATTT ATAAAGAAGC TAAAGATAGG TTGTTGGAAA AGGTATTAAG GGAAGATCAA 1140

TATCTTTTGG AGAGAAAGAA ATCTCAATAT GAAGATTATA AACAATGGTA TGCAAATTAT 1200

AAAAAAGAAA ATCCTCGTAC AGATTTAAAA ATGGCTAATT TTCATAAATA TAATTTAGAA 1260

GAACTTTCGA TGAAAGAATA CAATGAACTA CAGGATGCAT TAAAGAGAGC ACTGGATGAT 1320

TTTCACAGAG AAGTTAAAGA TATTAAGGAT AAGAATTCAG ACTTGAAAAC TTTTAATGCA 1380
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GTTGTATCAT ATTATGGTGA TAAGGATTAT GGGGAGCACG CGAAAGAGTT ACGAGCAAAA 1500 

CTGGACTTAA TCCTTGGAGA TACAGACAAT CCACATAAAA TTACAAATGA ACGTATTAAA 1560

AAAGAAATGA TTGATGACTT AAATTCAATT ATTGATGATT TCTTTATGGA AACTAAACAA 1620

AATAGACCGA AATCTATAAC GAAATATAAT CCTACAACAC ATAACTATAA AACAAATAGT 1680 

GATAATAAAC CTAATTTTGA TAAATTAGTT GAAGAAACGA AAAAAGCAGT TAAAGAAGCA 1740

GATGATTCTT GGAAAAAGAA AACTGTCAAA AAATACGGAG AAACTGAAAC AAAATCGCCA 1800 

GTAGTAAAAG AAGAGAAGAA AGTTGAAGAA CCTCAAGCAC CTAAAGTTGA TAACCAACAA 1860

GAGGTTAAAA CTACGGCTGG TAAAGCTGAA GAAACAACAC AACCAGTTGC ACAACCATTA 1920

GTTAAAATTC CACAGGGCAC AATTACAGGT GAAATTGTAA AAGGTCCGGA ATATCCAACG 1980 

ATGGAAAATA AAACGGTACA AGGTGAAATC GTTCAAGGTC CCGATTTTCT AACAATGGAA 2040

CAAAGCGGCC CATCATTAAG CAATAATTAT ACAAACCCAC CGTTAACGAA CCCTATTTTA 2100

GAAGGTCTTG AAGGTAGCTC ATCTAAACTT GAAATAAAAC CACAAGGTAC TGAaTCAACG 2160

TTAAAAGGTA CTCAAGGAGA ATCAAGTGAT ATTGAAGTTA AACCTCAAGC AACTGAAACA 2220 

ACAGAAGCTT CTCAATATGG TCCGAGACCG CAATTTAACA AAACACCTAA ATATGTTAAA 2280 

TATAGAGATG CTGGTACAGG TATCCGTGAA TACAACGATG GAACATTTGG ATATGAAGCG 2340

AGACCAAGAT TCAATAAGCC ATCAGAAACA AATGCATATA ACGTAACAAC ACATGCAAAT 2400 

GGTCAAGTAT CATACGGAGC TCGTCCGACA TACAAGAAGC CAAGCGAAAC GAATGCATAC 2460 

AATGTAACAA CACATGCAAA CGGCCAAGTA TCATACGGAG CTCGTCCGAC ACAAAACAAG 2520 

CCAAGCAAAA CAAACGCATA TAACGTAACA ACACATGGAA ACGGCCAAGT ATCATATGGC 2580 

GCTCGCCCAA CACAAAACAA GCCAAGCAAA ACAAATGCAT ACAACGTAAC AACACATGCA 2640 

AACGGTCAAG TGTCATACGG AGCTCGCCCG ACATACAAGA AGCCAAGTAA AACAAATGCA 2700

TACAATGTAA CAACACATGC AGATGGTACT GCGACATATG GGCCTAGAGT AACAAAATAA 2760 

GTTTGTAACT CTATCCAAAG ACATACAGTC AATACAAAAC ATTACGTATC TTTACAACAG 2820 

TAATCATGCA TTCTATGATG CTTCTAACTG AATTAAAGCA TCGAACAATC GGAAGCATAT 2880 

TTCTAAATTA TTTATTCATT ATAGTCTTAA ACATAACATG ACCTAATATA TTACTAACCT 2940 

ATTAAAATAA ACCACGCACA TCTAAGTGAT ATACGACAAT CACAGCAATA ATAATTGCTT 3000 

TAGAAAGTCG TGCCGAACTG GAACTTACAA GTCTAGTTCG AACACACACT GATGTGAGTG 3060 

GTTTTCTTTA TTTTAAACAT GAACAATCAG ATAAGTTACT AGCATTAGCA AATATTATTA 3120

AATCAAAGGG CTTCGATTCA TAAAATTTAA AACAATGATT AAAATTAGAC GTGTAAATGT 3180 
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TATTTCACAC AGCTTCATTA ATAAAACGAA ATTGCTTCAA CCCGCTTCAA CTTCAACTGG 
CTTCAACTTC AGCCTACTTC ATTCAATAAC AAAACGAATC CGCTTCATCC AAAATCAACC 
ATTCTAACGC ACATATTCAA ATATAGCAGC TGCACCCATG CCGACACCAA TACACATCGT 
AACCATGCCG TAACGGCTAT CGGGACGTCT ACCCATTTCA TTAAGTAAAC GCGCGGTTAA 
CATTGCGCCT GTAGCACCTA ATGGATGACC TAAAGCAATA GCGCCACCAT TCACATTCGT 
ACGTGATATA TCTAGACCTA CTTCTTTAAT AGATGCAATC GTTTGAGAAG CAAATGCTTC 
GTTCAATTCG ATCAAATCAA TGTCTTCAAC AGATAGATTG CTGAGTGACA ATACTTCAGG 
AATCGCATAT GCAGGCCCAA TACCCATAAT TTTCGGGTCA ACGCCTACTG CCTTAAAACC 
AACGAATCGT GCAATAGGTG TCACGCCGAG TTCTTTCACT TTATCTCCAG ACATTAAAAC 
TACAAATCCT GCACCATCAG AAAGTGGGGC AGATGTTCCT GCAGTCATAG TGCCGTCAGC 
TTTAAATACT GTACGTAATT TGGCTAATGC CTCCATCGTG GTGTCAGGGC GTATAAATTC 
ATCTTGGTCA AAGATATTTG TGTGTACTTT TGGTCCTGCG TTTGTATATT CAACTGAGTT 
TACTTGTATT GGAATAATTT CATCTTTGAA CCGACCATCA CGTTGTGCGT CATAGGCACG 
TTGATGACTT CTGACAGCAT AAGCATCTTG ATCTTCGCGT GATACGTCAA ATTGGGATGC 
TACATTTTCA GCAGTTAAAC CCATAGGATA TGACGCACCT ATATCATCAT ATTGTAAGGT 
TGGATTGTTT GTGGGCTCGT TGCCACCCAT TGGTACGGCA CTCATCAATT CAACGCCACC 
AGCTACAAGT ATATCTCCTT GACCAGCCAT AATTTGATTG GCTGCAATCG CGATGGTTTG 
TAATCCTGAT GAGCAGTAGC GATTCACTGT TTGACCCGGT ACCGTGTCAG ATAATCCCGC 
ACGCAATGCA ATCGTTCGTG CAATGTTTTG GCCTTGTAAT CCTTCTGGAA AAGCCGTACC 
AACAATGACA TCTTCAATCA TATTCTTATT GAATTTTCCG TCAATACGTT TCAATACGCC 
TTGTAATACT TTGGCTGCGA CATGATCAGG TCTTTCGTGG AATAATGCGC CTTGCTTTGC
TTTCGCTGCG GCTGAACGCC CATAAGCTAC AATGTATGCT TCTTGCATGG TTATCATCCT 
CTCTTAATGA CTATCTTTTA ATTACGTAAT GGCTTACCAG TTTTTAACAT ATGTGCAATT 
CTTTCATATG ATTTTTTAGA TTTTAGTAAG TCAATAAAGC CAATTTTCTC CAACGATTGA 
ATGTAACGTT GATTGATAAA TGTATTTCTT GGTAAATCAC CACCCGCTAA AATTGTGGCG 
ATATTTAAGG CAATATGATA ATCATGGTCG CTAATAAAAT GACCCCGTCT TTGCGCATCT 
AATTGTCCTT GGATCAATGC TTTGAAGTCT TCACCTAAAG CGATATATTG ATGTCTAGGA 
TTCGGAATAT AGTTTGTTTC TGCTTCATAT TTCGCACGTT TGAGCGCAAC TTCGACACGT
TGTGCTGTAT TGAAAATAAT CGTATCTGTA TCACGTAAAT AACCATAACG ACGTGCCTCA 
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TGTTTGTCAT CAAACTTATG CGATGTGCGT AATATGCGAT CAGCCATTTC TGCAAGGCCA 5100 

CCGCCACTCG GTAATAAGCC AACACCTGCT TCAACAAGAC CGATATATGT TTCACTTGCA 5160 

GCGACAACAA TAGGTGAGTA AAGTACAAGC TCACAGCCAC CGCCTAAGGC ACGACCTTGA 5220 

ACAGCTGTGA CTACTGGTTT CAAACTATAC TTCAAACGAT TAAAGCTATA ATGTAATTTA 5280 

TCAATTGATT GTGCAACGAC ATCATCTACA AGACCGTCTT CATGCGCCTT TTTCATTAAG 5340 

AAAAGGTTAG CACCCACACT GAAATTGTTA CCATCTGCAT AAATAACCAT ACTTGTGTAA 5400

TGGTCATTTT CCAGTAAATC AATCGCATCA ACTAACGCAT CGTTGAATTC ATCGGTAATG 5460

ACATTATTTT TACTTTGTAA TTTCAGTAAC AGTTGATCAT CATGAGTTAC GGAAAGTTTG 5520

GCATCACCTT TATCCCAAAG TTCATCTTTT ACGAAGTGAG AAATAGGTGT TGCATATTCA 5580 

ATGGTCTCAT CTTGTTTATA AAAGCCACCA TCTAAATCAC TAATCCATTG TGGTAAGTCT 5640 

CCAAGTTCGT CTTCCATACG TGTTTTAACA CGTTCGTATC CCATTGCATC CCATAATTGG 5700 

AATGGACCAA GTTTCCAGTT GAACCCCCAG ACAAGCGCAC GGTCTATGTC TCGGAAATCA 5760 

TCGGTAGCTT TAGGTACATT GATAGCAGAG TAATAGAAAT TATTACGTAA TGTCTCCCAT 5820 

AAAAATAGTC CCGCTTCGTC TTGCGCATTG AATATGGTAT CAAGGTTATG CACTAAGTCT 5880 

TTATTAAATT CATTTAAAAT TGGTAATTGT GGTTGCGATA CAGGTACATA ATCTTGTTTT 5940 

TCAACATCGT AAACAAGTCG AGCTTTAGTT TCTTTATCCT TTTTGTAAAA TCCTTGTTTC 6000 

GTTTTACGTC CGAGTGCGCC ATTGTCAAAC AACGTATTTA CAATTTTGAC ATCATGAAAA 6060 

TAAGGTGTTT CTTCAGGTAC TTGTTGCATG CCTTTAATTA CAGACACTGC AATATCTAAA 6120 

CCGACTAGGT CAGATAGCGC ATATGTACCT GTTTTAGGAC GACCAATCGC TTGCCCAGTT 6180 

AAAGCATCCA CATCTACAAT GCTTATCTTG TGTTGCTCGG CGCGATACAT AATATCATTC 6240

ATTQTTTGCG TGCCGACTCT ATTTGCGACA AAGCCAGGCA CATCATTGAC GACAATGACA 6300 

CCTTTACCTA ACACATTTTG CGCGAAATTT TTTACATCTA ATATAATAGA TTCCTTCGTG 6360 

TGTGACGTAG GTATTAACTC CACTAATTTC ATAATACGTG GTGGGTTAAA GAAATGTAGA 6420 

CCAAAGAATC GTTCTTGATC CTTCTCGTTA AATGCTTGAG CAATCGCATT AATTGGAATA 6480

CCTGATGTAT TTGTAGCGAA TAAAGCATCT TCTTTAGCAT GTTGTAGAAC TTGTTGCCAA 6540

ACAGCATGCT TAATTTCAAT ATCTTCTTTG ACTGCTTCGA TATATAAATC AGCATCATCA 6600

TTTACCAAGT CATCATCAAA ATTACCATAT GTTAAATGAC TCGCTAGATT TAAGTCGAAT 6660 

AGTAGCGGCC GTTTCTTATC TGTAATTTTA TCGTAAGATT TTTTCGCAAT GAGATTTGGA 6720 

TCGTTTTTGT CCACTACAAT ATCTAATAGT TTTACTTTAA GTCCAGCATT CACAAAAAGT 6780 
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GTGATTCCTC CAATTTAGTT GAGGATAAGA TAACCATTAA GATAATTGGA ATAACGTTGC 
TATTTTATAA AATTAATTAA GTATCTTTGA CAGTCATCTT AGCCTCTTAT TTAAGGAAAA
AGCTTTATGC TTAAAATAAG TCTTTTTTAG TGAAATTAAT GCATCTCATA TAATTATTTG 
CTATTTATAC GAAAGCAGAA TCTCCAGTCA AAGCGCGTCC AATTACTAAG GCATTAATTT 
CATGTGTACC TTCGTACGTG TAAATCGCTT CTGCATCAGA GAAGAAACGT GCAATATCAT 
AATCGTCAGC TAGTATGCCA TTACCACCTG TAATACCGCG GCCCATAGCT ACTGTCTCAC
GCAAACGTAA GGCATTCATC ATCTTCGCCG TTGAAGTTGC AACCTCGTCA TATTCACCAT
GTGCTTGCAT ATTAGCTAAT TGAGCACATG TTGCCATTGC TTGAGCTAAA TTACCTTGCA
TCATTGCTAG CTTTTCTTGT ATTAACTGAT ATTTACTAAT TGGTTTGCCG AATTGcTTAC
GCTCAGTGAC ATAATCTAAT GTGGCACGTA AAGCGCCAGC CATACCACCT GTAGCCATAT
AAGCAACGCC TGCTCTCGTT GAATAAAGAA TTTTGGCAAT ATCTTTAAAG CTTGTTATGT 
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TTTGTAAGCG ATCCGCTTCA TCTACTTTGA CATTAGTTAA TTTAATTAGG GCGTTAGGAA 
CAATGCGAAG TGCGATTTTA TTATCAATGA CTTCAATATC GACGCCATCT TGTTCTGGTC 
TGACTACAAA GCAATGGGGT TTGCCAGTTT CTTTATTTAC TGCGAATACT GGAATGACAT

CAGATACATG TGCACCACCA ATCCATTTCT TTTCACCATT GATAACCCAA GTATCGCCTT 
GGCGTTCAGC GACTGTTTCA AGACCTCCOG CAACGTCCGA ACCGTGTTCT GGTTCAGTTA 
AAGCAAAGCA TGTACGCAGT TCATGTGACT GTAATTTAGG TACATATTTC GCAATTTGTT 
CTTTGCTACC TCCGAAATAG AAAGTGTTAT GCCCTAAACC TTGGTGAACA CCGAGTAGGG 
TAGCTAAGGA AATATCAAAT CGCGCGAGTA GGTAAGACAT GAAAAACTGA AATAGTTGAC 
TAGGCATTTT GGCGTTTGGA CGATCCTTGT AAAGTAATGG ATTGTTAAAA TAATTTAATT 8040 

CTCfiCAGATC-TTTAAAATAG TCCTCGGGTA CAGTAGCGTC TATCCAATGT TGATTAATAT 8100 

TTTCACGGTA CTTACTTTCT AGCAATGAAT CTACTTGTTG TAAAAATTCG ACTTCACCGT 8160 
CTGTTAAACC TTTAGCAATA CTAAGTACAT CTTCAGGAAA TAATGTTTTT AAGACCGTTT 8220 
CTTTTTCAAA TGTCATATAA ATTCCTCCTA AAAATAATAT GAATACTAAT GTGAAATGCA 8280 
TTTAATTCAA AAACAACACG CTTTATTTGT AAACGCTTAC ACTAAATGTC AAAAATTTTT 8340 
ATCACCTTTA AAGTGTTTGC GAGACTTTGT CATTCATCAT TTGTCGAATC GCAAGTTTAT 8400 
CTGGTTTCTG CGTACTGTTT AACGGCATAT GTGTCACTGG TACATACATT CTTGGGACTT 8460 
TATAACCTGC TAAACGACTt CGCATATGTT GATTTAAAAT TTCAGCGTAA TGAGGTTCAT 8520

CTTCGCGAAG TATAATGGCT GCAGCAATTG ATTCACCATA TTTTGGATGA TCATAGCCAA 8580
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AGACATTTTC 


GCCACCAGTT 


ATGATTAATT 




rTraATAATA AATATATCGC 


8700 




CATCGTTGTC 


CATCTTCGCT 


7\ 7\^*TV^7A f^f* 7A 

AACj 1. L_A.v_L-A(j 


lTAAiAAAln 


tpgappatga AATGCTTTGG 


8760 


5 


CAGTCTCTGC 


TGGTTTATTC 


LAAlAXv-LiVj 


\jV_Aj I untn X 1 


TTTARCCTTA ATTGCAAGTT 


8820 




CGCCAATCTC 


ACCAGTAGGT 


AC 1 I LL 1 LAL 




AAnRATACRT GCATCAACGA 


8880 




ACATG ACTG C 


1 i l ACCAATA 


v- 1 v_A 1 xTjvtv- 1 


1 ALb 111 IvjrV 


ATTTTCCGGT GTATTAACAA 


8940 


10 


GTAGAAGAGG 


TGCTTCAGTT 


TV A Ti ?\ I* 1 7\ <~* /""• 

AAALCA 1 Abt 


Lul 1 AMI AMI 


nTTTATGCCA TATTGTTTAA 

\J XXX X VJ\— V*^V X *» X X VJ X A * **** 


9000 




AAGCTGCTTG 


GATACTTGGT 


AATGGTTGTG 


AACCACCTTG 


ri ATG AT AT AA TC CAT AG CT C 


9060 


15 


TAAAATTTTC 


m /^i/i m mum * it ifc 1*. 

AGGATTAAAA 


TTACTAGCAC 


GTAGCGTACT 


ATAATAfATT GTCGGAATCA 


9120 


TGATAATAAA 


TGTAGGGTGA 


TATTGTGCAA 


TCATGTCATT 


PAATTCTTCG CCGTTAAAGT 

\ mm f\f \ X X V«. X X V_ Vp-Vj X X X 


9180 




AACGTTGAAG 


AATAAGTGTG 


CCACCTGACA TTAATACTGG 


TAATAPAGTA TCGTTAAACC 


9240 


20 


CTAAAACATG 


GAACATTGGT 


GTTGATACAA 


TCGTAATATA 


r'T'TTTiZi Arm aacttatacg 


9300 




TCAGCTCTAA 


GTTTGCACCG 


TTATGAACAA 


ATGATTCATA 


T'na.r^zvzvPAT'P ACIAPCTTTAG 

X Vj/\Vj^\rtV__r\ X /»-V — /^.> — v- xxx rk\j 


9360 




GTGATCCGGT 


TGTACCACTT 


GTATAAATTA 


ATGCTGCAAG 


TATOTTVITWJT TPAArAGGTG 
/\ x \_ X X vj x vtvj x x ww^nvju x vj 


9420 


25 


TTGCTTGAAA 


AGGTTGGTGA 


TAATCTGGAT 


TTACGATTTC 


TATrTATTAfTOr* nrCAPATCAA 


9480 




TATCCATATG 


CAATAAGTTT 


TGGTCAATAT 


CGGTGAGTGA 


21 PTT 7A 7A ATHT TTTTr AC? PAT 
X X nnn X w x x x x x v^xvj x 


9540 




AG AAGAG CAG 


TTTTAATTGT 


GCATCTTCCA 


CAATGGCTGC 


£-\£\ XXXwXXXX VJV?Vj X X rtrtVJV*^. 


9600 




GCCAATTCAA 


TGGTAAAAAA 


ACCGCACCTG 


TTTTAAAACA 


AGP AAA PA AT AAATPTAATA 


9660 




TTGCAATATC 


ATTTGGCG CA 


AAAATACCGA 


TAACATCGCC 


' 1 TTTTTAACA CCTTGAGATG 


9720 




TTAAATAATG 


TGC CAT ATT A 


TCAGCGCGTG 


CATTGAGTTG 


TTGGTATGTP PAAGATGTTT 
X X VJ>J X x» X VJ X * — Vfiriv/i x vj xxx 


9780 


35 


GTTTTGCGTG 


ATCAATAACG 


GCAGGCTTGT 


CATCATCGAA 


GTPTGAAPGC GT"rT"rTATCC 

Vj X V>> X vjnnvvVo vj x x x x x n x v-v» 


9840 




AATGGAAATT 


CATTAGTATA 


CCCCCTTTAG 


CTTCACTTTC 


ATA CTTTATG AATTG ATTGT 

/\ X nw XXX <V x VJ **** x x vjr\ x x u x 


9900 


40 


TTAAGTTGTC 


CCCArriiTxC 


TTTGTAAATG 


CTGGTATCAA 


TTAATTTTAA ATG AT CAG CA 


9960 


ATAATTGGTT 


TAAAAGCCAT 


TTGATTCAAA 


ATATCTTTAT 


GPAAATPAAG ACCTGGTGCA 


10020 




ATTTCAATTA 


y^rnm^nytfe % y^ y^ y*^ 

GTTTCAAGCC 


TTGATTGGTG 


AGTTCGAATA 


PTGPAPGATP AGTAACAAAA 
v» X Uw\v_uni v_ nvjxnnv- . 


10080 


TAGATTTCTT GCTCGAGTGA TTGTGAATAT TGTGCATTAA AGTCGATATG GCTCACATCT 10140
GATACAAATT TCTGGTTTTG TCCTTCAGTT TCAATGTTTA ATCGTTGATT ATGGCATGAG 10200
ACATGACTGC CAGCTACAAA AGTACCTGAA AAGATAATTT TATTTACAGA TTGCGTAATG 10260
TCTATAAAGC CACCACATCC ATTTAGTCGG TCATTGAAGT AAGACACGTT GACATTGCCG 10320
TATTGATCAA CCTCAGCAAA GCTAAGATAG GCAACTGATA CACCATTGTT ATAAATAAAA 10380
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CGACTCCCAA CGAATCCACC GAAAATGCCA ACATCTAAAA TCGGTTGCAC ATCATGTTCA 10500
ACACATTCTT CATGCAATAA ATTAGAGAGT TCATTATTGA TGCCATAACC GATGCTAATT 10560
GTATCGCCAT AAGTTAAAAA CTGAGCAGCA CGTCGGAGAA TCAATTTGCG ACTATTAAAA 10620
GGTAATGCGG GTTCAGGTAT TCCATCAATT CGTTCTTCTC CAGACAAGGC TGGTAAATAA 10680
TGACTTTGAA TTACTTGGCG GTGATTCTTT TCATCTTCTG TGACGTATAC ATAATCGACA 10740
AGATTTCCTG GGATAACAAC TTCATTCGGT TTTAGTTGAT AGTCGTCAAC TAAAGCTTTA 10800
ACTTGTACAA TAACTTTCCC ATGATTGGCT TTCGCGTTTA ATGCGACATG ATAACACTCG 10860
CTCAAGTACG CTTCTTGAGT TAAATAAATG TTACCTTGTT GATCTGCGTA TGTTCCTCTC 10920
AGTAGTGCCA CATCAACGCT AGGGAATGTG TAATGTAAGT ATGTTTCATC GTTGATGGTT 10980
ACTAATGAAA CTAAATCATC CGTTGTTCGT GTATTTACTT TACCGCCACC GTATCTAGGA 11040
TCAACAGCTG TGTTTAATCC GATTTTAGTA ATAACTCCAG GTAATAATTG ATTACTCTGA 11100
CGATAATGAG TTGCAATGAT ACCTTGTGGT AAAAAATAAG CTTCAATGTC ATTATTTTTC 11160
ATTGcTTGTG CCGTTTTGGA AGAAGCCGTT AAAATACTCA TAATGACACG TTTAATCATG 11220
CGACGTTCTA TAAAATCATC TAAATCCGGT GCGGCACCTA AACTATGAAT ATCATTCGCT 11280
AATATAAACG TTAAATCATT GGGCGTATGA TATGTGTCAT GTTGCGCTAA CACAGCACGT 11340
AGAACTTCGG CGGGTAAGTT GGCTACAGCT AATGCTGGTA AACCAATCAC ATCACCATCT 11400
TTAATGATAT GTTGTAAGTC GTGCCATGTG ATTTGTTTCA AGCAAGTCAC CTCCATCACA 11460
TTTGATAAAA TATAGCGTTT TTACACTTTG TGTAAACCCT TaCAAGAAAT ATAACATAAC 11520
GACGTTTAAA ATCAATTAGA AATATCTTTT TATTCTGATA ATAGACACAG TATAGACACA 11580
TTTTGATGGT CGATAACAAT TGTAATATCA AGGGTTTGTA ATGAATTGAA TATCATTAAA 11640
ATACTTATAT AAAAATATTG TTCGGAATAT AAAAAGTTAA ATAGGTTTTG ATTTTTAAAT 11700
ATGAAATACA AAGTGCCCAA TCGAACAAAG TATTTATATT AAAATATGGA AAATCCATCA 11760
ATATTAAATT AAAATAGTTT TATTATGAAA AGTGAAAGTA GGTAAGTCTA TGGAAGGTCT 11820
TAATCATCGA AGAAATACAG AAAAAGAAGA GACAACACAA ACGCAATCaG TTGCACCTAA 11880
TACAGGTGAA GAGGGGATGT CATCAGCAAG TACACAATCA ACTAAGACGT CCGACATACA 11940
TAATGAATCT ATCGATAAAC AAATGGAAGC TAAAGCGCAT GAAACAGCGC AAAATACAGA 12000
TTTAAAAAAC GAAGCAAGAA GTTTATTTGA TAATGCAACC AAATCAATCG GTAGACTAGC 12060
GGGCAATGAT GAAAGCTTAA ATCTTAATTT AAAAGATATG CTTTCTGAAG TATTTAAGCC 12120
GCATACTAAA AACGAAGCAG ATGAAATATT TATAGCGGGT ACTGCTAAAA CTACGCCAGC 12180
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TTTCACAGTA ACATTTATTG GATTATGGGT CATGGCAGCA ATTTTTAATA ACACTAACGC 12300
GATTCCGGGT CTCATTTTTA TAGGGGCTTT AACAGTACCA TTATCGGGTT TGTTCTTCTT 12360
TTATGAATCA AATGCGTTTA AAAATATTAG CATTTTTGAA GTTATTATCA TGTTCTTTAT 12420
TGGCGGCGTA TTTTCATTAC TAAGTACGAT GGTATTATAT AGATTTGTCG TTTTTAGTGA 12480
TCAATTCGAA AGGTTTGGTT CTTTAACATT TTTCGATGCA TTTTTAGTAG GATTAGTTGA 12540
AGAAACTGGA AAAGCACTCA TTATTGTTTA TTTCGTCAAT AAATTGAAAA CAAATAAGAT 12600
TTTGAATGGA TTATTAATCG GTGCTGCTAT TGGTGCAGGG TTCGCAGTTT TTGAATCAGC 12660
AGGTTATATT TTGAATTTCG CTTTAGGAGA AAATGTCCCA TTATTAGATA TTGTCTTCAC 12720
ACGTGCGTGG ACTGCGATTG GTGGTCATTT AGTTTGGTCA kCGATTGTTG GTGCTGCAAT 12780
AGTTATTGCG AAAGAACAGC ATGGCTTTGA ATTCAAAGAT ATTTTTGATA AACGCTTTTT 12840
AATATTCTTT TTATCAGCCG TTGTTTTACA TGGCATTTGG GATACATCTT TAACTGTACT 12900
TGGCAGTGAT ACGTTGAAAA TATTTATTTT AATCGTTATT GTGTGGATAC TTGTATTCaT 12960
TTTAATGGGG GCAGGTTTAA AACAAGTGAA TTTACTGCAG AAAGAATTTA AAGAACAACA 13020


GAAAAAAGTA GACGAATAAT AATTAAAGCT TATGTTGCTC ATATGTTTGT GACATAAGCT 13080
atttttataa TTTGTCTTTA AAAGAGTGGA ATAGGAATAC TTITTGGAGT TAAAAAAGTG 13140




TTtCACGTTA AACAAATAGT GACAATTAGA TTTATATAAA ATGAACATGA TTCACTGAAA 13200
GTATGTAATA ATCATTTTAT TGAAATTCAT CAAACAGAAA TTAATACAAT CATATAAGCA 13260
AATTAAACCA CGCCATAATC ATATTGGATG ACTTCGGCGT GGTTTTTATA GTTGAAGCAG 13320
GGCTGAGACA TAAATCAATG TCCCACACTC CCTTATCGTT CAATCGTTGT TCGATAATCG 13380
ATTAAATAGA TACCTTCAGG TGTTACTTTA TAATTTTTAA CCTTAGAGTT AGCAGCGACT 13440
ATTEGATCGT TGTAAGCAAT ATAACTGTTT GGTACATCTC GACTTGATAA TTTAATAATA 13500
TCAtTAGAAA TATTGTGACG TTCCTTAACA TCTACAGTAT GATTCAATTG ATTAATTAAA 13560
TCATCGACGT TGCTATTATT GTAGTCTCCT TTATTAATAG CACCATCTTT TTTATATGCT 13620
TGATTAAAGA AATAACCTGT ATCTCCACGA GGAATTGTTC CGAAACTATA CATCGTTGCA 13680
TCCCATGCAG AACGGTCTTT TAAGTAACCT TCTATGTCAT CAACACTTTT AATGTCGATT 13740
TCAATATTTG cttttttagc ATCTGATTGT AATACTTGCG CAATTTTCGA TAGCTCTGGA 13800
CGACCGTCAT ACGTAATTAA CTTAATTTTT AAAGGGTGTT CTTTTGTATA ACCATCTTTA 13860
GCTAATAACA Tl-riTGCTTG TTCGATATTT TGTTTGGTTA ACTTAGGTTC TTTAATATAT 13920
GGAATTTTAT CATTAAATGG ACTCGTTGCA GGTTTCGCAT AACCTTGATA AATATGATCT 13980
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TTAT t AGTAT GATTATACAT AAGTaAGAAG TTCTAAAn 
(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

TGAAAACTAA AGTGTTTCTA ATGCGTGACT AAAATTAGTA ATAATTAAGT TCTCATGATA 60
ATAGGTATTT TTGAAAAATG GAGGAGTCTA TAAATGGGTA AAAAAATGGG TCTAGGTTTA 120
TCTATTGCAT TGGTTGTTAT TGGTATTGCC GTTGTATGTT TAATGATTTT TTCTAGTCAA 180
AAAACGACTT ATTTTGGTTA TATGAATAGT AATACAAATG CAGAAAAAGT TGTCAGTGAA 240
AAAGATGGAT TAGTCAAACA TAATATCAAA GTAGAACCAT CTAATGATTT CAAGCCGAAA 300
AAAGGAGACT TTGTAAAATT AGTTTCTAAA GATGATGGGA AGACATTTTA TAAACAAGAG 360
ATTGTTAAAC ATGATGACGT CCCACACGGT TTAATGATGA AAATTCACGA CATGCATATG 420
AATTAATAAA AAAGCATCTA TAACGTAATT TTGAAGAAGT AGAGTTATCT TCTTATGCGT 480
TTTAGA 486

(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1626 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

GAGGTCTATA TACAATTATG GTTGTTCCAG TTAAACGAAC TGATGGCTTT ATTACTAAGT 60 

TTAATAGATT AATTGAAAGA CGATTATTAC GTCATTTCAG TAAAAAAGGT TATATCACAT 120 

GGGAGGAAAA TTGATTGTCT GACATTTTAA AATGTATCGG TTGTGGTGCG CCACTTCAAT 180

CTGAAGATAA AAATAAACCT GGTTTTGTAC CAGAGCATAA TATGTTTCGT GATGACGTGA 240 

TTTGCAGACG TTGTTTCCGC TTGAAAAATT ATAACGAATT CAAGATGTAG GATTAGAAAG 300

TGAAGACTTT TTAAAATTAT TATCAGGACT TGCGGATAAA AAGGGTATTG TCGTCAATGT 360 

CGTGGATGTA TTTGACTTTG AAGGATCATT TATTAATGCA GTTAAACGTA TTGTCGGAAA 420 
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TCG AG 1 I AAA 


GAATGGTTAA 


ft ft /*"*/""• 7\ TV /"» TV 

AACGAACAGC 


AAGAAAATAT 


GGTTTGGAAG 


CTGACGATGT 


j •* \j 




CGTATTAATT 


TCAGCTGAAA 


AAGGCTGGOO 


CATAGACGAC 


TT ATT AT CAT 


CAATTGCGAA 


600 


5 


TATTCGAGAA 


AATGAAGATG 


TGTATA1 IAj I 


AGGGACAACG AATGTTGGGA AATCTACATT 


660 




GATTAATAAA 


CTGATTGAAG 


CTAGTGTTGG 


TGAAAAAGAT 


GTAGTAACAA 


CTTCAAGATT 


720 




CCCTGGAACA 


ACTTTAGATA 


TGATAGATAT 


TCCTTTAGAT 


GAAACATCAT 


TTATGTATGA 


780 


10 


TACACCAGGT 


ATTATTCAAG 


» «-r%/-i jl TV J\ TV T* 

ATCACCAAA 1 


GACGCATTTA 


GTTAGTGAAA 


AAGAATTGAA 


84 0 




AATTATTATG 


C CT AAG AAAG 


7\ » 7\ rp J\ TV TA IV /"•TV 

AAATAAAACA 


ACGCGTATAT 


CAATTAAATG 


AGGCGCAGAC 


900 


15 


ATTATTCTTC 


GGCGGTCTAG 


CG CG CAT AG A 


TTATGTATCA 


GGTGGTAAAC 


GTCCGTTAGT 


960 


TTGTTTCTTT 


TCTAATGACT 


MV* TV TV T* TV TV 7\ 

TGAATATACA 


TCGTACTAAm 


ACGGAGAAGG 


CTAATGATTT 


1020 




ATGGCGTAAT 


CAACTTGGCG 


ATTTATTAAC 


GCCACCTGGA 


AATCCACAAA 


ATTTTGATCT 


1080 


20 


TAATGAGGTA 


AAGGCTGTTA 


GACTTGAAAC 


AGGCAAAGAG 


AAACGCGATG 


TTATGATCTC 


1140 




TGGTCTAGGC 


TTTATAACTA 


ft\ ft ft ^^^^ 

TAGGACCAGG 


GGCTAAAGTA ATCGTTCGTG 


TTCCTAAAAA 


1200 




TGTTGAkGTT 


GTATTAAGAA 


ATTCTATTTT 


ATAAGGTGaT 


TAAAAAAATG 


AAATTTGcAG 


1260 


25 


TTATAGGAAA 


TCCTATTTCA 


CATTCCTTGT 


CGCCCGTTAT 


GCATAGAGCA 


TV TA TTTTl A * 1 1* 


13 20 




CTTTAGGATT 


AGATGATACT 


TATGAAGCTT 


TAAATATTnC 


CAATTGAAGA 


TTTTCATTTA 


13 80 




ATTAAAGAAA 


TTATTTCGAA 


AAAAGAATTa 


GAAGGCTTTA 


ATATCACAAT 


TCCTCATAAA 


l d a n 


30 


GAACGTATCA 


TACCGTATTT 


AGATTATGTT 


GATGAACAAG 


CGATTAATGC 


AGGTGCAGTT 


1500 




AACACTGTTT 


t G AT AAAAG A 


TGG CAAGTGG 


ATAGGGTATA 


ATACAGATGG 


TATTGGTTAT 


1560 




GTTAAAGGAT 


TGCACAGCGT 


TTAnCCAGAT 


TTAGAAAATG 


CATACATTTT 


AATTTTGGGC 


1620 


35 


GCAGGT 












1626 



(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 635 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:

AGGGTTAATT GTCGGTTTAA TTGCAATGAA TAAGTTCCAT GTATTAGCTG GCTATAGAGC 60
GAAATTCATC TTAATGGTGA TTTTAACTAT GATGGTCTTC GTACTTATTA ATACGTATTT 120
ACTAAGACAG GTAAAATCTA TCGGTATGTT CTTAATGATT GCTGCATTGG GTCTATACTT 180



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s 



GTCTTATATC GATAACATGT 


*^"**"» X A TTTAAATGCA 


GAGCATCCTA 


TAGGCTTGGT 


300 


GCTAGTAATA TTAACAGTAC TTGTGATTAT TGGCTTTGTA CTGAACATGT TTATAAAACA 360
CTTTAAGAAA GAGAGATTAA TCTAATGTTG ATGAATAGCG TGATTGCTTT AACTTTTTTA 420
ACAGCATCTA GCAATAATGG CGGACTTAAT ATTGATGTGC AACAAGAAGA GGAAAAGCGA 480
ATCAATAATG ATTTAAATCA ATATGATACA ACGCTATTTA ATAAAGACAG CAAAGCGGTT 540
AATGATGCGA TTGCTAAGCA GAAAAAAGAA CGACAACAAC AAATAAAAAA TGATATGTTT 600
CAAAATCAAG CGAGTCACTC GACTCGCTTG AATGA 635


(2) INFORMATION FOR SEQ ID NO: 195: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:



CTGAAATGGG TATTATTTGT CTTCTTCATC ATAAAGTAAT AAAGATTGTT CATCATTGCG 60
ACGTTGCCAA TTTTCATTTG GCGATCTTGG AACGTATAAT AATTGCCTAG TAGCATACGC 120
TTTGATTGAA ACATACAAGT CATTCCTTGA ACTTGCACCT CAATTTCCAC ATTTGAATTT 180
TCTTTTATAT TAATAATTTC ATCCAAATTC AGCTCACGTG CTAAGACAGC TCTTGATGCG 240
CCTCTTTTAC CCCAGTAATT ACATTGAAAA TGATTAGTTA CTAACGTCTC TGCATTCCAA 300
TGAAGTGGTA TTGGATTTTC TTGCGCCTTC ACATACATTA CTACTGCTGG ATCCCCGAAA 360
ATAATTCTGT CAACTCGTAT TTCATGTAAA AAATTAATAT AATCTTCTAC AGCATCTAAA 420
tgatSaattat GAAATAATCC ATTCACTGCC GCATATACTT TTTTATCGTT TTTGTGAGCT 480
AATGCGACAG CCTCTGTCAT TTGTTGTCTA TTGAATTCCC CTGGAAGTCT TAAACCAAAC 540
TTTTGCTCGC CAATTACAAA AGCATCTGCA CCTAAATCAA TAAGTGTTTC CATATGGCTT 600
AATGACTTGG GTGTGACAAG TAATTCTGTC ATAGTCATTC TCCTTTAATT GAAATCGCTA 660
ATCCATCGTC TATATTTAAA AAATTCGTTG TATATCCTGG TTGCTTTATT AACCACTCAT 720
TATAATCTTG AACCTTTTTA ACCATTTGTC TTACATTTCT CGATCTAACA ATCCCAATAT 780
CCGATACAAA ACCGTGATAT AAAACATTAT CTGTAATTAC GAGACCTTGG TGCTTTAAAA 840
GTGGTGTATA TATTTCAAAA AATTTCTTTG ATTGCGCTTT TGCTGCATCA ATAAATATCA 900
TATCATAAAC TTTGTCATTT ACATTTTCAA ATTGCTCTAA AGCATTACCT TCAATAATTC 960
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CATTACGCTC TATCGTTGTG ACATGAATGT CATCAGATAT AGAAGCGAAT TGCATAGAAC 1080 

TATAGCCGAT TGCTGTACCA ATTTCTAAAA TATTTTTAAC ATTATTCATA CGAATTAATT 1140 

GCTTAATTAA ATCTAATGTT AAACGATCTA CAATTGGCAC TTCATTTACC TCGGCAAATT 1200 

CACGCAAAAC TTCGATTGAA CTATTTTGAT GTTGATGTAA ATCTATTAAA TATTTTTTAT 1260 

TTAGGTCATC CATGTTTTAA ACTTCCTTTA TGTAAAATAA GTCAATATGA TTATGACAAT 1320 

AAAATAAATC AGCCTTCACA ATTGATTATA ATTTTGCCAA CCAATTAAAT GACTGATTTC 1380 

GTGTTAGACG CAAAGCTATT TTATTTATAG AAGCGAATCA TTCATATAAA ATTTAACTTT 1440

AGATATTTTA CCATATTTTC AATAAAATTA TAAGCGTTAA TTATTTATAC ATTGCTTGAC 1500

TTAAAAAATA CTCTTGCCTC CCCATCTTTA AGGTTAGCAA GAGTAAAATC TTTTTAATTA 1560

TTCTTCCATT TCAGTATTTA CAACTTCTTC AATCATGTCC CATTCTTCAT CAGTTTCGAT 1620

TGGTACTAAC TTACCACCGT CACCTGACTC ATCTGGTTCA TTGATCATTG GTACAAGCTC 1680

AATCATATCG TCTTCATCTG ATTGAGCACC TTCTTCAGCT AAGATAACAT ACTCTTTTTT 1740 

GAATTCAGGA TGATAAAATT CTAAAACTTT TCGGTATAAA ACTTCATTTC CCTCTTCATC 1800 

GAATAAAGTT AATAATTCTT CTTCGTTATT AATTTCTAGT TGTGAATCAT GATTATGTTC 1860 

AGTCATAGTA AAATCTCCTT TTAATGTAGT GAATCTAAAT AGCCTTGTAA AATAAATACC 1920 

GCTGCCATTT TATCAATCAC TTGTTTTCTT TTTTGTCTTG AAACATCTGC TTCTAATAAT 1980 

GATCGTTCAG CAGCGATTGT GCTTAATCTT TCATCCCACA TCACAATCTC AATAGAAGGA 2040

TAAGCTTCTA ATAATTTTTC TTTATATGTT AACGAAGCTT CGCCTCGAAA TCCTATTGAA 2100 

TTATTCATGT TTTTAGGTAG TCCTATTACG ACTGTACCCA CATTATGTTT TTTAATAATG 2160 

TCTACTAATT GGTCAATACC TAATTCATTA TTTTCTTCAT TGATTCGGAG TGTGTCTAAT 2220

CCTTSTGCCG TCCAACCCAT TATATCACTA ATTGCAATTC CTACCGTTCT ACTACCGACA 2280 

TCGAGTCCTA AAATTTTATG TTGTAACATA AATTATTTAT TTTGCTCTTT TAAATAGTAA 2340 

GAAACAAGCT CTTCCATAAT AACATCTCTA TCAATATGAC GAATTTGATT TCTTGCTTCA 2400

TTTTGGCGTG GAATATACGC AGGGTCACCT GATAATAAAT AACCTACAAT TTGGTTTACG 2460 
GCATTATATC CTCGTTCATC TAATGTTCGA TAAACATTAT TTAAAACATC TCTTACATCT 2520 
TGCGTTGGAA GTTCTTCATA GTCGAATTTC ATTGTTTTAT CAAAGTTTTC CATTTGCGAC 2580 
ACTCCTTTAA TTACAAATAT AACTCACTAT CATCATACAA TATTATGGCT TTAAATTATA 2640 
GATTTTTAAT GTAATCTTTA ATAAAGCTTA ATGATTTTGA GATATTTTCA GGTTGTGTAC 2700 
CGCCACCTTG AGCCATATCT GGACGACCGC CACCTTTACC ACCAACGATT GGTGCCATTT 2760 



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TCGATACTTT ATCATCAACA TTACTTGCAA GAATGATAAT TGTATCTTGT AGTTTAGATT 
TAAAATCGTC CATTGTCGAG CGAATTGCTT TCGCATTTGG TACATCCACT TCAGTAACCA
ATACTTTATA GCCATTGATT TCTTCAACTT GATCTTCAAT ATTACCCATT TTAAGTGATG 
TGATTTCTTT GTCACGTTGC TCTAATTGTT TTAATAATGC TTTTTCTTCA TCTTGTAATT
GTGTTAACTT ATCGACTACT TGATCATCAG ATTTCACTTT CAGCTGTGAT TTCATCGTAT 
TAAATTTCTC TTGAATATCT TCTAAATATA AGAAAGCTGC TTTACCTGTT AATGCTTCAA 
TACGACGCAC ACCAGCTCCT GTACCTGACT CACTTACTAT TTTGAATAAG CCAATTTCAG 
AAGTATTGCG GACATGAATA CCACCACATA ATTCAATTGA AAATGGTGCC ATATTTACTA 
CACGCACAAC ATCACCATAT TTTTCACCGA ATAATGCCAT TGCGCCCATT TCTTTAGcTG
AAGCAATATC CATTTCTTGA ATGTTAACGT CAATACCTTT CCAAATTTCT TCATTTACTA
AGCGTTCAAC TTGATCAATT TCATCATTAG TCATTGGACC AAAATGAGAG AAATCAAAAC

GTAAACGATC TGCTTCTACT AGTGAACCAG CTTGGTTAAC ATGATCACCC AGTACTGATT 
TCAACGCTGC ATGTAATAAA TGTGTTGCAC TATGGTTCTT TTGAATGTCA CGTCGATCAT 
TTTGGTTCAC TTCAGCAGAC ACTGTAGCGC CAACATTTAC TTGGCCAAAT TGTACTACTC

CTTTATGCAA GTTTTGACCA TTTGGTGCTT TGGTTACTTC ACTAACAGCA ATTTCAAAAT
TGTCATTATA AACAATACCT GTATCCGCAA CTTGTCCACC ACTGATTGCA TAAAATGGTG
TTTCCGTTAA CATGAAGTAT ACTGTTTCAC CCGCTTCAAC TTGTGAAACT TCTTCACCAT

TGTATATCAA GTGTGTTAGT GTTGTTTGAG ctGTCGCAGT ATCATAACCA ACAAAAGTAC 
TTGCAGATGT AATATTTTTC AATACTTCAC TTTGAACTTG CATTGATTGA GAATTTTGAC 
GTGCTTGACG TGCACGATCA CGTTGTTGTT GCATTTCTGA CTCGAATGTT GTCATATCAA
f^^T" TGCTTGCACT GCTATTTCTT CAGTTAATTC AATTGGGAAC CCATAGGTAT
CATACAATTT AAATGCATCT TTCCCATTAA TTTCATTTGT TGTCGCTTTA GCTTTTTTAA 
TTAATTCATT TAAAATCGCT AAACCATCTT CTAATGTTTC ATGGAATCGT TCTTCTTCAG 
ACTTTATAAC ACGCTTAATG AAATCTGCTT TTTCCTTAAC ATTTGGATAA TATGGTTCCA 
TAATGTCTGC AACAATATCA ACAAGTTTGT ACATAAATGG CTCATTGATT CCTAACGTTT

GACTAAAACG AACGGCACGA CGTAACAATC GACGTAATAC ATACCCTCTA CCTTCATTGG
CAGGTAATGC ACCATCAGAA ATTGCAAATG CAATCGTACG AATGTGGTCA GCAATTACTT
TAAATGCCAC ATCTTGTTCG TTGTTTACTA AATATTGTTT ACCTGATACT TTTTCGATTT

CATTCATTAT AGGCATAAAT AAATCTGTTT CATAGTTAGT ACGTACATTT TGAGAAACTG



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TATGATCTTT 


ATTATGATTG 


AATTCACTAA 


ATACTAAGTT 


C CAT ACTTCA 


Aun 1 AVjVAj 1 1 


4680 




CATTTTCTCC 


ACCTGGATAC 


ATTTCTTCTG 


CCGGATCGTC 


Tib lA-CA 1 A 1 




474 0 


5 


GATCATAGAA 


AATCTCAGTG 


TTCGGTCCTG 


AAGGCCCTTC 


ACCAAT ATC C 


vJAv^AALil 


4 800 




CTTCAATGCG 


AATAATACGA 


CTTTCTTCAA 


GCCCAATATC 


11 1 A rGC LAA 


A T^TT^Tl 

A lu i io Irilvj 


4 860 




CTTCCATATC 


TTCCGGATGA 


ATCGTAACGT 


ACAATTTATC 


TGG C1CCATA 


LLLAlLUil 1 


4 92 0 


10 


TATCACTCGT 


TAAAAATTCC 


CAAGCAAATT 


CAATCGCTTC 


TTGTTTAAAA 


1 AA 1 LAL LHA 


4 9oO 




TTGAGAAGTT 


ACCTAACATT 


TCAAAGAATG 


TATGGTGACG 


CG CTGTG AAA 


LLAALAl 111 


c a /i a 


15 


CAATATCATT 


TGTACGAATA 


GCJT1TTGAG 


AGTTTACAAT 


TCTTGGCTTT 


11 AGO 1\j ilt 


c n a A 


CACGTCCATC 


AAAATATTTC 


TTTAATGTTG 


CTACACCTGA 


ATT AAT C CAT 


7V A T*A A TV""!"* A T 1 

AATAATGT A 1 


c i c a 
blot) 




CATCATCAAT 


TGGCACTAAT 


GGTGCAGAAG 


GTTCAACCAT 


ATGTCcttxt 


•Y*/"^ 7\ » ^» A A A ^ A 

TCAACAAACjA 




20 


AATCTAGATA 


TTTTTGTCTA 


ATTTCACTCG 


CTTTTAACTT 


TTTCATCATT 


«T> n /-» TV /"A , "P /"""/"*• I* 

TACACATll 1 


C O O A 


ATTTACTGTT TTTAAATTAC CATTCCATAA AAATTGATGA CACAGATAGT CGATTTGcaa 5340
AACTAGTATA AATCAATATC aitttttatt ATTAAAAAAT AAAAAACGCC CATCCTCAAA 5400
AGGGACGAAC GTTATCGCGG TACCACCCTA GTTATAAATG CAATTCAACA CATTTATCAC 5460
TTTAATTCGA CTATACAGTT GTGCATAAAG TAGCGTTCAC taatgtttgt tGTACTTTTC 5520
ACCAACCAGT ACATCTCTGA TAAACAAATC aTTAACTACT CATCTTTATA CGAATTTAAT 5580


30 


TCTATTTTAG 


TTACATTTAC 


GCTTGTTGTC 


AACGTTCTAT 


AAAGTCATAC 


GGCGTGATTT 


bb4U 




CTCCCATATT 


AATCATTGGG 


TCAATTTTAA 


ACATTGTAGC 


ttccgttaat 


ACATTTGTAT 


C 1 A A 

b /uu 




CTGTTTTTGT 


TGAATCAGAC 


ATAACTTCTT 


CACTATCATT 


CGATGACATT 


VjVjCVjC 1 ILIA 


j / o u 


35 


CTTGATCATC 


TATTGTCGTT 


TGTGAAGCTC 


CTGTATCATT AgTTG CTGTG 


ill 1 LLAyLA 


cpon 




tttcStcatc TTCTGAATTA aaataatttt 


TCAACAATGT 


ACATAATTGT 


G 1 1 AAALAjCvj 


cQon 
boou 




CTTGACCATT 


TGTTTTCAAT 


CCAATATCAA 


ATGCTTCCGG 


ATCACCAAGT 


JV TV IV TV /T TV TV TV f+ 

AAAAL 1AAAL 


J 31 V 


40 


TCGTTTTCGC 


TCTAGTTAAA 


CCAGTATATA 


ATATCGGTCT 


TTGTAACATT 


r^rp » jv TV TV *T» JV fT 

C 1 AAAA1 AC 1 






GTTTAACAAT 


AGGCATGATA 


ACAATAGGAA 


ATTCTGAACC 


TTGTGATTTA 


TGVjAI 1 vjAIO 


O \J O V 


45 


TACAATAAGC 


ATGTGTTAAT 


TCCATCATAT 


CTTGTTTCGT 


AAATGTAATT 


T CATTAC CTT 


OX^v 


CAAAATCCAC AACAAGTACA TCTTTATTAA GGGCATTTTC TTTCGCCCAA AAAATACCAA 6180
CAATAACTCC TATGTCACCA TTGAATATGT TATCATTTGG CCTATTAACA agttgtaata 6240
CTTTGTCACC TTTTCTAAAG ACTACATCAC CAAACTCAAT TTCTCGTGTG TCTTTCTTTT 6300
TAGGGTTTAA AATATCTTGT AAAACTTGAT TTAAACGTTT AATACCGGCA TTTCCTTTAT 6360
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CTACCTTCTC 
AACAACTGTT GGTATTTGGT TTGCCTGACA GTTAATAAAA CTTCTATCAT 6480
GAAAACGCTG TGTAATATCA ATTTTCTGAC CCAACTTCAT TCGATGTGCT AATTCTATAA 6540
TGCTTGAACC ATCTTGTTGA CGATATACTT CAGTCAGATT TACTCGTGGT ATAGCTTTCG 6600
ATTCAATTAA ATCTTTAAAT ACTTGACCAG GACCTACAGA AGGCAATTGG TCCTCATCAC 6660
CTACAAATAT CAATTGTGCA TCTAAAGGAA CTGCACTTAA AAATTGGTGG AACAACCAAG 6720
TATCTACCAT AGACATCTCA TCAATGATTA TGAGTCGTGC GTTTATTTCA TTTTCTAATA 6780
TATCCTCTGG CTTTGTGTCT TGATTCCAAC CTATTAAACG ATGAATCGTC ATTGCTTCTA 6840
ATCCAGTTGA CTCTTGTAGT CTCTTAGACG CTCTTCCTGT TGGCGCTGCT AATACAACTG 6900
GATAATCATC ATTGACATAA TCATCATAAT CTAATGATAA GCCATGAATC TCAGCATATA 6960
ATTCAACAAT ACCTTTAATT ACTGTCGTTT TTCCTGTTCC CGGTCCACCG GTTAATAGCA 7020
TCACCTTAGA ATTGATAGCC GTTTGCAAAG CTTCTTTTTG TGAAGCTGCA TAGTTCACTT 7080
GATTCGCATC TTCTATTTCA CCAATATGCA TTTGTAAATC TGACTGTTCA ATTTCTGTAA 7140
GTTTATTTGT ATGCGTCTTT ATTCTGAATA AGTTTTGAAC ACTTTTGATT TCaGAATAAT 7200
ACAAACTTGG AATTGCAACT TGTTCaTTGT CAATAATTAG TCGTTTTTCC TCATTTAAGT 7260
ATTGCAACAT TTCGTCTAAT TTTTCAGGTT CGATGACCTC TTCATCTTGa TAATTTAATA 7320
CATCAACCGT TAAATCTATA ACAACATTGA TAGGCAAATA TGTATGTCCC TGTTTAATAC 7380
ATTCTTCTTC TAACGTATAG AGCAACGCAG CTTTTAATCG TTCATTATCG TTATAAGCGA 7440
TACCAATATT TCTAGCAAGT TGATCTGCTT TATTAAAACC AATACCTTTA ATATCATAAA 7500
TCAATTGATA TGGATTTCGA TCTAAAATAG TCAGTGTATC GCCGAGATAA AACTGATAAA 7560
TTGCCATTGA AAGTTTAGGA CCAAACCCTA AATCATGTAA ACGAATCATT ATTTTTTCAG 7620
ATTdTGATT TGCTGAAATT TGTTCTGCAA TTTGTTTCTG TTTCTTTTTA GATAATCCCG 7680
AAACTTTTTC TAGCACTGAA TGGTCATCTA ATATATCATT TATCGCATTG TCACCTAATG 7740
TATTAACAAT ATTTTGAGCT GTCTTTTTAC CTACACCTTT AAACAAATCA CTAGATAAAT 7800
AACTTATAAT TGCTTCTTTC GTTTGTGGCA TTTCTTTTTC AAAAGTCTCT GCTTTTAATT 7860
GTTTACCATA ACGTGGATGA TCAACAACTT GCCCTTTAAA TGTGTAGACA TCGCCTTCAA 7920
CAATATTCGG AAGAAACCCT ACAACAGTTG GCATTGTATC AAAGTCTTCA TTTGTTTCAA 7980
TAGTATCTAC TTTAAGCACT GTATAAAAAT TATCACTGTT TTGAAACAAT ATCGCTTCAA 8040
CAGTACCTTT GATCATTGAA TAATCAAATA GTGTAGGGTC TGACATGTTA CTCCTCCTCT 8100
TTCATTTTAG TGAATGTTTT CAGCGCATGC TGACTTAATA AGTGTTTAGG GTCGATAGTC 8160
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AAGCCCAAAT 

TGTATCTTGC ATCAACATGA TTTTTATCAA TCGTTAATAC ATGTTTAAGT 8280
TGAGTTATGG CTTCATTAAA CATTTCTAAT TGACATAATA CAAGACCATA TTGAAATTGA 8340
ACTTCTGCAT CTTTGTCTTT ATCTAGTTCC GCAGCAGTCA TTAAATACGG CAATGCCAAG 8400
CTTAAATGAT TCTAACTGAT TAAACGCCAT ACCGATCATA TAATTACAAT CAACTTGTTC 8460
AATCTCTGTT TGTAATGCTT GTTGATATAA TTTAATAGCT TCTTGATAAC GTTGCTGATT 8520
ATAATATACA TTTGCTAGAT TAAAAAATAC GACGCCATTC TTCGGATCTA TTGTnAAAGC 8580
TTTTTGGAAA AAACGCTCTG CCTTTyCAAy CyCATTCgCA TCAGCAAGTA CGATmCCaGC 8640
ATTAATATAA TTTTCAATAA TTGTAGGATT TTCTTCGATA TTTCCGAACA ATGCTTGTAA 8700
CGCTTCTTCT ATTTTTCCAT TTTGTATGTA TTGATAAATT GTTTGTTGAT CTATCATTTA 8760
CGAACCTCAT TTCTCATCAA TTATAACATC TTGATAAATT GTATGTCTCG AATCACTTAA 8820
CAACGAATAA AATATAATCT AATATCATCT TCATTCATGA AAAAGCGGGA ATGGAATAGA 8880
AATGCTTAAG AACCATTAAC GGTTTATTAT GTAATGGTTC TTCCACATTA GCCACCACTA 8940
TTATGTACTT AAAAATAAGA ATACATAATT AGATTCATGC ATAGGGAGTG GGACAGAAAT 9000
GATATTTTAA CAAAATTAAA TTCGTTATCC CCAACTGGCA TTGCCTGTAG AATTTCTTTA 9060
CGAAATTCTC TATGTTGTGG TCCCGCCAAT ATAACATTGT AGAGCCTAGG ACATTGTGAT 9120
GTCCCAGACT CTATCCTCAT GAATTATTCT CATCAAAAAC TGTCTTTCGT CATTTTCAAC 9180
GTTGAAACTT CAAATAAGTA ATTTATTGTT GCCATTGTTT ATACAACATA ATTTAATTGA 9240
CCTTCATTTT TGAACACATC GTCAATTGTT GCACCACCAA GACACACATC ACCTTGATAA 9300
AAAACAACTG CTTGTCCAGG TGTGATTGCT CTTACTGGCT CAGCAAAAGT AACACGTAGg 9360
CAtGGtCGTT TTCACGTTTC ACAAAAACTT TCGTATCTTT TTGGCGATAT CTAAATTTAG 9420
CTGtACATTC AAAACCTTGA TCTAAGTCAT TATCTTCTGG ATTTACAAAT GAATAGTCTG 9480
AAGCAATTAA GTAATCACTG TATAATGCAT CGTGATGGAA TCCTTGTTCT ACATATAAAA 9540
CATTATCTTT TAGGTTTTTA CCGACAACAA ACCAAGGATC GCCATCTCCA CCTATACCTA 9600
ATCCATGTCT TTGTCCTATT GTGTAATACA TCAAACCACT ATGTTTACCC ATTTTCTTAC 9660
CATCAAGTGT TATCATATCA CCCGGTTGTG CAGGTAAATA TTGTGATAAA AATGTTTTAA 9720
AGTTTTTTTC GCCGATAAAA CAAATGCCTG TAGAATCTTT TTTCTTAGCA GTAACAAGTC 9780
CTTGTTCTTC AGCAATTCGA CGCACTTCAC TCTTTTCGAT GTCGCCAATT GGGaACATCA 9840
CTTTTGAAAG TTGTTGTTGA GATAATTGAT TCAAGAAGTA TGTTTGATCT TTATTATTAT 9900
CTACACCACG TAACATTTCA ACATGACCAT CTTCATGACG ATGTATGCGT GCGTAATGTC 9960
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inv-iHAli ACACATAACG 


TCTGGATTTG 


\ GAGTACGACC 


: TTTTTTGTAT 


TCATCTAAGA 


10060 




A AT* A fY2*P A A A f 1 R /"t^tt* iv 

*W A rtA— vj Lf\HLt\ LxAL. 1 1 IATCC 


CAATATTCTT 


' TTTCAAAATT 


AACAGCGTAA 


TACGGAATGC 


10140 


5 


TA BTTTYIR *P* I * TAfft /tw»^»*iv^tv 

v^n l A AvjA i 1 ACACACTTCA 


ATAACATCGT 


TGTAATCTTC 


AGTTGCAGTA 


CATACGCCAT 


10200 




XXI X Uiu I L» 1 L-A 1 CCCAG 


1 1 x x 1 CATAA 


ATATGCCAAT 


GACATCATAA 


CCTTGTTCTT 


10260 




X X r%J\K»x\\^\j l ij LtLjL. WITACA 


GAACTATCTA 


CACCGCCTGA 


CATACCAA CG 


ACAACACGTA 


10320 


10 


TATTTTTATT TYI apa RTT 1 Pv *t» 
lAll.1 llnii XVaALAAl 1AI 


(jAC TCCTCCT 


TAAATTTAAA 


ATATATTTTA 


TGAATTTCAG 


10380 




l_ X ALA/i X X LrL- A I l AA l 1 I CA 


TTTT CAG TAG 


TCAATTCGTT 


AAAACTAAAT 


CGAATCGAAT 


10440 


IS 


BTTHY i 7A T^O/^ PTPr*?/' tv t^/~»t* 

uAiixviAiLij LlUUxCATCT 


TCGAACATTG 


CATCTAAAAC 


ATGCGACGGT 


TGTGTAGAGC 


10500 


>-> X s»L X\j X AuA i oCALiATCCA 


GACGACACAT 


AGATTTGTGC 


CATATCCAAC 


AATGTTAACA 


10560 




X LV» 1X1 LAAL X I CAACAAAC 


GGAAAATATA 


GATTTACAAT 


ATGGCCTGTA 


GCATCCGTCA 


10620 


20 


X Xlv>u\LL-AX 1 iAAl I LAAA i 


GGAATCGCTC 


TTTCTTGTAA 


TTTAACTAAA 


AATTGTTCTT 


10680 




TTlhhTTPJkT *T* t\ ta t\ TV"' fv tv «r* tv 

I 1AAA1 ILftl TAAATGAATA 


TTGTTATCGT 


CTCGATTCTT 


TTCTGCTAAT 


TGTAATGCTT 


10740 




X ALtCCATCCC AACAATTTGC 


GCAAGATTTT 


CAGTGCCTGC 


ACGGCGTTTC 


AATTCTTGTT 


10800 


25 


LALLULLAAb TTGAGGATAA 


TCTAGTGTAA 


CATGGTCTTT 


AACTAGTAAT 


GCACCGACAC 


10860 




Ll i 1 1 vjL» xCC Ci L CAAACTT A 


TGAGCAGTAA 


TACTCATTGC 


GTCGATCTCA 


AATTCGTCAA 


10920 




ALi taacatl aagatgtcca 


ATTGCTTGAA 


CCGCATCAAC 


ATGGAAATAT 


GCATTTGTCT 


10980 


30 


TV TV IX TV TA T^ TV m/vm<n/i w n rrk 

L^LtLAAI AAT ATCTTGAATA 


TCATAAATTT 


GTtGCACTGT 


GCCAaCTTCA 


TTATTTACAA 


11040 




ALAirdiAQja tactaaaatc 


GTCTTATCTG 


tAATTGTTTC 


TTCAAGTTGA 


TCTAAATCAA 


11100 




Af"2/^*7A /" V™** 1 •/"""■p TV TV^TV T 1 /*"' TV TV /"» TV 

iAuw\LL 1 Vj I A I LATCAACA 


TCTAGATATG 


TTACATCAAA 


ACCTTCTCGC 


TCTAATTGTT 


11160 


35 


f A A A A fAPRTV *Tt\ 7\/"*7\/""»t\/"»t\ tv 
LMMMAALA X\» I AACACAG AA 


TGATGTTCAA 


TCTTCGATGT 


GATAATGTGA 


TTACCCAATT 


11220 




f TJ'l 'I'M" |»/^/^ '1 H J H ■ •• I ■ TV / TV TV* 

— ul lLAI i l\jL -I x-X X-AL. X-ATG- 


CCTTTAATTG 


GCGTATTATT 


CGATTCTGTT 


GCGCCACTCG 


11280 


40 


TAAATATAAT TTCATGTGTA 


TCTGCACCAA 


GTAATTGTGC 


AATTTGACX5T 


CTTGACTCAT 


11340 


V.lAIUilAI X X ALTjLLjLZA 1 CT 


CTTCCCTTAG 


CATGTATTGA 


TGATGGATTA 


CCATAATG CG 


11400 




A/iX Xvj X AAA 1 C LixCA 1 CATC 


G C7\TCTACT A 


CTTCAGGTTT 


TACTGGTGTG 


GTCGCAGCAT 


11460 


45 


AA XL, XVjL-A 1 A AAX 1 ICCATG 


TTTGGACACT 
CTACTTTTCT 


CCTCACAATT 


TT AT CAATGT 


TCCAATAATA 


11520 




GCACCTTACA TACTATTTTT 


GTTTAACTTT 


ATTTATAATG 


TTTTTAATTA 


11580 




TATTTTACCA TTTTCTACAC ATGCTTTTCG ATAGGCTTTT TTAAGTTTAT CGCTTTATTC 11640
TTGTCTTTTT TATAAATTTT AGTATTTGCA GATAITTTTT TATTTGTAAA ATGTAACGTA 11700
CTATTATTTT GGTTATGAGC AATTTAATAT TTATCTGGTT ATTCGATTGG TATACTTCTT 11760
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ulLLLlnl 


GAGAAGGTGA 


AGATGAACAA 


ACAGCAATTA 


ATAATATGGT 


TAATCTCGCA 


11880 




f A APATTTAG 


A PG AATTATC 


ATATGAAAGA 


TATTGGATTG 


CTGAACACCA 


TAACGCTCCC 


11940 


f 


AAV-V. 1 AL7 1 /vrt 


GTTPAGCAAP 


TGCTTTATTA 

Ivvl X X *V A xn 


ATTCAACATA 


CGTTAGAACA 


TACGAAACAC 


12000 




M 1 ALU lu 1 /\Vj 




CATCATGTTA 


CCTAATCATG 


CTCCATTAAT 


CGTTGCGGAA 


12060 




LAn ill 0\2\_n 


pg atgg pa ap 


ATTATTTCCA 


AATCGTGTCG 


ATTTAGGATT 


AGGACGTGCA 


12120 


10 




A T ATG ATGA P 


CGeAAGTGCA 


TTAAGACGAG 


ATCAACATGA 


TGGTGTTTAT 


12180 




n K ATTTPPAG 
AAA! 1 ILLnu 


A AGAGGTTTP 


ATTATTACAA 

X X X X AWV% 


CAATATTTCG 


GCCCTGCTCA 


CCAACAAGCA 


12240 




T & THTT PGTG 
lnlul lUwlu 


PTT ATPP AG C 

Ll 1 /\X LwVJv 


AGTAGGTAAA 


AATGTGC CTT 


TATACATTCT 


TGGTTCTTCA 


12300 


15 


& p a g & ttp*pg 


PAPATTTAGP 


TGCTCGCAAA 


GGGCTTCCAT 


ATGTGTTCGC 


TGGACATTTT 


12360 




or* a pptp a a p 


AAATGAAAGA 


AGCTATCGAA 


ATTTACAAAA 


CGTTATTTGA 


ACCTTCTGAT 


12420 


20 


LilAi IAwAHj 


HALL 1 XAlul 


TUTTYIT A TGT 
x t\ x x vj x ax vj x 


TTAAATACAA 

X X ****** X *»^-****v 


TCGTTGCTGA 


AAATGATGAC 


12480 




All iKvjv. 1 1L 


ATPTATGGCA 

A 1L1 £\ X \JVJw\ 


CAAGTAATGG 


TTAGTATCAC 


TCGTGGCAGA 


12540 




A 1\K-AL*v,Cv\aj 


'IT'P A A CCCIC^C 


AAPAPATGAA 


CTACAAAATA 


TATTAACGCC 


GAGAGAATAC 


12600 


25 


(jLXjA xLtVjC 1 A 


X \jKjt\f\r\\Jt\\ — t\ 


GAAAATATPA 


TTAATAGGTT 

X X X OWW X ^ 


CAGAAAATAC 


TGTTCAACAA 


12660 




AAAA1 1 L-AALj 


Al 1 1 ihlouA 


A APTTATGGT 
nnL X X /"» X vJsJ x 


GAAGTCAACG 


AAATT ATGG C 


AATAAGTTAT 


12720 




ATTTATGATA AAGATATGCA ATTAGACTCT TATCGTCGGT TCAAGAATGT TATAAATCAG 12780
ATAAATGAAA AAAACACTTT ATAATGTGAT AAATAAACTA AGTGAAAGTA TGTATCCATA 12840
ATATTAATAA AAATATACAG TAACAGCATT TTGAATGAAA GATGTCTTTA TTGTTCAATC 12900
ATTTATTTTA GTAATGATTC AAATTCACTT AAAATyCTAA tGCAAATATG AAAGCGCCCC 12960
TTCAcTTTAC ACTGTGTAAG TGTTTATTTG ATGGGGCGCT TTCAAAATAT TGAAAAGCAT 13020




ATCCAaAATT 


TAAAGAAATT 


TATTTCTCTT 


TATCTTCATT 




TCTTCGTTAT 


13080 




TCGATCCTGT ATATTCATTT ATCTTATCTT TTACATTTTT AACTTGTTCA TTATCGCTAT 13140
TTTTAAATTT TTCTACGCGT CTTTAGCTTT ATCCATAAAA CTCATATTAA TCGCTCCTCT 13200
TATATTTGAT TAGTTTAATT GAACTTATTT TTTAAGTTTA TCAATTGCAT CAGTTATTTT 13260
GTTTTTAGCA TTTTCAACAA CTTCTTTTGC TTTaCCAGTC GCTTTATCTT GCTGACCTTC 13320
TTTTTCTAAT TCTTTGTTAT CAGTAACGTT ACCTACTGTT TCTTTAACAT TTCCTTTAAA 13380
TTGATCGAAC TtACTTTCGT CTGCCATAGT GAAACCTCCT TGGATGTATA TATTTATATA 13440
CCACTAAGGA GGTTCGCTmm mCAyymyAAT ATGAAGTTTT TATGTTATAG TATAGTATTT 13500
ATACGATTAA ATATAAAACA TGTATCCGTC TAAATCTTCA CTTGTATCTA CATATTCCGC 13560
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TAGTTGTTTT TGCGCAGGTG GTTCTGATTC AATACTTTCA ACAAATGTAA TTGGACCTTC ' 13680 
TAACAGTCTT ATAATATCCC CTGCTGAGAT TTCTT 13715

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 873 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:








AAATCCATAA TGTCATGATA ATCTGCATAT GCTTCATATA ATTCAATCAT TGTGAATTCA 60
GGGTTATGTC TAGTTGATAC ACCTTCATTA CGGAATACTC TACCAATTTC ATATACTTTT 120
TCAAGTCCAC CGACAATTAm ACGTTTTAAA TGCAACyCAA TAGCAATACG CATGTATAaC 180
GTTGCATCTA ATGCATTATG ATGTGTTACA AATGGTCTAG CAGCTGCTCC ACCAGCAATT 240
TGGTGCATCA TAGGTGTTTC TACTTCCAAG AAACCTTTAT TATTTAAATA ATTACGCATT 300
TCTTGAATGA TTTTACTACG ATTAATAAAT GTACGAGTGC TATCTTCGTT CGTAATTAAA 360
TCTAAATATC TTTGACGATA tCTCTGTTCA ATATCCTGTA AACCGTGGAA TTTATCCGGT 420
AATGGTCGCA ATGATTTAGT TAGTAGCGTG AATTTCTTCG CTTTAACCGA TAATTCGCCA 480
GTATTTGTTT TGAACATTAC ACCTTCAACA CCAACGATAT CGCCTAAATC AGCATTTTTC 540
CATAAATCAA ATTCGTCATC GCCAACTTGA TCTTTACGAA CGTAAATTTG AATTTGTCCA 600
GCTAAGTCCT GAACGTGTGC AAATCCTGCT TTACCTTTAC CACGCTTAGT CATTAATCGT 660
CCAGCTATAG CGACATGACT ATCCGCTTCT TTTTCTACCA ATTCTTCTTT AGAATACTGG 720
TCC^CTCTT CTTTCAAATC ACTAGATAAA CCTGAACGGT CAAATTTAGA ACCAAACGGG 780
TCTATACCAA GATCATATAA TTCTTGTAAT TtTTGACGTC GAACCAACAT TTGGTCATTC 840
ATTTCTTCTG ACATAACTtT CTCTCCTTTA ACT 873




(2) INFORMATION FOR SEQ ID NO: 197: 







(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 452 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:
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AC CAT AATAT GAATGGCTTC AGGATCAAAA TAAAGACCAA CTTCACTGCC TACTTCAGCT 120 

TTTTTAGTCG TTTGTATTAC CCATTCATAA CCTTTATTGT CTATACAACA TATTTCATAG 180 

TGGACCCCTC TAAATAACAT AGAATCAACA GTTGCTTTAA ATAATCCTTC TTCAGCTTTG 240 

ATTAATGATA TATCTTCTGG TCGAATAACG ACTTCTACTT TTTTATTTTC AGGAATACCC 300 

ATATCGACAC ATTCGAAATC TTGCCCATAA ATATTCACGA CATAATCTCT AACCATGCGC 360 

CCTTCAACAA TATTAGATTC TCCAATAAAA TCAGCTACAA ATCGATTCAC TGGTTCGTCA 420

TaTATATCTG TTGGTGTGCC AAATTGTTGA AT 452

(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:
TAGGTTGGGT TCTAACATAC GATAAAGCTC AACAAATCAA CACAGCTTTC TTTGTAAAAT 60
TGTTTAATAC TGCATTAGCA GAACGTGATT ATTATTTTAA TATAGATGGA ACAAATGCTT 120
TTAGATTATT TAATGCTGAA GGTGATGGTG TTGGGGGATT AACAATCGAC AATTACGATG 180
GTCATTTGTT GATTCAATGG TACTCAAAAG GTATTTATAA ATTTAAATAT GCCATTCTTG 240
AAGCGGTTAG AAAAGTATTT GATTATAAAT CTATTTACGA AAAAGTAAGA TTTAAAGACA 300
GCGAATATAG TGGTGGTTTT GTTGAAGGAG ATGCACCTGa GTTTCCAATT GTTATCGAAG 360
AAAACTTCAC ATTTTATAAT GTAGACCTTG AAGATGGTTT GATGACAGGT ATCTTTTTAG 420
ATCAAAAAGA AGTGCGCAAG AaATTAAGGG ATCAATATGC CAAAGAACGC CATGTTTTAA 480
ACTTATTTAG TTATACAGGT GCTTTTTCTG CAATAGCAGC AAGTGAGGCA TCTTCAACAA 540
CAAGTGTAGA TTTGGCTAAT CGTTCTCGTA GTTTAACTGA AGAAAATTTT GGATTAAATG 600
CTATTGATCC TAAATCCCAA TATATTTATG TCATGGACAC TTTTGATTTC TATAAATATG 660
CTGCACGACA TGGACATAGT TATGACACGA TCGTGATTGA TCCACCTAGC TTTGCGCGTA 720
ACAAAAAACG TACATTTTCA GTGCAAAAAG ATTATGACAA ATTAATTAAT GGCGCCTTAA 780
ATATCTTATC ATCTGAAGGA ACATTATTGT TATGTACAAA CGCAAGTGTA TATCCATTAA 840
AGCAATTTAA AAATACTATT AAAAAGACGC TTGAAGAGAG TGGCGTTGAT TATGAATTAA 900
CTGAAGTTAT GGGATTACCA AAAGATTTTA AAACGCATCC ACATTATAAG CCATCTAAAT 960
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TATTGAGAAA AAGAAGGGTG ATAATATTAT GGGATTCAAA AACAATTTAA CATCAAATTT 
AACAAATAAA ATCGGTAATT CAGTCTTTAA AATAGAAAAT GTTGACGGAA AAGGTGCAAT 
GCCAACGACG ATTCAAGAAT TGAGAGAAAG ACGACAACGT GCTGAAGCAA TTGTAAAGAG
AAAGTCTTTA ATGTCATCAA CAATGAGCGT TGTTCCAATT CCGGGTTTAG ATTTTGGTGT
TGATTTAAAA TTAATGAAAG ATATTATCGA AGATGTTAAT AAAATTTATG GTTTAGATCA
TAAGCAAGTT AATAGCCTTG GGGATGATGT GAAAGAAAGA ATTATGTCTG CAGCAGCAAT
TCAAGGTAGT CAATTTATTG GTAAAAGAAT TTCAAATGCA TTTTTAAAAA TTGTAATTAG
AGATGTAGCT AAACGTACTG CTGCAAAACa AACAAAATGG TTTCCTGTTG TAGGACAAGC
TGTGTCTGCA TCTATTAGTT ACTATTTTAT GAATAAAATT GGAAAAGATC ACATTCAAAA
ATGCGAAAAT GTTATTAAAA ATGTCATGTA GGTGCTATAA TAGTTTTGCA ATTTGCAAAT
TTTACTGAAA CCGGTTTTAA ACGAATTGAA TTTAAAGcAT GGTTTTGGTA AAGTTAATGT
ATAAAACTAA GTTAGyATTG TAATAATATk GAAGATTCTA ACTATACGAA GGAGAAATGT
AATTATGGAA CAAAATTCAT ATGTAATCAT CGACGAGAmT GGTATTCACG CTAGACCAGC
AACAATGTTA GTACAAACAG CTTCAAAATT CGATTCTGAT ATTCAATTAG AATATAACGG
TAAGAAAGTA AACTTAAAAT CAATCATGGG TGTTATGAGC CTTGGTGTTG GTAAAGATGC
TGAAATTACA ATTTATGCTG ACGGTAGTGA TGAATCTGAC GCCATTCAAG CAATCAGTGA
CGTCTTATCA AAAGAAGGAT TGACTAAATA ATCATGTCTA AATTAATTAA AGGTATTGCC
GCATCTGATG GTGTCGCAAT TGCTAAAGCT TATTTATTAG TTGAGCCAGA CTTAACATTC
GACAAAAATG AAAAAGTCAC TGATGTTGAA GGAGAAGTTG CAAAGTTCAA TAGCGCTATC
GAAGCTTCTA AAGTTGAGTT AACTAAAATT AGAAATAATG CAGAGGTTCA ACTAGGTGCT
GATAAAGCTG CTATOTTTGA TGCAcaTTGG GGGGTGGTAG ATGACCCTGA AftAATTCAA






1080
1140
1200
1260

CCAATCCAAG ATAAGATTAA AAATGAAA

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5559 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

AAGTAATAAA TCGTCTCATT TGGCAACTGA CGCATAATTT CTTTAGCTAC TGTCAAACCT 60
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15 






TTTAATTTTT AGTTTATCAT AACTAAGCAT TGGATTTTAG TATTATGCAC TGTGTTTACC 180
ATTTTTGTCA TTATAATATT TATTTTAAAT CAGCCCACTA TCATATTGTC ATGTAATCTG 240
CTTATTAAAA AAATCCCTTC CAAGTTATTG TGTATCTCCA TTCAATTTAA TTTTGAAAGG 300
AACATAACwT TTTAACTCAA AAGGGATTAA TTTnTAnTCT ACTTCATGGT CTGAACCAAA 360
GAATGATTTA AACATGTGGA ATGTTGTTTC TCTGTTCATT GCTGCAATGG ATGTTGTTAA 420
TGGAATACCT TTAGGGCAAG CATTAACACA GTTTTGTGAA TTACCACACT GCTGTAAGCC 480
ACCAGTACCC ATTAATGCAT TTAAACGTTC ATCTTTAGTC ATAGATCCTG TTGGGTGCAA 540
ATTAAACAAA CGAACTTGCG AGATTGCTTG TGCACCAACG AaTTTATTAT TTTCAGTAAC 600
ATTAGGACAA ACCTCTAAAC ATACACCACA TGTCATACAT TTAGATAATT CATAAGCTGT 660
TTGACGTTTT TTCTCTGGCA TACGTGGTCC CGGACCTAAA TCATACGTTC CATCAATTGG 720
GATCCATGCT TTCATACGTT TTAAGTTATC GAACATTCTA GAACGATCAA CTTGTAAGTC 780
ACGGATAACT GGGAAAGTAT TCATTGGCTC TAAACGAATA GGTTGTTCTA ATTGATCAAC 840
AATCGCAGAA CAAGATTGTC TTGCACGACC ATTGATAACC ATAGAACATG CTCCACATAC 900
TTCTTCTAAG CAGTTCATAT CCCAGACAAC AGGTGTTGTT TTTTCACCTT TAATATTAAC 960
TGGGTTACGT CTAATTTCCA TTAAACAAGC AATGACGTTT AAATTTTCAC GATATGGAAT 1020
TTCAAATGTT TCTTCATAAG GCTTAGAATC ACTTGTATCT TGTCGTTTAA TAATTAATTT 1080
TACTGTTTTT TGTTTCGGTT TAGATTGTGT TTCATGTTGT GGAGTGTTTT TCACTGATTG 1140
TTCAGTCATT ATTTTTTACC CCCTTTAGAC TTACTTGTGT AATCACGTTT ACGAGGTGGT 1200
ATTAAACTCA CATCGACGTC ATCATAAGTA AACTGCGGTT TTTCAAATGC GCCTTGGAAT 1260
GAGGCCATTG TCGTTTTTAA CCACTCTTCA TCATTACGCT CTGGGAATTC TGGTTTATAA 1320
TGGGCACCGC GTGATTCGTT ACGGTTATAT GCACCAATCG TAATAACACG TGCAAGTACT 1380
AACATGTTCC ATAGTTGACG GGTAAAGAAT ACCGCTTGGT TACTCCAAGT TTGAGTATCT 1440
TCCATATCAA TATCTTCATA ACGTTTCATC AATTCAACAA TCTTTTTATC TGTTTCTAAC 1500
AGTTTTTCAT TTTCACGAAC AACAGTTACA TTTGCTGTCA TAATTTCACC AAGTTCACGG 1560
TGTAATTTAT ATGCATTTTC TGTACCGCGC ATAGCTAATA ATTTATCAAA ACGTTCTTGC 1620
TCTTCAGCTT TACGCTTTTC AAAAATACTT TCGTCCATAT CAGTATATGA TCGATCAATA 1680
TTTGAAATAT AATCAATCGC GTTTGGACCT GCTACTGTAC CACCATAAAT CGCTGATAAC 1740
AATGAATTGG CACCTAAGCG GTTACCACCA TGTTGAGAGA AGTCACATTC TCCAGCTGCA 1800
AATAACCCTT TAATATTTGT CATTTGATCA TAATCTACAT ATAGACCACC CATTGAATAG 1860
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TAAATCTCAA TGATACCACC TAGTTTTACA TCTAACTCAT GTGGATCTTT ATGTGACAAA 
TCAAGATATA CCATGTTTTC GCCATTTATA CCTAATTTTT GGTTAATACA TACATCGAAA 2040
ATTTCACGCG TTGCGATATC ACGAGGTACT AAGTTACCAT AATCAGGATA TTTCTCTTCT 2100
AAGAAGTACC AAGGCTTACC ATCTTTATAT GTCCAAATTC GTCCACCTTC ACCACGTGCT 2160
GATTCACTCA TTAGTCGCAG TTTATCATCA CCAGGGATTG CAGTAGGATG AATTTGAATG 2220
AACTCACCAT TAGCATAAAT AGCGCCTTGT TGGTAAACAA TGGAAGCCGC TGATCCTGTA 2280
TTAATCATTG AGTTTGTTGT TTTACCGAAA ATAATACCAG GGCCACCCGT TGCCATAATA 2340
ACTGCATCTG AACCAAATGT TTCAATCTCA GCAGTTGTCA TATTTTGTGC AnCGATACCT 2400
CTTGCACTAT CATCGTCACC TTTAACTATG CCAAGGAATT CCCATCCTTC ATACTTCGTA 2460
ACTAATCCAT CTACTTCATA TGCACGAACT TGTTCATCCA ATGCATATAA TAATTGTTGT 2520
CCAGTTGTTG CCCCTGCATA TGCTGTTCTG TGATGTAATG TACCACCGAA ACGTCTAAAA 2580
TCTAATAGAC CTTCATTTGT TCTATTGAAC ATTACGCCCA TACGGTCTAA TAAATGAATA 2640
ATTTTAGGTG CTGCCTCTGT CATCGCTTTA ACAGGTGGTT GGTTTGCAAG GAAATCGCCA 2700
CCATACACTG TATCATCAAA GTGAATCCAA GGAGAATCGC CTTCCCCTTT AGTATTGACC 2760
GCACCATTAA TGCCACCTTG GGCACAAACA GAGTGCGAAC GCTTTACTGG TACAACTGAG 2820
AACAAATCTA CATGTGCACC TTTTTCTGCC GCTTTAATTG TTGACATTAA GCCCGCTAGG 2880
CCACCTCCGA CAACAATAAG ATGTTTCTCT GCCATAAAAA TTTCACTCCC CTAAATTTTC 2940
AATCTATATT TGTTAAATGC GATGTATTAC ATAAAGGCAA TAATTGCAGT AACACCAATA 3000
TACGAAATAA CTAAAAATAC GATTAATGAA ACCCATGTAA ATACTCGTTG TGATTTTGGA 3060
GATTGAAGTC CACCCCAAGT AACTAAGAAT GACCATAAGC CATTTGCAAA GTGGAACACA 3120
ACAGCAATAA TACAAATAAT ATAAAATATT GGCCATCCAG GATGTTGCAA TGTTTCGTGC 3180
ATTAAATCGT AATTCACTTC TTTGCCGTAA AATGCTTTTT GTAAAOGTGT TTGCCATAAA 3240
TGGATACCAA TAAAGATAAA TGTTAAGATA CCACTCACTC TTTGGAAGAA GAACATCCAG 3300
TTTCTAAAAA TCGAGTAATG TCCAACATTT TCTTTTGCTG TAAATGCAAT GTGTATACCA 3360
AACAAACCGT GATATAACAA CGGAATGTAT ATAAATAAAA ATTCTACAAT AATTAGAAAT 3420
GGTAATGATT CCATAAAGTT AGATGCCTTA TTAAACGCTT CAGCACCTTG TGTTGCTTGG 3480
TGATTCACTA ATAAATGAAC GACCAAAAAT GCACCTATTG GGATAATACC TAATAACGAG 3540
TGAATACGTC TTAGATAAAA TTCATTTTTT GATTGAGCCA AAAGGAGTCC CCCCTGTGAA 3600
CGAATATTTA ATTTATTGAG CTATTTATAT TAAACGTACG CTTAACCCCC TAAAGTGATA 3660
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PP TV TP A P P A A 


Ak_ I vjtJAX vj X v~ 


p a. & atvita 

bAAwvilvj In 


fifaTTTYYlfiT 
nLft t 1 1\jO/\X 


TCGATATTTA AAATTGCTTG 


3780 




itv TV <tvi AT 1 A A A 

TGATUAIAAA 


CTTTCTCATT 


tap a a a a ppp 




ATTCAAAAAA ATAACTTTGT 


3640 




*T*A A PP A T A Tr* 


v» 1 AAv_>\ 1 lAi 


TTP AT ATA TT* 
X X vnlAlnl X 




AGAATGATTC TCACGCCCAG 


3900 




^P ft ft ^P^P^P ft ^P^^^P 

TAA1 HA1 I X 


A V\3\--t\i\ X X\j 1 


TP A TVIT AfyST 

X Uv lul /WJVJ 1 


TPTTTfirf! Af* 
x v— x x iuv.unv. 


GTTTTCAGGA ATACCTATAT 


3960 




TTTTAAAATC 


*1TS* A A PTPT 7A 


Vji. X X >—V_ X X V— J-V 


XX X X V* X IVlnl 


TGAACCGAAT GAACGCAATA 


4020 


JO 


ATAATGTTTT 


Av_VjX X iul in 


p pp. a t & p p at 


V- X/\XnX V»nX v~- 


AAGTATTGAT TTCAAGCCTG 


4080 






Xvj X 1 lblLlA 


X VjxvAri X Vj X \jr\ 


TTC CCt A ATt^T 

X X VJwSJAA X w X 


GTGAACCTCA T CTTGG AT A C 


4140 




GGTGCAACAA 


At* A A A A TV* PP 


luuLinl XXI 


TPTTP a riTTirj 

lv.ll Vw-riw X vjvj 


TACAJVTTTCT GCACTAGCGC 


4200 


15 


CATATAATAA 


TTCAGATGTT 


XVtvj XVj X X X A X 


PTiTTTTTPTfJ 
Vnl 1 1 1 IvlVJ 


PAAACCTGCA ACAGGGATAT 


4260 




CAAGACCTAA 




AV?l~AV~A X Lhn 


T A A rPPPfSTT 
lnnLv<v>w X X 


CATATGTCCT TTACCACCAT 


4320 


20 


CT A CT ATT AT 


T A A ft. TP A PP*! 1 

X AAA. X v_ALrvj 1 


A BTriPTli atp 
AAX w\j XAAX V, 


PTTPriTTT A A 


AACGCGAGAA TATCGTCGTC 


4380 


TTACTACTTC 


TCTCATi OA X 


TTPTft RTVUT 

X X vj X AA X V— A 1 


V» XVjVj/W.V^ X X X 


AArn^TTTTG A TTTT AT ACT 


4440 




TT CT AT AATT 


TTTCTTATCT 


GGTTTACCGT 


PP. A P A A A *TY^T 


A A rrATTCfT GACACTGCiAT 

X^A\V»V>-'% X X VJV» X Vj*^\_— «^ v— x vjvjr\ A. 


4500 


25 


C CACAC CTTG 


TV TV T* TV TT*A P A TV 

AATATTAGAA 


XX AX vJviAA Xo 


X lUvil XV* X 


AATTGGTGTT TGAATTCCCA 


4560 




TTTGTGTTCC 


AAGTTCTTCA 


ft T*ft / 1 *^^^P ft ft 

AXAvjV^X X XAA 


Tpp/rTPTTiri a 


rTCATCACGT GATATTAATT 

V« x \>/v x VanwVJ x vj«» x n x x x x 


4620 




CAAATTTATT 


TV >1"l"l' TV TV P/""< T\ rp 

A X X X AACjOA X 


Av_ XXX AvjiV_A^ X 


T A TYlTdC Af3P 

X n 1 VJ 1 VJV-/AVJJV. 


TAGGTCAACC ATATCl"l"I"rT 


4680 


30 


TGGGACCTCG 


CGCGGG1 XAjA 


ft ft t^^t^^p^p ft 
AvJvrAl X X XAvj 




A G ATTG AAT C ATTTCTTTAT 

X*VJX* X X VJ*»*» X w *V X A X w i x * 


4740 




CCAAATTACG 


H^PPT 1 TV P TV TP A 

TGGTACATGA 


ft ^^^P^P^*^^^P^P ft ^* 

AL X X v_l_ X X AVj 


p*T A a A AT ATf3 


TTGGTTTAAG C T AT AAAATT 

X X VjxJ XXX ****V»J V» X ^» X * ^ 


4800 




t'**T t f^f~* IV A T 1 IV TV A 

GTCCAATAAA 


X VjI AXAAAAI 


TP*TT / > **I**T , /-«*i — ■ • 


v_ X vjr X X X vjv, X w 


TAATGGAAT C ATCGTTGTAT 


4860 


35 


CTCGCTTTAT 


IJA1 A X X ACC X 


X VJ X V_VJ X A X AA 


AflAAAAPTTO 


GATACACATC CATCCTTTAT 


4920 




» / ^ ?V /'m'I i TV rn« 

CAACACTATA 


A OPTi A TV/""' A 

AV-L-AAAvjAvJA 


X uALVj/ifl, X V_Vj 


TTTT A Tf*Tf5 A 

X XX A AV X V- X VJLfV 


TGA CAT AATT TTTTGTTTGT 


4980 




TTGTCAGATT 


T»TP TV A TfiTVT 
X X vjAA X A X v» X 


TT* A ATT A A AT 


V. IVx 1. ^% X J"\. X x \— 


TTTAGCCCGT TCAAAATCAA 


5040 


40 


GTGATTCACT 


TPPRPTTJl A p 

X Ov— Avj X iAAU 


AX X X v» 1 X 


fT AAA f TTTT 

V_> X fW^V>> X X X X 


TAAAATTGTT TTGTCTTCCC 


5100 




CAnCAGAAA 


A TP A PT* A A *l 

A X L-ACj X AA X X 


1UL X X Uj X v_~r\ 


TTTfiTfl^f^ T A 

X X X VJ X vJV_VJ x n. 


TTTACTCAAA TCAACGTCAT 


5160 


45 


ATACACATGG 


m^/«'H TV TV TV TV 

T CCT AAACAT 


IvjXV-L-AAXAX 


PPT A ATA A afi 
Vjvj X 1 nnnVJ 


A PAT AATTTA TCTTGGCATCT 


5220 


TATCACATTT GCGATATGGA TATATTCTGT CTAATAACTT TTTAGTTTCT TGAGCAGAAT 5280
ATGCATTCGG ATACGGTCCG AAATATTTGC CAGTACCTTG TTTTACAGTT CTCGTCACTA 5340
GTAGTCTAGG ATATTTCTCC TTCGTAATTT TAATAAATGG ATAACTTTTA TCATCCTTTA 5400
ATAATATATT ATATCTTGGT TGATATTGTT TAATCAGATT CAATTCCAGT AAAAGTGATT 5460






EP0 786 519 A2 



TTTTAGCATC ATGAGCACCC GTAAAATATG ATCGCAATC 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4594 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:


AAATCAATCG AGTGGCATGT CAAGGTCATA TCAATATTTT AGAATCTGCG ACTATGAGAG 60
AGGAAATAAA TGAAATTGCG CGACGTATCA TCGTTGATAT TCGTGATAAG CAATTACGAT 120
ATCAAGATAT TGCTATTTTA TATCGTGATG AATCTTATGC TTATTTATTT GATTCCATAT 180
TACCGCTTTA TAATATTCCT TATAATATTG ATACAAAGCG TTCGATGACA CATCATCCGG 240
TCATGGAAAT GATTCGTTCA TTGATTGAAG TTATTCAATC TAATTGGCAA GTGAATCCAA 300
TGCTACGCTT ATTGAAGACT GATGTGTTAA CGGCATCATA TCTAAAAAGT GCATACTTAG 360
TTGATTTACT TGAAAATTTT GTACTTGAAC GTGGTATATA CGGTAAACGT TGGTTAGATG 420
ATGAGCTATT TAATGTCGAA CATTTTAGCA AAATGGGGCG TAAAGCGCAT AAACTGACCG 480
AAGATGAACG TAACACATTT GAACAAGTCG TTAAGTTAAA GAAAGATGTC ATTGATAAAA 540
TTTTACATTT TGAAAAGCAA ATGTCACAAG CGGAAACTGT AAAAGATTTT GCAACTGCTT 600
TTTATGAAAG TATGGAATAT TTCGAACTGC CAAATCAATT GATGACAGAG CGAGATGAAC 660
TTGATTTAAA TGGTAATCAT GAAAAGGCGG AGGAAATTGA TCAAATATGG AATGGCTTAA 720
TTCAAATCCT TGATGACTTA GTTCTAGTAT TTGGAGATGA ACCAATGTCG ATGGAACGTT 780
TCTTAGAAGT ATTTGATATT GGTTTAGAAC AATTAGAATT TGTTATGATT CCGCAAACAT 840
TGGACCAAGT AAGTATTGGT ACGATGGATT TGGCTAAAGT CGATAATAAG CAACATGTTT 900
ACTTAGTAGG TATGAATGAT GGAACGATGC CACAACCAGT AmTGCGTCAA GCTTGATTAC 960
AGATGAAGAA AAGAAATACT TTGAACAGCA GGCTAATGTC GAGTTAAGTC CAACATCAGA 1020
TATTTTACAG ATGGATGAAG CATTTGTTTG TTATGTTGCT ATGACTAGAG CTAAGGGAGA 1080
TGTTACATTT TCTTACAGTC TAATGGGATC AAGTGGTGAT GATAAGGAGA TCAGCCCATT 1140
TTTAAATCAA ATTCAATCAT TGTTCAACCA ATTGGAAATT ACTAACATTC CTCAATACCA 1200
TGAAGTTAAC CCATTGTCAC TAATGCAACA TGCTAAGCAA ACCAAAATTA CATTATTTGA 1260
AGCATTGCGT GCTTGGTTAT ATGATGAAAT TGTGGCTGAT AGTTGGTTAG ATGCTTATCA 1320
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GTTTGACAAT 
GAAACTGTAA AATTAGGTGA AACGTTGTCT AAAGATTTAT ATGGTAAGGA 1440
AATCAATGCC AGTGTATCCC GTTTTGAAGG TTATCAACAA TGCCCATTTA AACACTATGC 1500
GTCACATGGT CTGAAACTAA ATGAGCGAAC GAAGTATGAA CTTCAAAACT TTGATTTAGG 1560
TGATATTTTC CATTCTGTTT TAAAATATAT ATCTGAACGT ATTAATGGCG ATTTTAAACA 1620
ATTAGACCTG AAAAAAATAA GACAATTAAC GAATGAAGCA TTGGAAGAAA TTTTACCTAA 1680
AGTTCAGTTT AATTTATTAA ATTCTTCAGC TTACTATCGT TATTTATCAA GACGCATTGG 1740
CGCTATTGTA GAAACAACAC TAAGCGCATT AAAATATCAA GGCACGTATT CAAAGTTTAT 1800
GCCAAAACAT TTTGAGACAA GTTTTAGAAG GAAACCAAGA ACAAATGACG AATTAATTGC 1860
ACAAACATTA ACGACAACTC AAGGTATTCC AATTAATATT AGAGGGCAAA TTGACCGTAT 1920
CGATACGTAT ACAAAGAATG ATACAAGTTT TGTTAATATC ATTGACTATA AATCCTCTGA 1980
AGGTAGTGCG ACACTTGATT TAACGAAAGT ATATTATGGT ATGCAAATGC AAATGATGAC 2040
ATACATGGAT ATCGTTTTAC AAAATAAACA ACGCCTTGGA TTAACAGATA TTGTGAAcCA 2100
GGTGGaTTAT TATACTTCCA TGTACATGAA CCTAGAATTA AATTTAAATC ATGGTCTGAT 2160
ATTGATGAAG ATAAACTAGA ACAAGATTTA ATTAAAAAGT TTAAGTTGAG TGGTTTAGTT 2220
AATGCAGACC AAACTGTTAT TGATGCATTG GATATTCGTT TAGAACCTAA ATTCACTTCA 2280
GATATTGTAC CAGTTGGTTT GAATAAAGAT GGCTCTTTGA GTAAACGAGG CAGCCAAGTG 2340
GCAGATGAAG CAACGATTTA TAAATTCATC CAACATAACA AAGAGAATTT TATAGAAACA 2400
GCTTCAAATA TTATGGATGG ACATACTGAA GTTGCACCAT TAAAGTACAA ACAAAAATTG 2460
CCATGTGCTT TTTGTAGTTA TCAATCGGTA TGTCATGTAG ATGGCATGAT TGATAGTAAG 2520
CGATATCGAA CTGTAGATGA AACAATAAAT CCAATTGAAG CAATTCAAAA TATTAACATT 2580
AATGATGAAT TTGGGGGTGA GCAATAGATG ACAATTCCAG AGAAACCACA AGGCGTGATT 2640
TGGACTGACG CGCAATGGCA AAGTATTTAC GCAACTGGAC AAGATGTACT TGTTGCAGCC 2700
GCGGCAGGTT CAGGTAAAAC AGCTGTACTA GTTGAGCGTA TTATCCAAAA GATTTTACGT 2760
GATGGCATTG ATGTCGATCG ACTTTTAGTC GTAACGTTTA CAAACTTAAG CGCACGTGAA 2820
ATGAAGCATC GTGTAGACCA ACGTATTCAA GAGGCATCGA TTGCTGATCC TGCAAATGCA 2880
CACTTGAAAA ACCAACGCAT CAAAATTCAT CAAGCACAAA TATCTACACT CCATAGTTTT 2940
TGCTTGAAAT TAATTCAACA GCATTATGAT GTATTAAATA TTGACCCGAA CTTTAGAACA 3000
AGCAGTGAAG CTGAAAATAT TTTATTATTA GAACAAACGA TAGATGAGGT CATAGAACAA 3060
CATTACGATA TCCTTGATCC TGCTTTTATT GAATTAACAG AGCAATTGTC TTCAGATAGA 3120
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AATCCTACAA ATTGGTTGGA TCAATTGGTG ACACCATACG AAGAAGAAGC ACAACAAGCG 
CAACTTATTC AACTACTAAC AGACTTATCT AAAGTATTTA TCACAGCTGC TTATGATGCT 
TTAAATAAGG CGTATGATTT GTTTAGTATG ATGGATAGCG TCGATAAACA TTTAGCTGTT

ATAGAAGATG AACGACGTTT AATGGGGCGT GTTTTAGAAG GTGGCTTTAT TGATATACCT 
TATTTAACTG GTCACGAATT TGGCGCGCGT TTGCCTAATG TAACAGCGAA AATTAAAGAA 
GCAAATGAAA TGATGGTCGA TGCCTTAGAA GATGCTAAAC TTCAGTATAA AAAATATAAA 
TCATTAATTG ATAAAGTGAA GAGTGATTAC TTTTCAAGAG AAGCTGATGA TTTGAAAGCT 
GATATGCAAC AATTGGCGCC ACGAGTAAAG TACCTTGCGC GTATTGTGAA AGATGTTATG 
TCAGAATTCA ATCGAAAAAA GCGTAGCAAA AATATTTTGG ATTTTTCTGA TTATGAACAT 
TTTGCATTAC AAATTTTAAC TAATGAGGAT GGTTCGCCTT CAGAAATTGC CGAATCATAC 
CGTCAACACT TCCAAGAAAT ATTGGTCGAT GAGTATCAAG ATACGAACCG AGTTCAAGAG

AAAATACTAT CTTGCATCAA AACGGGTGAT GAACATAATG GTAATTTATT TATGGTTGGA 
GATGTTAAGC AATCCATTTA TAAATTTAGA CAAGCTGATC CAAGTTTATT TATTGAAAAG 
TATCAACGCT TTACTATAGA TGGAGATGGC ACTGGACGTC GAATTGATTT GTCGCAAAAC

TTCCGTTCTC GAAAAGAAGT ACTGTCAACG ACTAACTATA TATTCAAACA TATGATGGAT 
GAACAAGTCG GTGAAGTAAA ATATGATGAA GCGGCACAGT TGTATTATGG TGCACCATAT
GATGAATCGG ACCATCCaGT AAACTTAAAA GTCCTTGTTG AAGCGGATCA AGAACATAGT

GATTTAACTG GTAGTGAACA AGAAGCGCAT TTTATAGTAG AACAAGTTAA AGATATCTTA 
GAACATCAAA AAGTTTATGA TATGAAAACA GGAAGCTATA GAAGTGCGAC ATACAAGGAT 
ATCGTTATTC TAGAACGCAG CTTTGGACAA GCTCGCAATT TACAACAAGC CTTTAAAAAT

GAAGATATTC CATTCCATGT GAATAGTCGT GAAGGTTACT TTGAACAAAC AGAAGTCCGC 

TTAGTATTAT CATTTTTAAG AGCGATAGAT AATCCATTAC AAGATATTTA TTTAGTTGGG 


TTAATGCGCT CCGTTATATA TCAGTTCAAA GAAGACGAAT TAGCTCAAAT TAGAATATTG 
AGTCCAAATG ATGACTACTT CTATCAATCG ATTG 
(2) INFORMATION FOR SEQ ID NO: 201: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6313 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GGTTTTCnTG GAAAGATAGT GAAAATCTCG TGTTTTTTGG TTTTgAGGTG TTGTTTGTAT 60 

TTTaTAAAAT GGCTTACATA TATGAAGCGT TGATTAAGTA TGGAATTGTT AATTAATTGA 120 

ACCTATTTAG CTTTAAGAAG GCATAACAAG ATGACCTTAT TTTATGCTAT AATATTTCTA 180 

TTATGCGAAG ATTAAGGTGA GTAGTAAATT GGATAAAAAA GTAAGTATTC AAACAAAGCA 240 

AGTGTTGAAA CAGCACAACG AAAAAGAAAA ATTTGAATTT ACTACTGAAG GAACTTGGCA 300 

ACAAAGGCAA TCTAACTTTA TTCGGTATGT AGAACAAATT GAGGATGCAA CAGTTAATGT 360 

TACAATAAAA GTGGATGATG ATAGCGTTAA GTTGATTCGT AAAGGCGACA TTAATATGAA 420

TTTGCATTTT GTTGAAGGAC AAACGACAAC AACTTTTTAC GATATATCGG CTGGACGAAT 480 

TCCACTAGAA GTTAAAACAT TACGCATTTT ACATTTCGTA AGTGGAGACG GTGGCAAGCT 540

AAAGATTCAT TATGAATTAT ATCAAGATAA TGAAAAAATG GGTTCTTATC AATATGAAAT 600 

TAACTATAAG GAGATAGGCG AATGAATATT ATTGATCAAG TGAAACAAAC ATTAGTAGAA 660 

GAAATTGCAG CAAGTATTAA CAAAGCAGGA TTAGCAGATG AGATTCCTGA TATTAAAATT 720

GAAGTTCCTA AAGATACAAA AAATGGAGAT TATGCTACTA ATATTGCGAT GGTACTGACT 780

AAGATTGCAA AGCGTAATCC TCGTGAAATT GCTCAAGCGA TTGTTGATAA CTTAGATACT 840
GAAAAAGCAC ATGTAAAACA AATTGACATT GCTGGTCCAG GATTCATTAA TTTTTACTTA 900
GATAATCAGT ATTTAACAGC AATTATTCCT GAAGCAATTG AAAAAGGTGA TCAATTTGGA 960

CATGTAAATG AATCAAAAGG TCAAAATGTA TTGCTTGAGT ATGTTTCAGC TAACCCTACA 1020

GGAGATTTAC ATATTGGTCA TGCTAGAAAT GCAGCAGTTG GTGATGCTTT AgcTAAtATT 1080 

TTAACTGCAG CTGGCTATAA TGTAACACGT GAATATTATA TTAATGATGC TGGTAATCAA 1140 

ATTACTAACT TAGCGCGTTC GATTGAAACA CGTTTCTTTG AAGCTTTAGG TGACAATAGT 1200

TATTCAATGC CAGAAGATGG CTATAATGGA AAAGATATTA TTGAAATAGG TAAAGATTTA 1260 

GCAGAGAAAC ACCCTGAAAT TAAAGATTAT TCTGAAGAAG CACGTTTGAA AGAATTTAGA 1320 

AAATTAGGCG TAGAATACGA AATGGCTAAA TTGAAAAATG ATTTAGCAGA GTTCAATACG 1380 

CATTTTGATA ATTGGTTTAG TGAAaCATCT TTATATGAAA AAGGAGAAAT TCTTGAAGTT 1440 

TTAGCAAAAA TGAAAGAATT AGGTTATACG TATGAAGCTG ATGGCGCTAC ATGGTTACGT 1500 

ACAACTGATT TTAAAGACGA CAAAGACAGA GTATTAATTA AAAATGACGG TACATATACG 1560

TATTTCTTAC CAGATATTGC GTACCACTTC GATAAAGTAA AACGTGGTAA TGACATTTTA 1620 

ATCGATTTAT TTGGTGCTGA TCATCATGGT TATATTAATC GTTTGAAAGC ATCTCTTGAA 1680
ACGTTTGGTG TAGATAGTAA TCGTTTAGAA ATTCAAATCA TGCAAATGGT TCGTTTAATG 1740 



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ATTATGGACG AaGTTGGCGT TGACGCTGCA CGTTATTTCT TAACTATGCG TAgTCCTGAT 1860




AGTCACTTTG ATTTTGATAT GGAATTAGCG AAAGAGCAAT CTCAAGACAA TCCAGTTTAC 1920

TATGCTCAAT ATGCACATGC GCGTATTTGT TCAATTTTAA AACAAGCGAA AGAGCAAGGT 1980

ATTGAAGTGA CTGCTGCGAA TGATTTTACA ACGATTACTA ATGAAAAAGC GATTGAATTG 2040


TTGAAAAAAG TAGCTGATTT CGAACCTACA ATTGAAAGTG CTGCTGAGCA TAGATCGGCA 2100

CATAGAATTA CTAATTATAT TCAAGATTTA GCTTCTCATT TCCATAAATT CTATAATGCT 2160

GAAAAAGTGT TAACAGATGA TATTGAAAAA ACAAAAGCAC ATGTTGCTAT GATTGAAGCG 2220


GTCAGAATTA CATTGAAAAA TGCATTGGCA ATGGTCGGTG TAAGCGCACC TGAATCAATG 2280

TAAGAACATT TATATACACT CCAACGTAGA GTTTCTCGAA AGATACTTTG TGTTGGAGTG 2340

1'iTrrrnAG GTATGTGACA TATTGGGGAA TGCTTAGTAT GTGAATAAGG TTAAGAGGAA 2400


CACAGTTGGA TGCTCTGCAC AACTGCATAA GAGAGCCTGA GACATAAATC AATGTTCTAT 2460

GCTCTACAAA GTTATAATGG CAGTAGTTGA CTGAACGAAA ATTCGCTTGT AACAAGCTTT 2520

TTTCAATTCT AGTCAACCTT GCCGGCGGGG CCCCAACAAA GAGAAATTGG ATTCCCAATT 2580


TCTACAGACA ATGCAAGTTG GGGTGGGACG ACGAAATAAA TTTTACGATA ATATCATTTC 2640

TGTCCCACTC CCTCTAAAAT GGAGGGTGTA AATGTTAGGA ACTGATGAAT TATATAAAGT 2700

TTTATATGAA CATCTCGGAC CACAATTTTG GTGGCCTGCT GATAATGACA TTGAAATGAT 2760


GTTAGGTGCA ATTTTAGTTC AAAATACTAG ATGGCGAAAT GCAGAAATTG CATTGAATCA 2820

GATTAAAGAA CATACGCATT TTAATCCAAA TCATATATTA GAACTACCTA TTGAAACGTT 2880

ACAATCATTG ATACATTCAA GTGGCTTl'TA TAAAAGTAAA TCACTGACGA TTAAAACATT 2940

ATTAACATGG TTAGCACGAC ATCATTTCAA TTATCAAGAG ATTAATGAGC GATATAAAGG 3000




TGGATTAAGA AAAGAATTAT TATCTTTGAA AGGTATTGGA AGTGAAACAG CAGATGTCTT 3060

ACTTGTTTAT ATATTCGGAC GTATTGAATT TATTCCAGAT AGCTATACAA GAAAAATATA 3120

TGATAAATTA GGATATGAAA ACACTAAAAA TTATGATCAA TTAAAAAAAG TAGTCaCATT 3180

ACCAAATCAT TTTACAAATC AAGATGCTAA TGAATTTCAT GCTCTGTTAG ATGTATTTGG 3240


TAAACATTAC TTTAGAGACA AAGATATAAA GAATTATGAT TTTTTAGAAC CTTACTTTAA 3300

AAAGTAAACG CTGTGAAGTT AGATAGATGA GTTTATATGA AATATAAAAA ATAATTTACT 3360

ATTTTCTTTT AGTATGTGGA CTTATATAAT AAATAGAAGC ATATAAAGAA AAAAACAGTT 3420

GTTTGTTTGT GCAGCAACTG CATAAGAGCC CCTAATCGCT AAAGCTCAAG GGGAGTAAAG 3480

GAATACAGTT GTTTGTGCAG CAACTGCATA AAAGCCTCTA ATCACTAAAG GTGAAGAGGA 3540
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AACGCAGTTG GATgCTACCG 


CACAAC71 Vj LA 




1 cLA l v„y c 1 AA 


A/TYI A A 2V AC5T 


■a £ c n 




GGGATTAAAA 


AGGAGATGTG 


AT AG LXj luAA 


LrAAA 1 V~w 1 1A 


A 1 1 UL 1 1 1 1 A 


1X1 X w^A X XXX 


•lion 


5 


TATG CTTGTC 


CTGAGTGGCT 


rr*f-*r~t <-p t\ Tyi A A 

vjIuvjXAIuAA 


Ti/** A*P A A. TY" 1 A T" 
nuA 1 AA 1 VjA 1 


A A AP A AfVZTA 


rra A THAT A A 

UUlA X VM"V X Aft 


*7 Q n 




TGGCTCGTCT 


AAATCGCCGT 


AC CAT AC* Aft I 


^^V* ^^^^^^^^^^^^^^ R 
I\j 111 V.V9 1 1 A 


A 1 vj*- V— 1 Aw 1 A 


ATACTGAAAT 






TTTATATGAA 


TTAGGATTAG 


TV A A ^ IS *P 

GTAAATACAx 


AVj 1 i Vjw X Vj 1 1 


X v-rtrl\-.vjVj A x w 


ATH ATT AT CC 




10 


AAAAGATGTG 


AAAAAGGGTA 


7\ TV » A /""* A ATT 

AGAAALAAJ. 1 


1 VjA 1 lilvj 


A/\ X \— X n/vl X n 


AAGAGGAACT 


3 96 0 




TTTAAAGGCA 


AAGCCAGATC 


TAATTCTTGC 


wCA IvxAw ILu 


c* a aaa riri/"^ A A 


PTGPTAATAA 

X V-7 >— X AA X ATI 


4 02 0 


15 


AGTATTGTCA 


TCATTAGAGA 


TV A /"* TV TV ^V** /*' J\ 1^ 

AACAAGGCAT 


LAAAb 1 Avj 1 vj 


1 AX vj X X AAAvj 


A Tf5 fAClAATC 

r\ X o v-A>»-An X v_ 


4 080 


AATTGATGAA 


ACTTACAACA 


CAllTAAGCA 


TV TV 'TTOPr* A A A 

Ml 1GGGAAA 


TT A A r*n C A TP 


A*TY2 ATAAfJPA 
e\ X on X nnuv»n 


d i a n 

■illw 




GG CTGAACAA 


CTTGTTGAGG 


TV A A r ,f P A A A A 

AAAlTAAAGA 


T» A A T" A TPP & T 

lAAlAlLGAl 


nnnu X V— /\ X AO 


n X X \^r\*\ X X v>v< 


4 200 


20 


TGCTCATCAT 


AAAAAATCAA 


AAGTATTTAT 


TG AGv» 111 LA 


1 LAAAuLL I\j 


A A AT AT ATA C* 






AGCAGGGAAG 


CATACATTTT 


mmit TV CT^/^ TV ^niv fJ> 

TTAATGATAT 


fwpT>ii A A A A A 

GTTAGAAAAA 


X lAwAAwwv-L 


AAA ATYITHT A 
AAAA X s> Ivjin 






TAGTGACATT 


AATGGTTGGA 


JV ^*^t Ml*/ MI I TV IV 

ACCCTGTAAC 


/■^ TV TV /^/"* A TV RPT 

G AAGCj AAAij 1 


A TT ATT* A A A A 
Al 1 Al 1AAAA 


A fl A A CCC A A 
nunn v_ v_ vavjA 




25 


TATATTAATT 


TCGACGGAAG 


WH TV TV TV X**TV TV 

CTAAGACAAG 


ATCAGA1 1 A I 


A 1 LAiA X A X \_A 


TP A A A A A A An 


****** VJ 




AGGTGGATTC 


AATAAAATTA 


ATGCTGTCAA 


TV TV «T* TV A r T^ 

GAATACAOj I 


TV rpTV TV TV /~> T»rn 
A 1 1 vjAAvj 1 1 vj 


TA A A TYlflTfl A 
X AAA 1 VjVj X VjA 


^ e Art 
H 3 v U 




TGAAGTATCA 


AGACCAGGTC 


^ fV 'Ull TV 'f Ml IS^ ft 

CACGTATTGA 


^/*» tv a r^r* A » T " 1 1 A 

TGAAGGATiA 


A A A/"* & A 'ITS A 

AAAGAAi X AA 


r* A C A P A A T 




30 


TTATAGAAAA 


TAAACCATTC 


«T* TV TV f OT^ TV f* 

TAATTATGCC 


CCTTATTG CT 


TPT B A A A 
ALA IVjI AAAA 


AA X r\v,>l lol 1 


*» o ^ V/ 




TGAGATAAGG 


GGTTTTTaAA 


ATATATTTAG 


TGAATGATAG 


LAALGUvjrtvj X 




4 680 




TATAATGAAT 


GTAATTATCG 


ATGAAcaaAA 


GAGAATGCTA 


TY7 A 1 A * i i — i • A A 
X Vj Av A X X 1AA 


TAAAHTATTA 
X nnnu X n x x r\ 


474 0 


35 


TTGAGCTGGa 


TAGTCmTATT 


GATTATAACA 


ACTAGCATAT 


AIL X AX 1 1 X w 


vjV— /\VJ X X v-TVjrVyv^. 


4800 




GATATCAATG ATGTATTTAA CCAGTCTATT TTAATCAATG 


X XAwAX X nv»v. 


Vj/\vjx\ x x rv x x 


4 860 


40 


GAAGCATTGT 


TGACAGGTAT 


GATATTAACT 


GTTGCAGGCC 


'PT A **P A T*l" P/** 1 A 

1 1 A X AX X Ivvft 


A A PAfJTTTTA 
nAv^/VJ X X X X *■* 


4920 


AATAATGCAT 


TGGCAGATAG 


CTTTACATTA 


GGATTGGCAA 


uLouLov, inv. 


ATT*TYlflTT(*?A 

t\ XXX ww X X \~r\ 


4980 




GGATTAGCAT 


TATTTTTAGG 


TTTAACAACG 


TTATGGATTC 


V_ XAj 1 Al XXXV- 


A ATAACATTT 


5040 


45 


AGTTTGATAA 


CATTAATAAC 


TGTATTAGTC 


ATTACGTCGG 


1 A X IwAwwV^J-l 


AfifirTATfTA 
nww V- X t\ X V- \— A 


5100 




GTTAGAATCT TAATATTAAG TGGTTTAATG ATTGGTGCGT TATTCAATTC ACTTCTATAT 5160

TTTTTGATTT TATTAAAACC TCGCAAATTA AATACAATTG CCAATTATCT GTTTGGTGGT 5220

TTTGGTGATG CAGAATACTC AAATGTATCT ATAATAGCAA TCACATTTAT CATTGCATTG 5280

TTTGGTATAT TTATCATTCT TAATCAACTA AAGTTATTGC AATTAGGAGA ACTAAAAAGT 5340
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w 






ATAACGGCGA TAAATGTCGC ATATGTTGGC ATCATTGGAT TCATTGGTAT GGTGATACCG 5460
CAACTCATTA GAAAATGGCA GTGGAAACAA TCATTAGGAA GACAATTGGC TTTAAATATT 5520
GTAACTGGAG GACAAATAAT GGTTATGGCA GATTTTATTG GTAGCCATAT ATTGTCACCA 5580
GTACAAATAC CGGCAAGTAT TATCATTGCA TTAATTGGTA TACCAGTGTT AtTTTACaTG 5640
CkAAwAtCtC aGTCgAAAcG GTTACaCTAG CACACGACaT TTGCTAAAAT AAAAATAACT 5700
ATAAACATAA AGAGGGCATA AGCGATGGAT TTGAATCAAA TTAAAGCAGT TGTATTTGAT 5760
TTAGAAGGTA CGTTGTTGGA CAGAGTTAAA TCTCGAGAGA AATTTATCGA AGAGCAATAT 5820
GAACGATTTC ATGACTACTT AATTCATGTT CAACTGGCAG ATTTTAAAAA AgCATTTATT 5880
GAGCTAGATG ACGATGAAGA TAATGATAAA CCTGATTTAT ATAAAGAAAT CATTAAACGT 5940
TTCCATGTAG ATAGGTTAAC TTGGAAAGAC TTATTTAATG ATTTTGAAAT GCATTTTTAT 6000
CGTTATGTAT TTCCTTATTA CGATACTTTG TATACACTAG AAAAgCTATC GCAAAAAGGC 6060
TTTCAAATTG GTGTTATCGC AAATGGTAAA TCTAAGATTA AACAATTTCG ATTACATTCA 6120
CTTGGTTTGA TGCATGTTAT TAATTATTTA TCAACATCAG AAACAGTTGG TTTTCGTAAA 6180
CCACATCCTA AAATTTTTGA AGATATGATT GATCAACTAG GGGTATTACC TGAGCAAATT 6240
ATGTATGTTG GCGATGATGC GTTAAATGAT GTAGCTCCAG CACGAGCTAT GGGCATGGTT 6300
AGTGTATGGT ATA 6313

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2174 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:
CCGTAAACAC ATCAACAAAA GAAGGCTATA TTACAAAAGA AGACTTGGAC TTATGCTGCA 60
CGTCGCTCTA ATTCAGCTGG AATGCAAGTC ACCGGACGAC TGGCTTACAT TGAACCTTAT 120
GGGGCAACAA GTCGCACAAA ATAAACGCGC GAGAAGCaAG AATAGGAAGT GATATCTATG 180
AAATGGTTAT CACGAATATT AACAGTAATA GTGACCATGT CtATGGcGTG TGGTGCATTG 240
ATATTTAATC GTAGACATCA GCTAAAGGCG AAAACGCTGA ACTTCAATCA TAAAGCATTA 300
ACAATTATTA TTCCGGCTAG AAACGAAGAA AAAAGAATAG GTCATTTACT ACATTCGATA 360
ATACAACAGC AAGTTCCAGT AGATGTCATT GTTATGAATG ACGGATCGAC AGATGAAACA 420






EP 0 786 519 A2 





AAATGGTATG GGAAATCACA TGCTTGTTAT CAAGGTGTGA CGCATGCATG TACGAATCGC 540

ATTGCCTTTG TAGATGCTGA TGTAACTTTC TTAAGGAAAG ATGCTGTTGA AACGl IvjATT 600

AATCAGTATC AATTACAAGG TGAAAAAGGA TTGTTAAGCG TACAGCCTTA TCATATAACA 660

AAGCGTTTCT ACGAAGGGTT TTCAGCGATA TTTAATTTAA TGACAGTCGT TGGTATGAAT 720




GTATTTTCTA CCTTAGACGA CGGTCGGACT AACCAGCATG CATTTGGACC GGTGACA l I A 780

ACAAATAAAG AAGATTATTA TGCAACTGGA GGTCATAAAA GTGCAAACCG TCATATTATT 840

GAAGGATTTG CTTTAGGAAG TGCATATACT TCACAATCAT TGCCCGTAAC AGTTTATGAA 900

GGGTTTCCAT TTGTTGCATT TCGCATGTAT CAAGAAGGAT TTCAGTCATT ACAAGAAGGA 960


TGGACAAAGC ATTTGTCAAC TGGGGCAGGT GGCACAAAGC CTAAGATCAT GACAGCAATT 1020

GTGTTGTGGT TGTTTGGTTC TATAGCGAGT ATTTTAGGGC TATGTCTTAG TTTAAAATAT 1080

CGCCAAATGT CTGTAAGAAA AATGGTAGCA CTTTACTTGA GCTATACTAC ACAATTTATT 1140

TATCTGCATC GAAGGGTCGG CCAATTTTCT AATTTATTAA TGGTATGTCA TCCATTGTTA 1200




TTTATGTTTT TTACTAAAAT TTTCATCCAA TCTTGGAAAC AAACGCATCG TTATGGTGTA 1260

GTTGAATGGA AAGGTCGTCA ATATTCTATA TCTAAAGAAC AATAAATCAA GGTAATGGCA 1320

TTTCAATATA GGAGGACTAG TATGACAATG ATGGATATGA ATTTTAAATA TTGTCATAAA 1380

ATCATGAAGA AACATTCAAA AAGCTTTTCT TACGCTTTTG ACTTGTTACC AGAAGATCAA 1440


AGAAAAGCGG TTTGGGCAAT TTATGCTGTG TGTCGTAAAA TTGATGACAG TATAGATGTT 1500

TATGGCGATA TTCAATTTTT AAATCAAATA AAAGAAGATA TACAATCTAT TGAAAAATAC 1560

CCATATGAAC ATCATCACTT TCAAAGTGAT CGTAGAATCA TGATGGCGCT TCAgCATGTT 1620

GCACAACATA AAAATATCGC CTTTCAATCT TTTTATAATC TCATTGATAC TGTATATAAA 1680




GATCAACATT TTACAATGTT TGAAACGGAC GCTGAATTAT TCGGATATTG TTATGGTGTT 1740

GCTGGTACAg TAGGTGAAGT ATTGACGCCG ATTTTAAGTG ATCATGAAAC ACATCAGACA 1800

TACGATGTCG CAAGAAGACT TGGTGAATCG TTGCAATTGA TTAATATATT AAGAGATGTC 1860

GGTGAAGATT TTGACAATGA ACGGATATAT TTTAGTAAGC AACGATTAAA GCAATATGAA 1920


GTTGATATTG CTGAAGTGTA CCAAAATGGT GTTAATAATC ATTATATTGA CTTATGGGAA 1980

TATTATGCAG CTATCGCAGA AAAAGATTTT CAAGATGTTA TGGATCAAAT CAAAGTATTT 2040

AGTATTGAAG CACAACCAAT CATAGAATTA GCAGCACGTA TATATATTGA AATACTGGAC 2100

GAaGTGAGaC AGGCTAACTA TACATTACAT GAACGTGTTT TTGTGGaTAA GAGGAAAAAG 2160
GCAAAGTTGT TTCA 2174
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
GAAnCAGnTA GACAAATTAT GGaAAmCGGT GTGAATCaAG GATTCtTTGG TGTAGCTGGT 60
TTTGACCTAC TCGTCGATGA GGATGATAAC GTTTATGCGA TTGATTTAAA CTTTAGACAA 120
AATGGTTCaA CGAGCATGTT ATTACTTGCT AACGAGTTGA ATTCAGGATA TCAAAAGTTT 180
TATAGTTATC ATTCAAAAGG TGATAACACA CATTTCTTCA ATACGATTTT GAAATATGTC 240
AAAGAAGGTA GTTTATACCC GTTATCTTAT TATGATGGTG ATTGGTACGG TGAAGATAAA 300
GTTAAATCAA GGTTTGGCTG TATTTGGCAT GGTGATTCAA AAGAAACAGT ACTGGAGAAT 360
GAACGCGCAT TTTTAGCTGA ACTTGAACAC TATTAGAGTT CGGAACATAA GGCGCTACAA 420
TGTTGTGTTG CCAGTAGTTG ACTGAATATG CGTTTGTAAC AAGCTTTTTT CGATTCTAGT 480
CAACAGTAAT TAAATTTATG ATATGGCAAT ACTTTGTAAT ACTAATATTA AATGGCGACT 540
TTTATTTCAC TATGTTATAA GAGTTGCCAT TTTGTTGATA AAGGTATACT AAAGGTTATC 600
GTTTTGAAAT TTTTAGTAAC TAGATATGTT TCGTGTTATA GACCGAATTT GTGTATACGT 660
AAAATTTAAT GCTATTGAAT TTTTAAAATG AAAAACATGA CATTAAATTG AATTCATAAT 720
ATGTCTAATT GACTAACTTG TTGGAGTCAT TTACTATTTT ATGTATGACA TATTTTAAAA 780
AGTGAGGGTC AAGCATGTCT TATAAAGCAT ATCCATTCTT TAGAGATATA TTAATAAATG 840
AATGTATTTA TTTCGCCTCT AAAAATAAAA AACTAGTACG CCTAAATTAT AAAAGTGAAG 900
CGnAlGTAGG CGTTTGGACA GAAGAAAGTG TGGCCGTATC ATTTTTAACA AGTCGTGATA 960
TTCCATTTGA TAAAGTTGTA AAAATGGACG TTGATCGTTT TGCTACTTAT GAATTAGATG 1020
AATTGTTTGA TGAACAAGAC CATATTATTA TGAATCAAAC AATGGAAGAw GAAGGGCATC 1080
TACTAAACGT TGTAGCTGTT ACACAAGAAG TGATGACGGA ATTAGATAAA ATTAGAATCA 1140
AAGAATTTGT CCAAGATGTA GCGAAATATG ATGAAGTATA CGGCTTAACT AAAAAAGGTA 1200
GTAAGCAGTT TATTCTCATT AGTGAAAATG ATAGCGACGA AAAAAAGCCG CATATTATGC 1260
CTGTATGGAG TATTAAAAAC AGAGCGTTAA AAGTTCGAGA TGAAGATTTT GAAGAGTGTG 1320
ATTTAATTAC GATTGAAGGT TCTGTTTTCG GAGAATGGCT AGATGAACTT AGAGATGATC 1380
ATAAAGCCGT TGCGATAGAT TTAAAAACTG GCGTGGTTGG TACAATTGTT TCAGCGCAAA 1440
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ATGGAACAAT ACGTATTCAA AACACTTAGA C CAT AAAAT A AAAGGCCATT TATATAGCGT 1560 

TTATTTAAAA CAACGCGCAT ATAAATGGTC TTTTTCTATT TTTCTAAATA TAATGCACCA 1620

ATAGCACCTG nAAAATGCGC CGTTTTCAAC ATAGTACGGT TTGCAACCGC GTAACACAGT 1680 

ATAATCTTCC ACAACTTTGC GTAATAAAGC GTTATTATGA AATGAAGAAC CGATATAAAC 1740 

GATATTTTCA GTTTTAAATT CACGTGCAAC AGTAATGGCC ATTGTCGTAA CAACTTCGCC 1800 

AACGACACCA ATAACGGCTG CTAATTTATT GCTAGGTGTA AAATCAGCAT CTAAATGATG 1860

TAGTACATGA CCAAAATTAG CTGCTGTTAA ATCACCGGGA ATGGGTGGTT CGGTATCTTT 1920

ATAAATATGT CTAACCTTTA AATCGATAGT GTTACGATCA CCGTGTTGTG CCATGTCAGT 1980 

TAACTGTTTA TAATCAGTGA TTTGACTTAG TAAATAACCG AGTCCTTGAA TCATGCCTCC 2040 

ACCTGTACCG ATACCGCCTA CACGACGTTG TGATTGGCCG TCGAAATAAT GTAGTGACGT 2100 

ACCGGTACCA ACATTTGCAA AAATATAATC TGCTAAGTCA TGGCCTTGCT CTTTTAACAA 2160 

AATACCTAGT CCTTGAGATG CAGCATCAAA CTCTACAAAA ATTTGTGCAG GAATGTTGAT 2220 

GTTTTCAGCA ATGACACCTG CATTACCTCC AGTTAAGCAT AATTTTTCAA TTTGCTGTTG 2280

GTTTAACCAT TCCACAACTT GATCAATATT TTTAGTTAAT TCAGTTTTAA AAGTACGTTG 2340

GTTATCTTGC TCTTGAACGA TTTTAATTAG TGTACCGCCA GCGTCAATGC CAACTTTCAT 2400

AAGATTCCCA CCTCATTATT AATGTCTATC CTTAAATAAT AGTATAGTAA AATGACTAAA 2460

AAACAAGTAA TAATAGTAAT TATTAACAAA TTTGATGCCa TTGCATTTCA ACATTGTAAG 2520

cGTATCGCAA TTAAtGTTTT ACAAACGTGG ACGTTAAGTt ATATATATTA TTTTCTAGGA 2580

ATTTTGAAGT TGTATAGGAT TGTTAGTTAG TGACGCAATA TTAAAAGTAG TTCGTACGCA 2640 

GTGTATTTGT AAGTCTCTGA TTAAAATGAT AAGTAATGAG GAATAGTACA TTAATTTTGA 2700 

AATTTAAAAA ATATAAATAA GTAATTTATT TAACTTAGAG CAAATAATGG TATCGTAGTG 2760 

AAATAATAGG TAAAATAATA TGGGGATTCA TGCTTCATAT ATAAAAAGAT AGGGGTTAAA 2820 

TATATGGCTA AAGAACTTTG TTTTGAAGGT ATCACTTTAA AAGCATTTGA TGAACAATAT 2880 

CGTTCAGCAA TTAATGATTT TGACTTGAAT GAAAGACAAC AAATATATTC ATCTTTACCT 2940 

AAAGAAGTTA TTGATGATGC AATTAATGAT GCTGATAGGA TTGCTAACGT AGCAwTAAmC 3000 

GATAAAAATG AAGTGGTGGG CTTTTTTGTA TTACATCGTT ACTATCAGCA TGAAGGTTAT 3060

GATACACCTG AAAATGTCGT TTATATTCGT TCATTATCGA TTAATGAAAA ATATCAAGGT 3120 

TTTGGATATG GCACGAAAAT AATGATGTCA TTGCCGCAAT ATGTTCAAGG TGTATTTCCT 3180

GATTTTAATC ATCTATATCT AGTAGTAGAT GCGGAAAATG ACAATGCTTG GAACCTATAC 3240 






CTATATTACT TGGACTTAGA TTCAAAACAT GTTTCATCAT TAAAGCTTGA AGAAGAAAGT 
CGTTCAGAAG TGACCAATGT ACATATCATT AATTTAATGA TTGATGGCCA AAAGGTTGGC 
TTTATCGCAT TGGAGCAGAT TGGTGAACGC ATGAACATTG CTGCTATTGA AGTGGATAAA 
TCATATCGCT TTAATGGTAT TGGTTCAAGT GCTCTGCGAC AATTGCCAAC TTACTTAAGA 
AAAAACTATG ACAACCTTAA TGTGATTACG ATGATTCTGT TTGGAGAGAA TAATGATTTT

AAACCATTAT GTTTAAATAG TAATTTCGTT GAAATCGAAC AAACTGATGA TTATGTCGTT 
TTCGAAAAAT ATTTAAATTA CTAACAGTGA TTGOGAAATA TGATATTGTC ATTTATAATT 
TAGTTTTGTT ACTATATATA AATGAATTCA GACGTATAAA TTTAGATTAT ATCCTTCGAA

AGGAAGTATT GGGCAATGAA AATTCAAGAT TATACAAAAC AAATGGTTGA TGAAAAATCA 
TTTATTGATA TGGCTTATAC ATTATTGAAT GATAAAGGCG AAACAATGAm mTTATATGAT 
ATyATCGATG AATTTAGAGC GTTAGGTGAT TATGAGTACG AAGAAATTGA AAATCGTGTT

GTACAATTTT ACACGGATTT AAACACAGAT GGTCGTTTTT TAAATGTTGG AGAAAATTTA 
TGGGGATTAC GTGATTGGTA TTCGGTAGAT GATATTGAAG AGAAAATCGC ACCAACTATT 
CAAAAATTCG ATATTCTGGA TGCAGATGAT GAAGAAGATC AAAACTTAAA ATTATTGGGC 
GAAGATGAAA TGGATGACGA CGATGATATT CCAGCTCAAA CAGATGATCA AGAAGAACTA 
AATGATCCAG AAGATGAGCA GGTTGAAGAA GAAATCAATC ATTCGGATAT AGTCATTGAA 
GAAGATGAAG ATGAACTAGA CGAAGACGAA GAAGTGTTTG AAGACGAAGA AGACTTCAAC 
GATTAATTTT TTGTTTGACT TTTAGTTGAA AGATGATAAA ATTTTATTCG GGCTCCTTTA 
AATAGGACAC GTGTATAAAA TTTATACGCT CCCCTTACAG AATTTGTGAG AGGGAGCGTT

TTTTtATTTA ATTGAGTAAA TCAAGAAATG ATAACGCAAA AATCAAAGTT GTAAATGATA 
TACATAGTGA CATAGCAGTA TGGAAA^XMT AAGTAAACAG AATTTA^TTT TGTCGA^TCG
ACAAtAAaCA aCTtGAaTGA GCTTGCTTTA ATGTTATGTn nTACGTAATT TTTACAATTG

ATGAGGAAGC ATTCCCTTTA ATAATTAGGA ggtcaagaca TGACAAAATT tatttttgta 
ACAGGTGGCG TAGTTTCATC CATTAGGGGA AGGGT 
(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ATAATAACTG AAATTAAAAT TGCTAAATmG TGTTaAgCTA TCGCmACAAT GAAAATwCCG 60 

ATTTTGCGTT GTTGAAAATA TCTTTCCAAA CCAAGAATCG ATAATGGCAA TAAATATAAT 120 

AAATTTCCAT AAAATGACCA AGTAAAATTA AAGTATATAA CGACAGTTGA CATGCCGTAT 180 

AAAATCGTAG CGATCATATT TGCTGAGCGT TTAAAGTGTA ATATTTTAAA TAAGTAGAAG 240

GTCACGACAA ATGTTATGAT AGCTCGTATC ATAGCCATAA TAAGTTGGTT TGTCGGCCAA 300

AAATGTATTG TCGTCGGATT AAATATACCA ACCGTTTCTC CTATTTTAAT GAAkAGAAAA 360 

TTTAGCCACA TTAAAGGTGA CAGCGAATAA TAATnTGATA GTCCTTTCAT ATAATCGCCA 420 

CCTAmTCCAA ACGATGCATC ATrTAAACTA GAAnAACTAC GTAGATGTTC ATACAnATAC 480

ATTTGAAATG GCATCATTTG ACGGAATCCA TCTCCAGCCC CGCTAAAAAC AGTACCATTC 540

ACAATATAAT CATAGATATG AGTAGAAAAT AAAATAAGCG TTAATATTAC ACTAATGAAA 600 

GTTATAACAA AGAATTGTTT GACGTTTGAA TTTAGCCACT TTTTTAACAC AACATTATCC 660 

TCAACTTTCA AATTTAAAAT TAAGTTTAAC TGAAACTAAA GTTAATGAGG TTCTTGATAG 720 

GTAAAGACGA AGATGACTGT GGAACAGATA CCTTATCATA GTTACTTAAA CTTTGGATCA 780 

TTTTCAGTTT ATCATTAAAC AAATATATTG AATAATAAAa aTGTCATACT GATAAAGATG 840

AATGTGACTT AATAAGTAAC TTAGaTTTAA CAAATGATGA TTTTTAATTG TAGAAAACTT 900 
GAAATAATCA CkTATACC 
(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16397 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

TCGCCCnATA ATCAATTTAT TTTTCATGTG CCACTCCTAT ACAAGCTnAC AATGCTTCTT 60 

CAGTTAAGGC AATATCTTTT AATTTTGTTT GATATTTTTG TTCAAAGTCA TATTGTAACT 120 

GAACAATTTC TGGCAAACCA ATATGCCAAT CCGCCAATTT TTTTTTAyCT TtGAAGAGCT 180 

CTTTTGGTGA TGkTTGcGAC ACTATACTAC CTTCTTTCAT AACGATGACT TCATCTGCAT 240

AACGCGCGAC TTCATTCATA TCATGTGAAA TTAGGATAAT TGCCTTATTT TCATCTGTTT 300

GTAGTGACTT TAGTAATCTC ATTACTTGTC GTTTACTTTG TGGATCAAGT CCTGCTGTAG 360

GTTCATCAAC CACGATAATA TCAGGATTCA TTGCCAATAT CGATACAATC GCTATTTTAC 420
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AATCCATCAA CAGACGATGG GCATAGTTTT TGGCTTCATC TAAATTCATT TTAAAGTTTT 540
TAGGTCCAAA TATCATTTCA CGCTCTACTG TGTCCTCAAA TAATTGAGAT TCGGGAAATT 600
GAAATACCAT TCCAATTCTT TTTCTTACAG GTCTAATATA TTTATCTTTG GTCTTATGTG 660
TAATAGTAAT GTCATCAACT GTAACTGTCC CAGTAGTCGG CTTTAACAGC GCATTAATAT 720
TTTGTATCAA CGTTGATTTA CCACTACCCG TTTGTCCAAC GATGGCGTAA TATTTACCTT 780
GTTCAAATTC TGTATTAACA TCATGAATAG CTTGATGCTG ATATGGTGTC CCTTTTTGAT 840
AGGTATAACT TACATTGTCA AACCGTATAG TCATAGTTGA TCCACCAGCC CTTCATAAGT 900
TAAGAATGAT GTTTGGTGTC CCAGCATTTG ATTTATTTTG ATTGGGAATG GCAAATCTAG 960
ACCTATTCTT GTTAACTCTT CTGCATTGTC GAAAATTTCA GTCGCTGTGC CTTCTTTATA 1020
GACAGTCCCT TTATTCATAA CGATAACATG ATCTGCTTCC ATCGCCTCAG ATAAATCATG 1080
CGTAATAGAA ATGATTGTAA TATTATGTTC TGATTTAACT TTTCTCACTA AATCCAATAA 1140
ATTTTGACGT GCATCAGGAT CTAACATAGA AGTCGCCTCA TCTAATATAA TGACAGAGGG 1200
GTTAAGTGCT AATACACTTG CTATAGCCAC ACGCTGCTTC TGTCCCCCCG ATAATGCATT 1260
AGGTTCATAA TCTGCACGTT CTAACATATC AACTTGTTTA AGTGCTTCGC TGACTCTTCT 1320
ATGCATTTCG TCATATGGAA CCGCATGATT TTCGAGTCCA AATGCCACAT CGTATTTTAC 1380
AATTGAACCA ACAAATTGAT TATCCGGATT CTGAAATACA ATTCCTATGT CTTTTCTTAA 1440
CTTTTCAAAA TTATCATCAG TTATAGCTTG ATTATTATAA AAAATTTCTC CAGATTTAAC 1500
TTTCTCTATG CCAATCATTA ACTTGGcAAT TGTAGATTTT CCAGAACCGT TATCACCAAC 1560
AATAGATGTC CACTGACCTT TAGGTATATT AAAAGAAACA TCTTTCAATG TCAAGGATGC 1620
ATCACTTTGA TATTGAAATG AAACATTTTT AAATACAATA ACTGAATTCT TATCCTCCAC 1680
TTG*rcTCTCT CCTTTACGAT TCX5TGTATGT ATCATATTTT ACAATATTTA TAAATCGCTG 1740
TATATGACAT TGACTGGGTT CTCTATATAT TACTAGTATT TTCTGACTCA TTTCTAGTCT 1800
TTAAAGTGTT GTTTAACAAC TAATGATAAG GACTTTTATT CCTCTCTAAC AATTATGTAT 1860
AAACGTTAAT AAAATAAATG ATTTACTAAT ATAGGGGTGG TCGCGTTTGA TTCAACGATA 1920
ATACTTTCAC TTCATTCAGT TCTAGTGAAA TTGATCAAAC TAGCTTCATC ATATTTTTAG 1980
ATTCGCACTC AAAAAAGTAA ATATAAAGAA ATCGGACTTA AAAACATTTC TGTTCATAAG 2040
TCCGATATTT TATTCAATAA AAAAGCGCGC ACCCCATCAT AAGTTTGTTG AGTTCACGCT 2100
TTAAATCTTT ATTTAGTTGA TGGGGTACTC TGAGCTAGAC AATATTTGTA TGTGGCAAAC 2160
ATTAtCGTTG CACTCATTTG CTTTATATAA AAGTAGTTAG TGTATTTATA TAAATTCTTA 2220
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ACGAGTGTAA CCACCTTGAC GTTCTGTGTA AcGCTCTGCG ATTTCACCAA ATAATTTTTG 2340
AAGTGCAGTT TGTGTAGTTT CATCTTCGTT TAAGATTTCA ACATTACGTA AAGTTTTAGC 2400
TGCATTACGA CGAGAAGCTA AATCTCCTTT TTTACCTAAA GTGATTAATT TCTCAACAAC 2460
ACTGCGAACT TCTTTTGCAC GAGCTTCTGT AGTTTCAATA CGTTCACTAA TAATAAGTGA 2520
TGTAGCTAAG TCACGTAACA TAGCTTTACG TTGATCAGAA GTACGACCTA ATTTTCTGTA 2580
ACCCATGAGT TAACCTCCTT TATCAATCTT CTTTTCTTAA TCCTAATCCT AAATCTTCTA 2640
ATTTGTATTT AACTTCTTCT AAAGATTTAC GACCTAAATT ACGCACTTTC ATCATGTCAG 2700
CTTCAGATTT GTCAGCTAAC TCTTGAACAG AATTGATTCC TGCGCGTTTT AAGCAGTTAT 2760
ATGAACGTAC AGATAAGTCT AATTCTTCAA TAGACATTTC TAATACTTTT TCTTTTTGAT 2820
CTTCTTCTTT TTCAATCATG ATTTCAGCGT TTTGCGCTTC ATCAGTAAGA CCAACGAAGA 2880
TATTCAAGTG TTCAGTCATT ATTTTTGCTG CTAATGAAAC TGATTCTTGT GGTGTGATTG 2940
AACCATTAGT CCAAACATCC AATGTTAATT TATCAAAATC ACTGCTTTGA CCTACACGTG 3000
TATTTTCAAC AGTATAGTTC ACACGTTCAA CAGGTGAATA CAATGAATCA ACAGGGATTA 3060
CACCAATTGG TAAATCACTA GTATTATTTT GTTCTGCTAA TGCGTAACCT CTACCCTTGT 3120
TAGCAACTAG ACGAATTTTT AAGTGACCAC CTTTAGATAC TGTTGCAATT TTAAGCTCTG 3180
GGTTTAAAAT TTCAACATCA CTATCATGTG TAATGTCGCT TGCTGTTACT TCGCCTTCAT 3240
CACGTACATC AATTTCTAAA GTTTTATCTT CTTCAGAGTA AATTTTCAAT GCTAATTGTT 3300
TAATGTTCAT AATAATTGTA GAAACATCTT CAACTACATT GTCTACTGCT GAGAATTCAT 3360
GTAAAACTCC CTCAATTTCA ATATACTTAA cGGCTGCACC TGGTAATGAA GATAGTAGGA 3420
TACGACGTAA GGAGTTTCCT AGTGTAGTAC CGTAGCCACG TTCTAGTGGT TCAACAACGA 3480
ACTTACCGAA TTTAGCATCT TCACTAATTT CAATTGTCTC AATTCTAGGT TTTTCGATTT 3540
CTATCATTTA AATATCCTCC TTATATACGT CGACTTAATT TAAAATGTTT GCTCAGTGAC 3600
CTGTAACAAT ACCATCATAA ATTATACACG ACGACGTTTT GGTGGACGAC AACCGTTATG 3660
AGGTACTGGA GTAACGTCTC TGATCGCAGT TACTTCTAAA CCTGCAGATT GTAATGCACG 3720
AATAGCTGAT TCACGACCTG GACCAGGTCC TTTAACTGTT ACTTCAACTG TTTTTAAACC 3780
ATGCTCCATA GCTGATTTAG ATGCAGTTTC AGAAGCCATT TGTGCTGCAA ATGGTGTTGA 3840
TTTTTTAGAT CCTTTGAATC CTAATGCACC AGCTGATGAC CATGATAAAG CATTACCGAA 3900
CTCATCAGTG ATAGTTACAA TAGTGTTGTT GAATGTTGAA CGGATGTGTG CTACACCATT 3960
TTCAATATTC TTTTTCACTC TACGTTTACG AGATACTTGT TTACGTGCCA TTTAAAATTT 4020
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CGCGCGTkGT TTTTCGTTTT TTGACCACGA ACTGGTAAAC CACGACGGTG ACGGATACCc " 414 

ACGGTATGAT GAAATTnCCA TTAAAcGTTT GATATTTAAG TTAGTTTCAC GACGTAAGTC 4200

ACCTTCGACT TTATAACCGT CTACAACTTC ACGGATGCGA CCTAATTCGT CATCAGTTAA 4260

ATCTTTCACA CGAGTATCAG CTGATACGTT AGCTTCTTCA AGAATTTTTT GAGCAGTTGA 4320

CGTACCGATA CCGTATATAT AAGTTAATGA GATAACTACG CGTTTTTCAC GTGGAATATC 4380

TACTCCTGCA ATACGTGCCA TATTAATTTA CACCTCTCTT TTATTAACCT TGTCTTTGTT 4440

TGTGTTTTGG ATTTTCACAA ATTACCATTA CTTTACCTTT ACGTTTAATG ACTTTACATT 4500

TTTCGCAAAT AGGTTTTACT GATGGTCTTA CTTTCATTTT TATACCTCCC TATATTATGG 4560

AGTGACGATT ATTTATAACG ATAAGTAATT CTTCCGCGTG TTAAATCGTA CGGAGACATC 4620

TCAACAGTTA CTTTGTCGCC AGGTAGAATA CGAATGTAAT TCATTCTGAT TTTACCACTT 4680

ACGTGAGCnA AAATCTCATG ACCATTTTCT AATTCTACTT TAAACATTGC GTTCGGTAAA 4740

GTATCTAATA CAGTACCTTC TAATTCAATT ACATCTTGTT TAGCCATTGA TTAACTTCCC 4800

CCTTTTTGCA ATAGTAAGGT AATCGTCAAT AGACAACTTT ATTGTTACGA ATCTATCAGT 4860

GATTAATTTT ATAAGTTAAA CAAAAATTAC GGGAATTAAT TATCGTTAAT TGCCACTCTC 4920

ATCTATCTAA TATGATTAAA TCATGCCTCA CTTAAAATAG ACCGCTAAAA GTTGATCTAT 4980 

TACAAATGAT CTAAAATATC AATGACATCT TTGGTAACGT CGCTAATATC TTTTGAACCA 5040 

TCAATATTTT TCAATACACC TTTTTGATCA TAGAAATCTA AAATAGGCTT AGATTGTTTA 5100 

ATATTAACAC TCAAACGATT AGCTACCGTT TCAGGATTAT CATCTTCTCG TTGATACAAT 5160 

TTACCACCAT CGATATCACA AATACCTTCG ACTTCGGAGG ATTAAATACA AGATGATACG 5220

TTGTACCACA TGACTCACAG ATTCGACGAC CTGTAAGACG GTTCATTAAT TCTTCTTCCG 5280

gaacTtcgat attgatgaca gcatcaatgt ttctgtcaag ctcagacata atattattta 

ATGCCTCAGC TTGCTCGATT GTTCTTGGGA AGCCATCTAA TAAAAAGCCT TTTTTTGCAT

CGTCTTCAGA AATTCTTTCC TTAACGATAC CTACAGTCAC TTCATCAGGA ACTAATTCGC

CACGGTCCAT ATAAGACTTA GCTTCTTTAC CTAATTCAGT TTCTTCTTTT ATAGCTTTTC 5520

TGAACATGTC ACCAGTTGAA ATGTGGGGTA TTGGGAATTT CTTGaCAATT TCACTTGCTT 5580 

GAGTTCCTTT ACCTGCGCCA GGTAAACCCA TCAAAATGAT ATTCATAAGT GCCCTCCTAA 5640 

AATTATCTAC CACCAAAGCC TTTATATTCT TTTTGAGATA CTTGAGCTTC TAAAGATTTC 5700
ATTGTTTCAA TCGCTACACC AATAACGATA AGTAAACTTG TACCACCAAT CTGAATTGAT 
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5460 



5760 



TGTGGTAATC CCATAAACTT AGTTGCTAAT ATCGGTAGAA TTGAAATAAC GGCTAAGAAG 5820 
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CCAGGTCTAA TACCTGGAAC ATAGCTACCT TGTTTCTTAA GGTTATCAGC CATTTTTTCC 594 0 

GGATTAACTT GTACAAATGC ATAGAAGTAT GTGAATAGTA TAATTAGTAC AATATATACA 6000 

ACCATACCAA CATTACTTGA AGGATTTGCA GCATTCGCAA TGTTTTGTGC CCATTCTTTA 6060

TCTGGATAGA ACAACGTTAA TGTTCTAGGC AGTAAGAAGA ACGCCATTGC AAAGATTACA 6120 

GGAATAACAC CGGCTGAGTT CACTTTTAAA GGTAGATAAG TTGCCTGTGA ACCTAATCTT 6180

TGAGCAGTTT GTTTCTTAGC ATATTGAATC GGAATTTTAC GAACGGCTTC AAGTACATAA 6240

ATAGCACCTA CAGTTAATAG TATCAGTGAC ACTAAAAGTC CTAATACTTT CAACCATGCT 6300

AATGATGTAT CTTCTTGCCC AACGAACGCA tTTGTcCAAA TTGAATTAGA CTGGCTGGCA 6360 

ACGTTGATAA AATACCCGCA AATATGATAA TAGAAATACC ATTACCAACA CCGAACTGAG 6420 

TGATTTGATC ACCAAGCCAT ATTAAGAAAG CAGTTCCTGC TGTnCAAAAC TAGTGCTATT 6480

AATAAATAAC TCATAATTGA CTGATTGATA ATCAGCGCAC CTTTGAGATA ATTATTAAAT 6540

TGGAATGCCA TACCTATAGA TTGGATAAAT GCTAAAGAAA TTGCTAAATA ACGAGTAACG 6600 

TTATTTAACT TTCTTCTACC TACTTCACCT TGTTTTGCCC ATTCTGAGAA TTTAGGGACA 6660 

ATATCCATTT GTAATAATTG CATTACGATT GATGCAGTGA TGTAGGGTAC AATACCCATT 6720

GCAAAAATAG AAAATCGTTT CAAGGCTCCG CCACCAAAAG TATTTAATAA CTCAGTGGCA 6780 

CCTTGAGAAC CTTGGGGATT ATCAAAAGCT GCAGGATTTA CTCCTGGAGC TGGTATATAA 6840 

GTCCCTATTT TAAAAATTAC TAACATTGCT AGTGTGAAGA AAATCTTGTT ACGAACCTCT 6900

TTTGTTCTAA AGAAGTTCAC AAGGGTTTGA ATCATTAGAT CACCTCGTGT GCTCCACCTT 6960

TAGCATCAAT AGCTTCTGCT GCTGAAGCTG AGAATTTATG AGCTTTCACT GTCAATTTCT 7020 

TATCAAGTGA ACCATTACCT AGTATTTTGA TACCAGATTT TTCATTCTTA ACAACACCAG 7080 

ATTCJACTAA TAAAGCTGGA GTTACTTCAG TACCATCTTC AAATTTATTA AGTTGGTCTA 7140

AGTTAACAAT AGCATATTCT TTACGATTTA TGTTAGTAAA ACCACGTTTT GGTAAACGAC 7200

GGAATAATGG TAATTGACCA CCTTCAAATC CTGGTCTTAC ACCACCGCCT GAACGAGCTT 7260 

TTTGACCTTT GTGTCCGCGA CCACTTGTTT TACCGTTACC TGTCGCAACA CCACGTCCAA 7320 

CACGATTGCG TTCTTTACGT GAACCTTCTG CCGGTTTTAA CTCATGTAAT TTCATTTCGG 7380

CACCTCCTTG ATTATTTTTC TTCTACTGTT ACTAAGTGCT TAACTTTGTT GATTTGCCCA 7440 

CGAATAGCAG GGTTATCTTC AACAACTACT GAACTGTTAG TCTTTTTAAG ACCTAAAGCT 7500 

TCAACAGTTT TACGTTGTGT TTCAGGACGA CCAATAACAC TACGAGTGAG GGTAATTTGT 7560 

AATTTAGCCA TAACTAGTTT TCCCTCCTTA ATTGTATAAT TCTTCTACTG TTTTGCCACG 7620 
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CATGTTGATT 
GGTGTGTTTG ATCCTAATGA TTTACTTAAG ATATCAGTGA TACCTGCTAA 7740
TTCAAGTACG GCACGAACAG GACCACCAGC GATAACTCCT GTACCAGGTG CAGCCGGTTT 7800
CATAAATACG CTTCCTGAAC CGTAACGGCC AGTAATTGTG TGTGGAGTTG TACCTTCAAC 7860
ACGTGGAACA ACTACTAAAT CTTTTTTAGC TGCTTCAACA GCTTTTTTGA TTGCTTCTGG 7920
TACCTCTTGA GCTTTACCAG TACCGAAACC TACACGACCA TTTTTGTCTC CAACTACAAC 7980
TAATGCAGTG AAACGGAAAC GACGACCACC TTTTACAACT TTTGCTACAC GGTTGATTGT 8040
AACAACGCGT TCTTCAAATT CTTTCGTCTC TTCTTCtCTA CGAGCCATGT ATTTGTCCCT 8100
CCTTTAAATT AAAATTCTAA TCCGCTTTCT CTTGCTGCTT CAGCTAATGC TTTAAcACGT 8160
CCGTGATATA AATATCCTCC ACGGTCAAAT ACGATTTCTT TAATGCCTTT GTCAGCAGCT 8220
TTTTTAGCAA TTGCTTCACC GACTTTAGTT GCTAATTCAA CTTTAGTTGC TGTAGTAGCA 8280
ATGTCGCTGT CTTTTGAAGA AGCTTGAGCT AATGTTACGC CTTTATTATC ATCAATAATT 8340
TGAGCGTAGA TATGCTTGTT TGAACGATAT ACGTTTAAAC GTGGCTTTTC AGCTGTACCT 8400
GATAAGTTAG TACGAACACG AGCATGTCTT TTTAAACGCA CTTTATTTTT ATCAATTTTA 8460
CTGATCATTT CAATACTCCT TTCTTTAGAG TTTATCTATT ATTTACCAGT TTTACCTTCT 8520
TTACGGCGAA CGTATTCACC TTGGTAACGA ATACCTTTAC CTTTGTAAGG CTCTGGAGGT 8580
CTTACTGAAC GGATGTTAGA TGCTAATGCT CCAACTTGTT CTTTTGAAAT ACCTTCAACT 8640
TTAACGACTG TGTTTTTCTC AACTGAGAAA GTAATGTTTT CTTCAGCTTT AATTTCTACT 8700
GGGTGAGAAT AACCAACGTT AAGGATTAAG TCTTTACCTT GCATTTGAGC ACGGTAACCT 8760
ACACCAACAA GTTCAAGTAC TTTTACGTAT CCTTGAGAAA CACCTTGTAC CATATTGTTT 8820
AATAAAGCAC GAGTTGTACC ATGGTTTGTT CTATCTTCTT TAGAATCAGA TGGTCTTACA 8880
ACTTeAATTG TGTTTTCTTC TTGTTTGAAT GTCATTCTTT CATTTAAAGT TCTTGATAAT 8940
TCACCTTTAG GACCTTTAAC AGTTACATGA TTTCCATCAA AAGTTACTGT TACGTCACTA 9000
GGGATGTCAA TAATTTTCTT ACCAACACGA CTCATGTTAT GGCACCTCCT TATTTTTTAT 9060
TACCAAACGT ATGCGATAAT TTCTCCACCA ACATTACGTT TTCTTGCTTC TTTGTCAGTG 9120
ATTACACCTT CAGAAGTTGA TACTAATGCA ATACCTAAAC CATTTAATAC TTTAGGCATT 9180
TCGCTAGCTT TTGCATAAAC ACGTAAACCT GGTTTTGAAA TACGTTTTAA TCCTGTGATA 9240
ACACGCTCAT CGTTTTGACC ATATTTTAAG AATAAACGAA GTACACCTTG TTTATCATCT 9300
TCTACGTATT CAACATTTTT AATGAAACCT TCACTCTTTA AGATTTCAGC AATTTCTTTT 9360
TTAATATTTG ATGCAGGTAA TTCTAACTTC TCGTGACGCA CCATGTTTGC GTTTCTTACA 9420
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TCTTTTTTAT TACCA GCTAG CTTTACGAAC GCCAGGGATT TGGCCTTTGT AAGCTAATTC 9540 

ACGGAAACAA ATACGGCATA ATTTAAATTT ACGATATACA GAATGTGGAC GGCCACAACG 9600 

TTCACAACGA GTGTATTCAC GAACTGCATA TTTTTGTTTT TTTTGTTGCT TAGCAACCAT 9660

TGAAGTTTTA GCCACTTAAT TAGCCTCCTT TAAATAATTA TTTACGGAAT GGCATACCGA 9720

AGTTAGCTAA CAATTCACGA GCTTCTTCAT CAGTGTTAGC AGTCGTTACG ATAACAATAT 9780

CCATTCCTCT AACTTTACTT ACTTTATCAT AGTCGATTTC TGGGAAAATT AATTGTTCTT 9840

TAACACCTAA AGTGTAGTTA CCGCGTCCGT CAAATGCTTT TTTAGAAACA CCTTGGAAGT 9900

CACGTACACG TGGTaATGAT ACTGAAATTA ATTTGTCTAA GAATTCATAC ATTCTTTCAC 9960

CGCGAAGTGT TACTTTCGCA CCGATTGGCA TACCTTCACG TAAACGGAAA GTCGCGATTG 10020 

aTTTTTTAGC TTTAGTTACT AATGGtTTTT GACCAGTGAT CAATTCTAAT TCTTCAACAG 10080 

CATTGTCTAA TACTTTAGAA TTTTGTACTG CGTCACCTAC ACCCATGTTC ACAACGATTT 10140 

TATCTATTTT TGGTACTTCC ATTACTGAAC TATAATTGAA TTTTTTCATT AAGTTTTCAG 10200 

TAACTTCAGT GTTaAACTTT TCtTTTaAAC GGTTCaAAGT GGGATCCTCC TTTCaACTTG 10260

TtATTAATTA TTAGAkTTAA TTTCTTCGCC AGATTTTTTA GCGATACGAA CTTTTTTACC 10320

ATCAACAAAT TTGTAACCTA CACGAGTTGG TTCGTTTGTT TTAGGGTCCA ATAATTGTAC 10380

ATTAGAAACA TGGATTGCTG CCTCTGTTTC TAAGATTCCA CCTTCAGGAT TTAATTGAGT 10440

TGGTTTTTGG TGTTTTTTCA TAATGTTAAC ACCTTCCACA ACGACACGGT CTTTTTTAGG 10500

TAGAGTAGCA ATTACTTTAC CTTCTTTACC TTTGTCTTTA CCTGCGATAA CTTTAACGTT 10560

GTCACCTTTT TTGATATGCA TGTGGGCACC TCCTTATTTG TATTGGTTGT TATTAATTAA 10620

AGTACTTCTG GTGCTAATGA TACGATTTTC ATGAAGTTAC CTTCACGTAA TTCACGAGCA 10680

ACAQGTCCGA AGATACGAGT ACCACGTGGG CCTTTGTCAT CACGGATGAT AACACATGCA 10740 

TTTTCATCAA ATTTGATGTA TGAACCGTCA TTACGACGAA CACCTGACTT AGTACGTACG 10800 

ATTACAGCTT TGACAACGTC ACCTTTTTTA ACAACGCCAC CTGGTGTTGC ATTTTTAACA 10860 

GTACATACGA TAACATCGCC GATGTTTGCT GTTTTACGAC CAGATCCACC TAATACTTTG 10920 

ATTGTAAGAA CTTCACGAGC ACCAGAGTTG TCTGCTACTT TCAAGCGTGT TTCTTGTTGG 10980 

ATCATTAGTT AAACCTCCCT TATCTCTAAA CTTGTATTAA ATAATTACTG ACTCTTCAAC 11040 

AATCTCTACT AAACGAAAAC GTTTTGTTGC TGATAAAGGA CGAGTTTCTT GAATTTTAAC 11100 

AATGTCTCCT AATTTAGCTG AATTGTTTTC ATCATGAGTT TTGTATTTTT TAGAGTATTT 11160 

TACTCGTTTA CCGTATAATT TGTGTGTTTT GTAAGTTTCA ACAAGTACTG TAATAGTCTT 11220
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TTTTGTAACC TCCTCTTACT TAATTATTGA TTAGCCTTAC TTTGTTCAAT TTCTCTTTCA 
CGAGCAACAG TTTTTAGACG TGCAATCGTT TTTCTTACTG TACGAATACG TGCAGTTTCT 
TCTAATTGAC CTGTAGCTAA CTGAAAGCGT AGGTTAAAAA GCTCTTCTTT TGAAGATTTG

ATTTGTTCTT CGATTTCTGA AGTGGTTAAG TCTCTAATTT CCTTAGCTTT CATTTGTTTC 11520 

ACCACCCAAT TCCTCACGTT TTACAAACTT AGTTTTTACT GGAAGTTTGT GACTTGCTAA 11580

ACGTAGTGcT TCACGCGCAA CTTCTTCAGA AACGCCAGCA ACTTCGAATA AAATTCTACC 11640 

TGGTTTAACA ACTGCGATCC AGCCTTCAAC CGCACCTTTA CCAGCACCCA TACGTACTTC 11700 

TAAAGGTTTT TTAGTATATG GTGTATGTGG GAAGATTTTA ATCCAAACTT TCCCGCCACG 11760

TTTCATGTAA CGTGTCATTG CTATACGAGC AGATTCGATT TGACGAGATG TGATCCAAGA 11820

CGTTGTTGTA GCTTGTAAAC CAAACTCACC AAATGTTACG TAtTACCGCC TTTAGAACGA 11880

CCAGTTGTTT TAGGACGATG TTGACGACGA TATTTTACAC GTTTTGGTAG TAACATTATT 11940

ATTTTCCTCC TCCACTAGTG TTCTTAGTAG GAAGAACTTC TCCACGATAA ATCCATACTT 12000

TAACGCCTAA TTTACCGTAA GTAGTGTCAG CTTCAGCGTG tGCATAATCG ATGTCAGCAC 12060 

GTAACGTATG AAGTGGAACA GTTCCTTCTG AATATTGTTC AGCACGAGCG ATGTCAGCTC 12120 

CGCCTAAACG ACCAGATACT TGaGTTTTGA TACCTTTAGC ACCAAGTTTC ATAGCTCTAG 12180 

TGATTGCTTG TTTTTGTACA CGACGGAATG AAGCACGGTT TTCTAATTGA CGTGCGATGT 12240 

TTTCAGCTAC TAAACGAGCG TCAAGATCAA CTTTTTTGAT TTCAATTACG TTGATGTGTA 12300 

CTTTTTTATC AGTTAACGCA TTTAATTTGT TGCGTAATTT TTCGATTTCT GAACCGCCTT 12360

TACCAATTAC CATACCAGGT TTACCAGTAT GAATTGCAAT GTTGATACGG TTTGCAGCAC 12420 

GTTCAATCTC TACGTGAGAA ACTGATGCTT CTTTTAATTC ATTATCAATA AATTTACGGA 12480 

TTTTTAAATG TTCGTGTAAA AGTGAAGCGA AGTCTTTTTC AGCATACCfiT ITAGCTTCCC 12540

AATCACGGAT AATACCAACA CGAAGTCCGA TTGGATTAAT TTTTTGACCC ACAGTATTCC 12600 

CTCCTTAAAA GTTAATTAAG CTTCTTTAGC TTCTTCTTTA CCGTCACTTA CGACGATTGT 12660 

AATGTGGCTT GTACGTTTGT TAATCGCACT TGCACGACCT TGCGCACGTG GACGGAAACG 12720 

TTTTAATGTT GGTCCTTCGT TAGCATATGC TTCTTTAACT ACTAATTCAT CTGTGTTCAT 12780

GTCATAGTTA TGTTCAGCAT TAGCTAAAGC GGACATTAAT ACTTTTTCAA TTACTGGTGA 12840

TGAAGCTTTG TTTGTTAATT TTAAAATTGC AATAGCTTCA GCAGCATTTT TACCTCTGAT 12900

TAAGTCAAGA ACTAGTCTTA CTTTACGAGG TGCGATTCTT ATTGTTCTAG CAACCGCTTT 12960

TGCTTCCATT AGGATGTCCT CCTCTACTTA ATAGATATTA TCTTCTTGTT TTCTTGTCGT 13020 
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TATCTTCAGT TACATATACA 
GGTACGTGTT TACGTCCGTC GTATACTGCA AAAGTATGTC 13140
CGATGAAATT AGGGAAAATT GTAGAACGAC GTGACCATGT TTTGATTACT TGTTTCTTTT 13200
CGCTTCCTTC TTGAGCTTCA ACTTTTTTCA TTAAATGCTC ATCGACGAAA GGTCCTTTTT 13260
TAATACTACG AGCCATTTGG GCGCCTCCCT TCTTATTATG TGCGTGCAGC TTTAAGCCGC 13320
ACACCCAAAT AAGTTGATTA TATTATTTTT TCTTACGTCC ACGAACGATA AGTTTGTCTG 13380
ATGATTTTTT ACCACGACGA GTTTTCTTAC CAAGCGTAGG TTTACCCCAT GGTGACATTG 13440
GAGATGGTCT ACCGATAGGA GCACGACCTT CACCACCACC GTGTGGGTGA TCGTTAGGGT 13500
TCATTACAGA ACCACGAACT GTTGGACGGA TACCTTTCCA TCTTGAACGT CCGGCTTTAC 13560
CAACGTTAAC TAATTCGTGT TGTAGGTTAC CAACTTGACC GATTGTAGCA CGGCAAGTAG 13620
ATAAGATCAT ACGAACTTCA CCAGATCTTA ATCTGATTAA TACGTATTTA CCTTCTTTAC 13680
CAAGTACTTG AGCACTTGCA CCAGCTGAAC GAGCGATTTG TCCACCTTTA CCAGGTTTAA 13740
GCTCGATGTT GTGTACTACT GTACCAACTG GAATGTTTTG TAATGGTAAT GCGTTACCAA 13800
CTTTGATGTC AGCTTCAGCA CCACTTTCAA CGATTTGACC TACTTCTAAT CCTTTAGGAG 13860
CAATGATATA TCGTTTTTCA CCGTCTGCAT ATACAACTAA AGCGATGTTT GCTGAGCGGT 13920
TTGGATCATA TTGAATAGAA TCAACTTTTG CATTGATACC ATCTTTGTTA CGTTTGAAAT 13980
CGATAACACG GTATTGACGT TTGTGTCCAC CACCATGGTG TCTTACAGTC AATTTACCTT 14040
GGTTGTTACG TCCCGCTTTT TTCGGTAGCG GTTTTAATAA TGACTTTTCA GGTGTAGTTT 14100
TCGTGATTTC TGCGAAATCT AACGAaGTCA TATTACGACG ACCATTTGTT ATTGGCTTAT 14160
ACTTTTTAAT AGCCATTGTC GCTTACCTCC TTAATGGTAA TTGTTTTATT AGTTAAATAA 14220
GTCGATTGAT CCTTCTTTAA GAGTTACAAT CGCTTTTCTT CTTTTGTTTG TATAGCCTTG 14280
GTAACGGCCC ATACGTTTTT TCTTAGGTTT GTAATTCATG ATATTAACAC TTGCAACTTT 14340
TACGTTGAAG ATTTCTTCAA CTGCCATTTT TACTTGTGTT TTGTTAACAC GAGTATCAAC 14400
GTCGAAAGTG TATTTGTCTT CAGCCATTGC TTCAGAAGAT TTCTCAGTGA TTACGGGGCG 14460
CTTAAGAATA TCTCTTGCTT CCATTATCCG AGCACCTCCT CAACTTTTTT AGCAGCAGCT 14520
TCAGTAATTA CTAAGCTGTC AGCATTAGTG ATATCTAAAA CATTTAAACC TTGAGCAGTT 14580
GTCACTTGAA CGCCAGGGAT GTTGCGTGCT GATAATTCAA CATTTACATC TTCGTTTTCA 14640
GTAACTACTA ATACTTTTTT AGGTTGTTCT AATGTAGATA ATACATTTTT GAATTCTTTA 14700
GTTTTTGGAG CTTCGAAGTT GAATGCGTCA ACTACAGTTA AGCCATTCTC TTGAGCTTTG 14760
AAAGATAATG CTGAGCGTAA AGCTAAACGA CGCATTTTCT TAGGCATTTT GTATGCATAA 14820
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CCTTGACGAG CACGACCTGT TCC T TTTTGC TTCCATGGTT TACGTCCGCC ACCGCTTACT 
GCTGAACGAT TCTTAACAGC ATGCGTACCT TGACGTAATG AAGCACGTTG TAAATTAATA 
GCTTCGAATA AAACGCTATT ATTTGGCTCA ATACCGAATA CTGCATCGCT TAATTCGATT 
GAACCTGATT TAGTTCCGTC TAATTTTAAA ACATCATAAT TAGCCATTAT GCATTTCCTC
CTTTCACTTC TTATTATTTA TTACCTTTTT TAATTGAAGT TCTGATTTCT ACTAAACCTT 
TTTTAGGTCC AGGTACGTTA CCTTTTACTA AGATAACTTT GTTTTCTGTG TCAACTTGAA 
CTACTTCTAA GTTTTGAACA GTTACAGTGT TTCCACCCAT ACGTCCTGGC ATTTTTTGGC 
CTTTAAATAC TCTAGAAGCA TCTGAAGCCA TACCTACAGA ACCTGGTGCT CTGTGGAAAT 
GAGAACCGTG TGACATAGGT CCACGAGATT GTCCGTGGCG TTTAATTGCA CCTTGGAAAC 
CTTTACCTTT TGATACGCCT GTTACGTCAA TAACGTCGCC AGCTACAAAA GTATCTACTG 
AGACTTCTTG AaCCTAcTcG TAAGCATCCA CGTCTACATT GCGGAATTCA CGAATGAAGC 
GCTTAGGTGC TGCGTCAGCT TTTTTAGCGT GACCTTCAGC TGGTTTATTA GCATATTTAT
TAGATTTTGC ATCTTTTTTG TATGCTTTTT TGTCTTCAAA TCCAACTTGG ATTGCGTTGT
ATCCATCAAC TTCTACAGTT TTCTTTTGTA ATACAACATT TTCTTTAGCT TCTACTACTG 
TTACAGGGAT TAATTCACCG TTTTCTCCGA ATACTTGTGT CATCCCAATT TTTCTTCCTA 
AGATTCCTTT GGTCATCGAA AGTCCACCTC CTAAAATTGT CTATTATAAT TTGATTTCGA 
TGTCTACACC AGATGGTAAG TTTAAGCCCA TTAAAGCGTC AACTGTTTTT GGTGTTGGGT 
TTACAATATC GATTAAACGT TTGTGTGTAC GTTGTTCGAA TTGTTCACGT GAATCTTTAT 
ACTTATGCAC GGCACGGATG ATTGTGTAAA CTGATTTCTC AGTTGGTAAC GGAATTGGTC 
CAGAAACATC TGCACCAGAA CGTTTCGCTG TTTCTACAAT CTTCTCTGCT GATTGATCAA
TTACGCGGTG ATCATAAGCT TTTAATCTGA CTCTCATTTT TTGTTTTGCC ATAATTTTCC
CTCCTTATTC GTCTACATTT AGTGATAGAC TTCTCCACGA AAACTATCTT ACACAGCGCC
ATGGCAAAGC GGCCGGGTGT GTCAGTAACC TTTCGCTTCA TCGCTTTTCT TAAAGTCCAA
CGTTAGTTAT ATTACACGAA AAACATCGAT AAATCAAGGC TTTTCACATA ATTTTTCTAT
CTGTCTAACA CATACTTTTA TATTTnACTT TATATACTTA GTCAGTTCAA CTATTTTCGA
GATATTTTnA ATTTCCn

(2) INFORMATION FOR SEQ ID NO: 206:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29555 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double



14940 

15000 

15060 

15120 
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15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16397 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 
TnAGTTGTTT CTGCCACGAA AGATTCAATG GCTTTTCTTG CTTTACGCTT TTCTTTCAAT 
GGCAAATCAC CAATCATTTT TTTAAGGTGA TGTGGGTTTA CAACACCACT ATACTGGTAG 
TCATTTGAAn TTGTTTTTAG GGCTTGTTCA TCGATAGATC TCTCTCCAGC AAATCCTTTG 
AACTCCGCTT CTTTTTTAAT ACTTTCGAAA TTAACATATT CTTGATCGAT ATCATCATCC 
TTATTTAAAG AAGGTACAAC ATTGTCGATG AATTCTCTAA TTAGATCTCG TTTTAACCTC 
AATGtCGGAT CATCTGCATG ATCTAAAATG CGTCTAATTT GTTCTTGGTT ACGACGTTGT 
TCCGCTTTGT CTTCAAGATC AATTTGTCTC AATATATTCA TAATATAATT CACATTAATC 
GTATCATTAC GCATCATTTC TATTTCGAAA TCAATATCAT TTAAAATGGA TACTTTATTT
TTCTCAGCCG TCGCTCTTTT TACTTGATCG TACACAGCTA AATATTTACT TTTATAGTCT 
TCATTCTCTT GTTCATCCAT TCCAATTTCA TCAATTGTAA ACTCAAACTC GTCAAATGCT 
TTTAAACGTA ATATTATTTT AGCTAATAAA CGATAAGCtT CAACAAAGCG CTTTAGCTCT 
TCTTCATCtT GaATGtCATC AACCATGTGT GGTGTCGGCA CAATCATTTT AAGCTCACGA
TAAGCGTCCA TAAATTCTTT TTTATACTCT TCATAACTGC GCATTAAAAT TGTATCCGTA
TCATTTGTTT GTGAGAATAC TCTCAGTGCA TCGTCTGTCT CTTTTTTCAA GTCACGATAG 
TTTACAATTT TACCAAATGG CTTTGATTCT TTTTCAACCC TATTTGTACG TGAATACGCT 
TGAATTAAAT CATGATACAT TAAATTCTTA TCAACATATA AAGTGTTCAG TACTTTACTA 
TCAAAACCAG TTAAGAACAT ATTAACAACG ATTAAGATAT CAATTTTACT ATCTTTAACG 
CCCTTTTTAA CGTTTTTtGA AATATGATTA AAATACTCAT TAGTTGtGGC TGnTGaAAAA 
TTCGTCTCGA ACTTTTTATT ATAATCACTA ATCATTATCT CTAATTTTTC ACGTGAATGA 
TATGGCACTT CACCATCACG ATCATCTTCA TTAGGTTTAA ACGTAAATAT ACCAGCTATC 
GTTAACGGTT GTTCCAACTT TTTGTTAAGT CGCTTAAATG TCTCATAATA TTTAATAAGC
GCGTGAATAC TTTGGACTGT AAATATACTT GAATATTGAC GATTACGTGT ATATTTATCA 
TGATTATTGA TGATATGTCG TGTTACTAAT TCCACACGTT TATCCGCTAA CCATACTTCT 
TCCGTATCAA TTGCTTCAAC CATGCtGTTA TCTTCTGCTT TTAAAGCTTT ATTTTTAAAA
GTATTAATAT AGTCAACTGA GAAACCAAGT ACATTACCAT CATGAATGGC ATCTCTAATT 
AAATACGTAT GTAAGCATCT ACCGAAAATA TCTGCAGTTG TTCTACCATC TTGACTACTA 
TTTTCTGGAA AACGTGGCGT ACCAGTGAAT CCAAAGTATT GGGCATTTTT GAAATGTTGT



60 
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1140 
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ACTTTATTCG TTTTATACTG TTCTAATAAA GGGGCATTCC CTTGAATCGC TTTAGCCATT 
TTTTGAATCG TCGTTACAAT AAGTGGCAAA CTTTTATCAT TTAGTTGGCG TACCAGTTGC
GAGGTATTAA AAGTTTTGTC TACAGCACCC TTAGCAAATT TATTAAATTC CTCTTCTGTT 
TGACTATCCA AGTCTTTACG GTCAACCAAA AAGATAACTT TCTTAATGTC ATCTTGCTGT 
GATAAAATCT GACTCGCTTT AAAAGAAGTC AACGTCTTAC CACTTCCAGT TGTATGCCAT 
ACATATCCAT TATTCCCTGT CTCAGTCGCT TGTTGAATAA GTGCTTCTAC CGCATACACT 
TGATACGGAC GCATTGCCAT CAGTATTCTA TCTGTTTCAT TAATAATCAT ATAGCGCGAT 2100
ATCATCTTAG CTAATTGACA AGGTCTCATA AATGACTCAG CAAACGATTG CAATGTATTG 2160
ATACGGTTAT TCTGTTTATC ACTCCAATAA AACATGTGAC TCTTCAATAG TTCGCTATCA 2220
TTATTAGAAA AGTATCGCGT TTCAACACCA TTACTAATGA TAAACATTTG TATGTAGCGG 2280
AATAAGCCTG TGTAATTTTG TTTGCGGTAA CGTTTTACTT GGTTAAACGC CTCATTAATA 2340
TCAATACCTC GACGTTTCAA TTCAACTTGG ACAAGGGGTA GTCCGTTGAT TAATATCGTT 2400
ACATCATAAC GTGCTTTATA TGTATCCTCG ACAGATACTT GATTCGTCAC TTGAAACTTA 2460
TTTTTACACC AACTTTTCGT ATCTAAAAAC GACAAATAAA TCTCAGACTC ATCATCACGT 2520
CTAAGTGGTA ATTTATCACG TAAAATACGG GCACTCTCGA AAATACTTTT TCCATCAATC 2580
ATCGTTAACA GACGTTGaAA TTCTTTATCT GTTAaGGGAT TGCCTTCTAA TTTGTCCGCA 2640
TGACGCTCAT TTAAAATCGT TCTAAAATTA TCAAGCAATT GCTTATTATC ACGTATCGTT 2700
ACTCTTTCGT AACCCAATTG TTCAAGTTGA TTCATCATTT CATTTTCTAA TGCGTATTCA 2760
CTTTGGTATG CCATTCATAT CCCCTTCCAT ACACTTTCTA TTGCTCTAAA TATATCATAA 2820
ACTTTAATGA AAAATGTTTG TTTTTTATCT TCAAACGTAA ATTTATTCTA ATTTTATTGT 2880
CITj^CTTTT" AATATTTGTC TTTCA AAAATTTGAA TACAAATAAT 2940
CAAATCATTG ATAAATTTTT TGTCTACGAT TAATGGAGGG ACTTGAATGG TGTTAATTAC 3000
CTATCAAATC ATTTTATTTT TTATTATTAG TCTAAGTTAC TATTTAACTT TAAATCATTA 3060
CATGGCAGTC ACTGTAGGTA ACTTCACTTC AATATTCGGC ATGTTCGCAG CCATACTCTT 3120
TATGTACTAC TACCTACTCT ATAAAAGTCC CGAATACAAT CAACGCAAAC GATTTAAACA 3180
TTTCATTCAT ATCACTAATT TGATAATAAT TGCTTTTAGC ACCTTCGTAT TAGTTCATTT 3240
AGCATTAAAA TTATTCTTCA GCATTTAATT TCCATCTATG AAAAAAGCAA AGCTCAAATC 3300
TGAACTTTGC TTTAATTTGT CACGCCTTTA TCATTTTCAA AATAGCCTCT ATGCCAGTTT 3360
TACAAACTTG TAGCAACAAT TTTTCATCAA GCAACTGAAT CACATCAAAA ACTTCAATTG 3420






EP 0 786 519 A2

GTCGCAAGAT GCTTCCTGTA ATTATCAAGT GCCATTTTCG ATTGGGTTAT ACAATCTAGA 3540

ATCGCATGAT AATTTAATGC TACAAATCGA TAGTACAATA TATCTACCGT GAATAACTGT 3600 

GCAAATAGTG ACGTTGTAGC CGCCATACGC ATTTCATTTT CATCAGTTCT GCCATAAATC 3660 

AATGCATAGT CTGCAATTTG AGCCACTGGA TTATTAGCTG TACTAGATAT AGTTATGATG 3720 

GGAATACTGT AATGTGTGGC CACCTGTGCA ATTGACTGCA ATTCACTATG ACTACCTTGA 3780 

TTCGTCACAA AAATCATGCA ATCTCTATCA TCATGCGTCG CAAATGTTGA CACAAGTAAA 3840

TGCGTTTCAT GTAATAACCT GACATTTAAG CCAATACGAG ATAACTTTTG AAAAAGATCA 3900

CCAATAGTCA AACTCGATGC GCCAAATCCA AATAAAAATA TTGTCCTGGc ATTTTTCAAC 3960

ACATCACAAA TTGCATCAAT TTGCGCATCC ATAATATTAG TAGCTACAAA TCGCATCGTA 4020

TTCGTTGCTC TAGCAATCAT TTTATTTTTC AAAGTTTCTA CAGATTCATT TTCAATCAAT 4080

TCTAAATGTG GATTGGTTGC AATATCTTCG GGTAAGTATC GAGATATCGC AATCTTTAGC 4140

TCTTGAAAAC CTTGATGTGT CATTTTCCGA CTAAATCTAA CAATTGATGC TGTACTAACA 4200

TTCGTAACAT CTGCCAAATC ATTCACAGTC ATATCAATGA TTTTATGTGG ATTCTTTAAA 4260

ATGTAATCAG CGATTATCTT TTCTGTCTTC GTAAAATCAC TCAACTGCTT ATCAATGCGA 4320

TATAAAATAT TTGTCATCAT TAATCACCCA ACAAATCTGT CTGTCGCATC GCCTTTGTCG 4380

TTCCAAATAA ATATGTACAA ACGAATCCAC CAGCATACGC AGCAAGTAAT CCTGCAATAT 4440 

AACCTAAATA CATATTATCT GAGATTAATG GTAATAGTGA CACACCACTT GGGCCTATTG 4500 

CTTTGGCACC AATATGTCCA ATTCCACCTA TTACAGCGCC ACCAATACCA CCACCAATAC 4560 
AAGCAGTTAA GAAAGGTCGA CCTAATGGCA AAGTCACACC ATAGATTAAT GGTTCTCCGA 4620 
TACCTAGGAA ACCAACTGGC AATGCACCTT TTAAAGTATT ACGTAATGTT GTGTTGCGTT 4680 
TACATCTTAC CCAAAGTGCT AATGCGGCAC CTACTTGTCC AGCACCAGCC ATCGCTGCAA 4740 
TTGGCAATAA GTAAGTAGCA CCTGATTGGT TAATCATTTC TATATGAATT GGCGTAAAAA 4800
TATGATGAAG CCCTAACATA ACTAACGGTA GGAAGCTTGC ACCAATGATA AATCCACTAA 4860
ATACGCCACC AATACTAATA ATTCCGTTAA CTACTGAAAC TAAACTGTCT GAAACAAAAC 4920
CTGCTAATGG CATAAAGATA AAGATAGTTA ATAGTCCTAC AATCAACAAT GCAATAGTCG 4980

GCGTTACAAT AATATCAATC GCATTTGGCA CAATTTTATG TAATCTCTTT TCGACAATAC 5040 
TTAAAATCCA AACGGCAAAA ATAACGCCAA TAATCCCACC TTGTCCAGGT TGCAATGGTT 5100 

CTCCAGTGAA GACATTCATT AAAATATTTT TACCAGCAAT ACCCGTTAAT AACGTTGTAC 5160 
CACCAATCAC GCCACCAAGT CCTGGTGTCG CACCAAATTC TTTAGCCGCA TTAATACCAG 5220 
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GCGTAATCCA AGCACCTGAA ATATAGCCTG CCACCATTAA GTTACTCAGT ACTGCTGCAA 
TACCACCAAT TAATCCAGCT CCAATAAATG CAGGAATCAA CGGTATAAAG ATATTGGCAA 
TTGATTTCAA TACTTTATTC AACTTACCAT TCTTTTGTTT TGCTTTATGC GCTTCCTTAT 
TCGCCTTTGC TTTATCAGCT GCATATGATT TATAGTCCAT TTTTTCACTA TCATTGTGAT 
GGTGTGGTAT TGGGTCACCT AGTTTAACAC CACTTAATTC CGCCATATGA TTAGCCACTT

TATTGatGTA CCAGGTCCAA CCACAACTTG AATGCGTTCA TCGTGTATAA CACCCATGAC
ACCATCAATA TGCCTTAGTT CTTGGTCATC TACTTTATTC TCATCTAATA CTTTAATACG 
CACACGTGTC ATACAGTTCA TGACACTATC TATATTATCC ATACCACCTA CTGCAGCAAT 
AATTCGTTCT GCAAGTTGTT GTTCTTTGGT CATTTAAATC CCTCCTAAGG TTGTCTATCT 
CTGATTGCTC GTTTAAaATG TCACCATTGT TTAATAACCG TCTTGTTGCT TCTTCCTTAG 
AAATGCCACA CATACCCATA ACTGTCGCAA CTTTCACATC ATGCTCAGAT ACCTGATATA

ACGCCATTGC TTCATCATAT GTGATAGCAC ATATTTCTTG AATAATACGC ACTGAACGGT 
CGATCAGTTT TTGATTGGTT GCTTTAACAT CAATCATGAG GTTATCGTAA ACTTTTCCGA 
CACCAACCAT TGTGATGGTT GAAATCATAT TTAAAATTAA CTTTTGTGCt GTACCAGACT
TTAAACGTGT TGAACCAGTT AATACTTCTG GACCAACTTT AACTTCTACT GGATAcTGCG
CAATTTCACT TATAACTGCA TGTTCATTGC ATGAAATAGA TACTGTTGTA GCACCGATTG 
TGTTAGCAAA TGTTAAACCG CCTATAACAT ATGGCGTTTT GCCACTCGCG GCAATTCCTA 
TAACGACATC TTTTGATGTT AAATCTATAT TTTTCAAATC TTCTTCCGCT AATTTTTTGT 
GATCTTCCGC ACCTTCTACA GCCATCGTCA TAGCATGTTG TCCACCAGCA ATAATACCTA

TAATTTCATG AGGGTCAGTA TTGAATGTAG GTACACACTC CGCTGCATCT AAGACACCCA 
acctTccact - tgtacctgca~ ccgatataaa" TC^TCGTCC~ACCCTTTTO ~" 
TTGTTTTTTT AATTACTTTT GTCAATTGTG GTATTGCCTT TCGAACTGCT AACGGGACTT
OCTGATCTTC TTTATTCATC GTAATTAAAG CCTCTTCCAC AGTCATTTCA TCAAGATGCA
TCGTCGCTTC ATTACGCGCT TCGGTCGTAC TATTTTCCAT CACTTCTTAC ACTCCCTAGT 
TTTTTGAAAA TCAAATGTAT CATTCGGCTC GATACAACTT AACAGTGGTA AGTCTTCTTT 
AATAATTTGT GCAaCAACAT TCACATTGTC ATGTGCACTA AGCGTTTGTC TCACAATTTG 
CATTTCGCCT TGATAACGTC CGTTATTCAA ATTATCAACG GTTACTGAAC CAATGCGTCG 
TTGCGTCGTA AACTGTGGTT GAATCGAATG TGGACATATT TGTCTTGACG TTTCCGAACG 
AATGACATTT TCCGGATTAT CCGGGCGTAC TTTATGACAC ATATCGAAAA GGTAAGTCAC 
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AAGTTGTTTT GCCTGCCTCA TTTCAATCAA TGAGTCTCCA ACTAACACTT CAGATACACC 7140 

AGTTTCTTGT AATAATTTAG CTGCAACGAC AGGATGACTA TGTCTCGTTG CTTCAATTGT 7200 

TGGCAAGCCT TTATGCAAAG GACCTCGCAA ATCACTCCCT ACAATAAAAC CATATATTTG 7260

TGCCTTTGGA TTAAATTGAT AAATGAGTTC ATTTTTCTTA TTGACCAAGT CAACAGATAA 7320

TCCCGTATCT GGTCTTGGAT AATAGTTATG ACAAAATGAA AGTAATGTAA AATCATTCAA 7380

TTGTTGATGT AAGCTTGTTA ACAATTCCCG GGAAATAATA CTTGCATTCA AACAGCACTT 7440 

TAAACCCTGT GCCATTATCG CTTCGATTGC CTCAATTGAT GTACTATGAT CGATACGAAT 7500 

CATAAATTGT GCATCATATT GTCGAAGATG GTCATAAAAA GATGGTGTTA AAATAGATGG 7560 

ATTAGCATCT ATGAGGTAAG TCACTTGTTC ATGTTTTAAT AAATTGAGTA GTTTTGTGAA 7620 

ATAATGATAT TTTGTCTCGT CATCTTCTTC TGGTATTTGT ACAGATGTAA AAATCATTTG 7680

GTAACCTTGT TTAATCATTC GCTTAATATA CGCTTCATCT AAAGGTTGTC CTAAATACAC 7740

TGAAAAGCCT GTCAAAGTAG CCCTCCTTAA CAATATAATT ATTAGGAAAA TATAGTTGAT 7800

TTGTGTAATC GCTTACATTT TACTATAAGA GAAAACACAT TACAATATTA ATCAGTTAAA 7860

GCCTGTTCAT TGTAATAATC TTACATATTT CTGTCACAAG TTAATTATTA CACCATCAAA 7920

GATTATCCTT TCTTTTAAGT GCTGATAATA GCTGCTACTG CTGGATTATT ACAATAACTT 7980

TTATACATTT TATTCAGGAT TATCTTATAT TATGTTTTAA TAATAATCTG TGAACAATTA 8040

AGAGATTTGA AATTGAATTT AATAATTGTA TTGAAAACGC ATACTTCACC ATGCTAAAAT 8100 

AGGAGTCGCA AACAAATAAG ATTCAATAAG ATGTGATGGT TACCAACACA GTCTATTTGC 8160 

TCGTGTCTTT TTTTATTGAA TCTTAAATAA TAAATACAAC TTTGGAGGTT GGACAAGTGA 8220 

GGAAGAAACT TTTCGGTCAA TTGCAACGTA TTGGTAAAGC GCTAATGTTA CCTGTTGCGA 8280 

TTTTACCAGC AGCTGGTCTG TTATTAGCTA TCGGTACAGC TATGCAAGGT GAATCATTAC 8340 

AACACTACTT GCCGTTTATA CAAAATGGTG GCGTACAAAC TGTCGCTAAA TTAATGACAG 8400

GTGCTGGTGG TATCATTTTT GATAACTTGC CTATGATTTT CGCATTAGGT GTCGCAATCG 8460

GATTAGCTGG CGGTGATGGC GTAgcAGCTA TCGCAGCATT CGTCGGTTAC ATAATCATGA 8520

ACAAAACAAT GGGCGACTTT TTACAAGTTA CACCTAAGAA TATTGGTGAT CCAGCGAGTG 8580

GTTACGCTAG CATTTTAGGT ATCCCAACAT TACAAACAGG TGTGTTCGGC GGTATTATAA 8640

TCGGGGCCCT GGCAGCTTGG TGTTATAACA AGTTCTATAA CATTAACTTA CCATCTTATT 8700 

TAGGTTTCTT CGCTGGTAAG CGTTTCGTAC CTATTATGAT GGCTACAACA TCATTTATTT 8760 

TAGCATTCCC AATGGCATTA ATTTGGCCAA CGATTCAATC AGGATTAAAT GCATTCAGTA 8820 
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TATTAATTCC ATTCGGTCTA CATCACATTT TCCACGCACC GTTCTGGTTC GAGTTTGGTT ~ 894 0 

CATGGAAAAA TGCAGCTGGT GAAATTATTC ACGGTGACCA ACGTATCTTT ATCGAACAAA 9000

TTCGTGAAGG CGCACATTTG ACAGCTGGTA AATTCATGCA AGGTGAATTC CCTGTTATGA 9060

TGTTCGGTTT ACCTGCAGCA GCTTTAGCAA TTTATCACAC AGCTAAACCT GAAAATAAGA 9120

AAGTAGTAGC AGGTTTAATG GGTTCTGCTG CTTTAACATC ATTCTTAACT GGTATTACAG 9180

AACCATTAGA ATTCTCATTC TTATTTGTAG CACCATTATT ATTCTTTATT CACGCaGTAC 9240

TTGATGGTTT ATCATTCTTA ACATTGTACT TATTAGATCT TCATCTAGGT TATACATTCT 9300

CAGGTGGTTT CATCGACTAC TTCTTACTCG GTATACTACC TAATAAGACA CAATGGTGGT 9360

TAGTCATTCC TGTAGGTCTT GTATACGCAG TTATTTACTA CTTCGTATTC CGATTCTTAA 9420

TTGTAAAATT AAAATACAAA ACACCAGGTC GTGAAGATAA ACAATCACAA GCGGCTACTG 9480

CTTCAGCAAC TGAATTACCA TATGCAGTAT TAGAAGCTAT GGGTGGCAAA GCAAACATTA 9540

AACATTTAGA CGCTTGTATC ACACGTCTAC GTGTTGAAGT TAACGACAAA TCTAAAGTTG 9600

ATGTTCCTGG TTTGAAAGAT TTAGGCGCAT CTGGTGTATT AGAAGTCGGC AATAATATGC 9660 

AAGCAATTTT TGGTCCTAAA TCTGACCAAA TCAAACATGA AATGCAACAG ATTATGAATG 9720 

GTCAAGTAGT AGAAAATCCT ACTACTATGG AAGACGATAA AGACGAAACT GTTGTTGTTG 9780 

CAGAAGATAA ATCTGCAACA AGCGAATTGA GCCATATCGT GCATGCACCA TTAACTGGTG 9840

AAGTAACACC ATTATCAGAA GTGCCTGATC AAGTGTTCAG CGAAAAAATG ATGGGTGACG 9900 

GTATCGCTAT CAAACCTTCA CAAGGTGAAG TTCGTGCACC ATTCAACGGT AAAGTACAAA 9960 

TGATTTTCCC AACAAAACAT GCAATTGGTC TTGTATCAGA TAGTGGTTTA GAACTATTAA 10020

TCCACATCGG TTTAGACACT GTTAAATTAA ACGGAGAAGG CTTTACTTTA CATGTTGAGG 10080 

AAGGTCAAGA AGTTAAACAA GGTGATTTAT TAATCAACTT TGATTTAGAC TACATCCGCA 10140 

ATCATGCAAA GAGTGATATT ACGCCTATTA TCGTGACACA AGGAAACATT ACAAACCTTG 10200

ATTTTAAACA AGGTGAACAT GGCAACATTT CATTTGGCGA TCAATTATTT GAAGCTAAAT 10260 

AATGCTTACT ATAAACAGGT GCGTATACCT TCATAAGGTG ACGCGCCTGT TTTTTCTTTG 10320 

CTATTGTATT TTGCAGCATC ATTGATAGTT CGCTCTCCCC TTAAATTTTG AATTTTAAGA 10380

TCATCAATTA AAGCCCCCCT TCATACTCAT TTCCTAAAAA ATATTAATTG TTCACTATTG 10440

TTAGCGTTTT CACAACAAAG TCAACTTCCT TGACCTTACA CTATATTCGA GGCTATCATT 10500 

TTAAGTGTAA ATATAGAGAA AAGGTGGCTT TTTTTATGAA ACAACGCATT GGAGCTTACT 10560 

TAATTGACGC TATTCATCGA GCAGGCGTCG ATAAAATTTT TGGTGTTCCT GGTGATTTTA 10620 
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ATGAATTAAA CGCAAGTTAC GCAGCGGACG GTTATGCCCG TCTTAATGGA CTCGCTGCAT 1074 0 

TAGTTACTAC ATTTGGTGTT GGCGAATTAA GTGCCGTCAA CGGTATCGCA GGTTCATATG 10800 

CTGAACGCAT ACCTGTCATT GCGATTACAG GTGCGCCGAC ACGTGCTGTT GAACAAGGCG 10860 

GTAAATATGT ACATCACTCA CTTGGTGAAG GTACATTTGA CGACTATCGA AAAATGTTTG 10920 

CACATATAAC CGTTGCACAA GGTTATATCA CACCTGAAAA TGCAACAACC GAAATACCAC 10980

GTTTAATTAA TACAGCAATC GCCGAAAGAC GCCCAGTTCA TTTACATTTA CCAATCGATG 11040 

TCGCAATCTC TGAAATTGAG ATACCGACAC CATTTGAAGT GACGGCAACT AAATATACGG 11100 

ATGCATCAAC ATATATAGAG TTATTAGCAA CTAAACTGCA TCAAGCGAAG CAGCCTATCA 11160 

TCATTACTGG ACATGAAATT AACAGTTTTC ACCTCCATCA AGAATTAGAA GATTTTGTAA 11220 

ATCAAACACA GATACCAGTA GCACAACTTT CATTAGGAAA AGGTGCTTTT AATGAGGAAA 11280 

ATCCATATTA TATGGGTATT TACGATGGGA AAATTGCCGA AGATAAAATA CGAGATTATG 11340

TGGACAACAG CGATTTAATT TTAAATATTG GAGCCAAATT AACAGATTCA GCAACAGCAG 11400 

GTTTTTCATA CCAATTCAAT ATCGATGATG TCGTTATGTT AAATCATCAC AATATCAAAA 11460 

TTGACGATGT TACAAATGAT GAAATATCTC TACCATCATT GTTAAAACAG TTATCCAATA 11520 

TTTCATATAC GAATAACGCA ACGTTCCCTG CGTATCATCG TCCAACATCA CCCGATTATA 11580 

CTGTTGGCAC AGAACCATTA ACACAACAAA CTTATTTTAA AATGATGCAA AATTTCTTAA 11640 

AACCAAATGA TGTCATCATT GCTGATCAAG GTACATCATT CTTTGGTGCT TATGATTTAG 11700 

CATTATACAA AAACAATACT TTTATAGGGC AACCGTTATG GGGTTCTATC GGCTATACAT 11760

TACCTGCAAC ATTAGGTTCA CAATTAGCAG ACAAAGATCG TCGTAACTTA TTATTAATTG 11820 

GTGATGGCTC ATTGCAACTA ACTGTTCAAG CTATTTCAAC TATGATTAGA CAGCATATTA 11880

AACCGGTATT ATTTGTGATT AATAATGACG GCTATACGGT AGAACGACTT ATTCACGGCA 11940 

TGTATGAACC TTATAATGAA ATTCACATGT GGGATTATAA AGCTTTACCA GCTGTATTTG 12000 

GTGGTAAAAA TGTTGAAATT CATGACGTTG AATCATCAAA AGATTTACAA GACACGTTTA 12060 

ATGCAATTAA TGGTCATCCC GATGTGATGC ATTTTGTCGA AGTCAAAATG GCTGTCGAAG 12120 

aCGCACCGAA GAAACTCATC GATAtCGCTA AAGCTTTTTC ACAACAAAAT AAATAATTTC 12180

ATCGTATACA GGGTATAAGT TTAAGCGAAT ACTTTATTAA ACGAATAGGA CTCTGATATA 12240 

AGATGATTAA TTTTAATAAA ACCGCTTTAG TGTTAATCGA CCTGCAAGAA GGTATTCTTA 12300 

AAATGGATTA TGCCCCATAT ACAGCTGAAA ATGTCGTTCA AAACGCTAAT AAATTAATAG 12360 

ATGTTTTTAG AAAAAACAAT GGCTTTATCG CTTTTGTTCG CGTGAATTTC TATGATGGTA 12420 
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TAGCAATTAT 


TTTATTATTT 
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X -5 O U 
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CATGCCTTAT 
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GTATCGAAAA 


TCCAACTGAT 


ATCGCTAAAG 


AAACAATTAA 


GTCTGGTACT 


ATCAGTATCA 






TTATGATGGG 
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ACGGACAAGT 


TGAAATCATC 


GCTATGAATA 


ACAATTTTCA 
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CAGGATTTCA 
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TTTTGGAGAC 


ATTGAACAAT 
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1524 0 


30 


CAATTATTTT 


GGAACCAATT 


CAAGGTGAAG 


GTGGCGTTAA 


TATACCACCG 


AAAGGATATA 


153 00 




TTCAAGCTGT 


GCGTCAACTA 


TGTGATAAAC 
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15360 
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AAGTTGGTCT 
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40 
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GAACAACTGG 


i c c t\ n 




TTGAACGATC 


AGAACGCTTA 


GGTTCATTTT 


TATTAAAAGC 


GTTGCTACAA 


Oil AAA CATC 


t c ^ £ rt 




CTAGTATTAA 


AGAAATTAGA 


GGTCGTGGTT 


TATTTATAGG 


CATAGAGCTT 


TV TV /*"* TV /■» TV /"*• TV TV 

AACACAGATG 


15 /zu 


45 


CTGCACCTTT 


TGTGGATCAA 


CTGATTCAAC 


GTGGAATCTT 


ATGCAAAGAC 


ACGCATCGTA 


15 /oU 




CTATCATTCG 


ATTGTCTCCA 


CCTCTAGTCA 


TTGATAAAGA GGAAATCCAT 


CAAATTGTTG 


15840 








AAAAATTAAC 


AATTAATCAT 


TTATATATGA 


CATAGGAGGG 


15900 




CAGCTTTTCA 


AGACGTTTTT 


SO 


ATTCATGATG 


ATTAAAGTAG 


GTATCGTTGG 


CGGTAg cGGT 


TATGGCGCAA 


TTGAATTAAT 


15960 




TCGATTGTTA 


CAAACACATC 


CTCATGTAAC 


GATTGCACAC 


ATCTACTCAC 


ATTCAAAAGT 


16020 






ACTTACAGTG GaTAATAATG ACTGTGATGT AATTTTCTTT GCGACACCAG CACCCGTAAG 16140
TAAAACATGT ATCCCTCCCT TAGTAGAAAA AGGTATTCAT GTTATCGATT TATCTGGCGC 16200
ATTTAGAATT AAGAATCGTG AAATATATGA AGCATATTAC AAAGAAACTG CTGCAGCACA 16260
AGATGATTTG AATCATGCTA TTTACAGCAT TTCAGAATGG CAATCGTTTG ATAACAATGG 16320
AACGAAGCTC ATTTCTAATC CTGGCTGTTT CCCTACAGCA ACATTATTAG CATTACATCC 16380
ACTTATTAGC GAAAAAATAG TAGATTTGTC ATCTATTATT ATTGATGCTA AGACCGGCGT 16440
GTCAGGTGCT GGTCGTTCAT TATCACAACG gTTcATTTTT CAGAAATGAA TGAAAATCTA 16500
AGCGCTTATG CAATCGGAAA CCATAAACAC AAACCGGAAA TCGAGCAATA TTTATCTATC 16560
ATTGCGGGTC AAGATGTATC AGTCATATTT ACACCACATC TCGTACCAAT GACACGAGGT 16620
ATTTTATCAA CAATATATGT CAAATTATCA TCTGAATATA CGACTGAATC ATTACATAAA 16680
TTAATGACCT CTTATTATGC TAATCAGCCA TTTGTCAGAA TTAGAGATAT TGGGACTTTT 16740
CCAACCACAA AAGAAGTACT CGGTAGTAAC TACTGCGATA TCGGCATCTA TGTAGATGAA 16800
ACAACGCAAA CAGCAATTTT AGTATCAGTG ATTGATAACC TTGTCAAAGG CGCAAGTGGG 16860
CAAGCCATTC AAAATTTAAA TATATTATAT GATTTTGAAG TGACGACTGG CCTAAATCAA 16920
TCACCAGTTT ATCCATAAGG GGTGTTAGAA TGAAACATCA AGAAACGACA TCACAACAAT 16980
ATAACTTTTC AATTATTAAA CATGGCGATA TCAGTACACC TCAAGGCTTC ACGGCTGGTG 17040
GTATGCACAT CGGTTTACGC GCTAACAAAA AAGACTTTGG GTGGATTTAC TCATCGTCTT 17100
TGGCAAGTGC AGCTGCCGTA TATACTTTAA ATCAGTTTAA AGCTGCACCA CTTATTGTCA 17160
CTGAAGACAC TTTACAAAAG TCTAAAGGAA AATTACAAGC ACTTGTTGTT AATTCAGCTA 17220
ATGCAAATTC TTGTACCGGT CAACAAGGCA TAGATGATGC ACGACAAACA CAAACATGGG 17280
TTGCTCAACA ACTTCAAATA CCATCTGAGC ATGTTGCTGT TGCTTCAACT GGGGTCATTG 17340
GTGAATATTT GCCTATGGAT AAAATTAAGA CTGGGACCGA ACATATTAAG GATGCTAATT 17400
TTGCAACGCC AGGTGCGTTT AACGAGGCAA TTTTAACAAC TGATACCTGT ACAAAACATA 17460
TCGCTGTATC ACTAAAAATC GATGGTAAAA CCGTTACAAT TGGTGGTAGC ACCAAAGGTT 17520
CAGGTATGAT TCACCCAAAT ATGGCTACCA TGCTTGCTTT TATAACAACC GATGCATCGA 17580
TTGAATCGAA TACACTTCAT CAATTATTAA AATCTTCGAC TGACCATACA TTTAATATGA 17640
TTACTGTTGA TGGCGATACA AGTACAAATG ACATGGTATT AGTCATGGCA AATCACCAAG 17700
TTGAACACCA AATACTTAGT CAAGACCATC CACAATGGGA AACATTTGTT GATGCATTCA 17760
ATTTTGTCTG TACATTTTTA GCTAAAGCTA TAGCCAGAGA TGGCGAAGGC GCAACAAAGT 17820
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CTATCGTAAG TTCAAATCTA GTAAAATCAG CTATTTTTGG CGAAGATGCC AATTTTGGTC 1794 0 

GAATCATTAC AGCTATTGGC TACAGCGGAT GTGAAATTGA TCCTAACTGC ACATATGTTC 18000 

AACTGAACCA AATACCTGTC GTTGATAAAG GTATGGCTGT ACTATTTGAT GAGCAAGCTA 18060 

TGTCGAATAC ATTAACTCAT GAAAATGTCA CAATTGACGT TCAGCTTGGT TTAGGTAACG 18120 

CTGCAGCGAC TGCATACGGT TGTGATTTAT CCTATGATTA TGTGCGTATC AACGCATCAT 18180

ATCGAACATA AGGTGGTGTT GGTTAGATGA AATTTATTGT CATTAAAATT GGTGGCAGTA 18240

CACTTAGTGA CATGCATCCA TCAATTATTA ACAACATTAA GCATTTACGA TCAAACAACA 18300 

TCTACCCCAT TATCGTTCAT GGCGGTGGCC CATTTATTAA TGAAGCATTA TCAAACCAGC 18360 

AAATCGAGCC ACACTTTGTT AATGGCCTAA GAGTGACTGA TAAAGCAACC ATGACCATTA 18420 

CTAAACACAC GCTCATTGCA GACGTTAACA CTGCATTAGT AGCTCAATTT AACCAGCACC 18480

AATGTTCTGC AATAGGCTTA TGTGGTTTGG ATGCACAGCT GTTTGAAATT ACATCTTTTG 18540

ATCAACAATA TGGATATGTC GGTGTTCCGA CCGCTTTAAA TAAGGATGCT TTACAGTATT 18600

TATGTACTAA ATTTGTACCT ATCATCAATT CGATTGGTTT CAATAACCAT GATGGAGAAT 18660 

25 TTTACAATAT TAATG CTG AC ACGCTTGCCT ATTTTATTGC ATCATCATTA AAAGCGCCTA 18720 

TTTATGTATT AAGTAATATT GCAGGTGTAC TCATCAATGA TGTTGTTATA CCTCAATTGC 18780 

CATTAGTCGA TATTCATCAA TATATTGAAC ATGGTGATAT TTATGGAGGT ATGATTCCCA 18840 

AAGTGCTAGA TGCCAAAAAT GCGATTGAAA ATGGCTGTCC TAAAGTTATC ATTGCATCAG 18900

GAAACAAGCC AAATATCATT GAATCTATTT ACAATAATGA TTTTGTTGGC ACAACAATCC 18960

TTAATTCATA ACTATGAAAT TAAGGCCTAA CAAGTTTTGA CACGCGAGAT GATTCCAGTT 19020

CGATTATCCA TTGCGCTAAA ACATTTATTT ACCGTTCATC TCGTTAACAA TTTTGAATAC 19080 
AGT^CGATAC AATATGAGAT GTAAAAAACT AATAACCTTT TACAAATTTG TTTATCAAAA 19140 
TATTTTAAGT TTTGCAAAGC TTTTTATTGT GATTATTTTC ACAAAATACT ATAATGAGGA 19200 
TAGTAAATAG AGAGGAGTCC TTAAGTTGAC GAAACGACAA ATGGGTATAT TCATTTATGC 19260 
TGGAATTATC GGTGGCTTGT TATCTGGAAT TGTAAAATTA GGTTGGGAGG TCATGTTTCC 19320
ACCTCGCACA CCAGAACGTA ATGCAACGAA CCCACCTCAA GAGTTATTGC AACAATTAGG 19380 
ATTTAGTAGT GAGTTTACGC ATCAAACATA TACATTTTCA AATATGGAAT TGCCTTGGGT 19440 
AAGCTTTATT GTCCACTTTA GTTTTTCTAT CGTCATTGCA ATTATTTACT GCATATTAGT 19500
TAAAAAATAC GCTTACTTAG CAATGGGACA AGGTGCTGTT TTTGGTATTG CTATTTGGGT 19560
ATTATTCCAC CTTATCATTA TGCCAATCAT GCATACTGTA CCTGCTGTGT GGGATCAACC 19620
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AGTGCGACAA 


CATTTTGTCT 


ATCGCTATAA 


ATTAAATTAA 


TACACTGACT 


AACATTAACG 


19740 




TGAGTTTtAA 


ATCATCGTTT 


GAGTATGATG 


ATTGATGCTC 


ACGTTATTTT 


ATTAACTGAC 


19800 


5 


ATGATATGAT 


TCCAGCCAAC 


TTACGTGAGC 


ATTAAAGTCT 


CAAATGCGTC 


GTAACAAACT 


19860 




ATTATTTTCG 


GTAATTTCAA 


TATTGCTCAG 


TATATTTTTA 


CCTTATCACT 


TACTTTAATC 


19920 


10 


TCGTCATGAT 


TTTGAATGAT 


GCCATCGTGT 


ATTCACCTTT 


CATTTTTCCA 


ATAAAAAAAC 


19980 


ATCTAACAGT 


AAACATTTAG 


GCAGTATAGT 


TTAAAATCAC 


TGCGCAATGA 


TACTGTCAGA 


20040 




CGTCATATTA 


ACTACTCAAT 


AACTGAAATA 


CAGACACTTT 


TTTATAACCC 


CAGGGTGCCT 


20100 


15 


GTCCTAAGAA ACATACCTGT 


ACCATAAACT 


GATCAAAAAT 


AAATTGTTTG 


AACTTCACTT 


20160 




CACGTGATTG 


ATAAAAGTGT 


GATTGTGTCA 


TATCATAAAT 


GTCCAATCCT 


TTGATTAAAC 


20220 




V— X X v«r\^V.nnX 


CAATTTTGTA AAACTTTCTT 


TTTGTGTCCA 


TATTTGATAA 


AAATCATTTA 


20280 


20 


A A r*T A f 21 T AT 


x ITjA I Kj IXjQTl 


TCGTTTGTAG 


AGAAACACGT 


CACTAACGTA 


CGCCAGTCTA 


20340 




A TV r'flTTYl'TYl A 
rtrtLu X X\J 


1 Ax LrX~X~X"X'(_G 


ATATCAATAC 


CAACTGGTTC 


TTTATCGACA 


ACACACACGA 


20400 




TA TA A fV2 AT A 




AAGCTCACAT 


AGATGGGCTG 


TCCATCACGA 


TTGTGTTGAA 


20460 


25 


{^ATAXr'TYl*" 1 


CTTACCTCGT 


GGCGAAATGT 


GATAATGCCA 


TTCATGTGGT 


AATAAACCTG 


20520 




TC3TC!ATGTTn 

X 1 V_^^ X Ul X X \J 


AAX X LLA1A1 


T^ Z*™' Ti / Mil * T\ 1\ 7\ 

TtjCACTAAAA 


TATCTCCCAA 


TCTGTGCATG 


AGTTTATCTT 


20580 




A TTfiT^TPT 
VjM X iulnl^X 


ATAGTTGACT 


GTACGCGGTT 


TTTTATATGA 


CCAACGACTT 


TGTGATATTA 


20640 


30 


A TTrTn'P A A T 


ACTTTTCAAG 


TTACTCTGTA 


■rVX luwll ±n\~ 


Mrtft, X IvjIL. 


ATAACTTTC C 


20700 




CTACTTACTT 


ATTGAATATT 


GTTTTGATAT 


ATTGTGCCCA 


ATGATACAGC 


CAATTGTTAG 


20760 


35 


TTATCGTTGG 


CCATTTTTCA 


CTGATGTGAT 


TCATTATTTT 


TAATGTTAAT 


GTTGTATCTA 


20820 


TCATTGCTAG 


TTGTTGTTCA 


CGGTCAACAC 


TAGTTAATCC 


AATCGTTTCG 


TACATGTCTT 


20880 




GTTTCTGTAA 


AATTTCATTA 


AATGATTCAT 


CGCTGACGAG 


TTCAATTTCT 


TTGCGCTTAA 


20940 


40 


CGCATTCTAA 


CAAAGATTTC 


ACCGGCATTT 


TATTAGGTGA 


TAGCACATGG 


TAAATGATTT 


21000 




GTGGTGTGTT 


GACCTGTGCT 


AATGCGACAA 


TTTGTCTTGC 


AGTCGTATCC 


ACAAAAGAAA 


21060 




AATCTACAGG 


CATTTCAGCC 


ATGCTAACCC 


CGATACAATC 


CAGTTGTAAC 


AAATCATTCA 


21120 


45 


TTACCATTGA 


AAAACGGTTA 


GTCTTTATAT 


TTCTCATATG 


CCATCTTCCA 


TTGTAAGGAT 


21180 




TCGT CAAATT 


ACCAACACGT 


ACAATCCGAC 


CATCTAAGCC 


ATTATTTACA 


GCTTCTAATA 


21240 




CTTTTAATTC 


ACTATAAAAT 


TTGCTCCGTG 


TATATGGTGA 


TGTTAGTAGT 


TGCCCTTTAT 


21300 


50 


AGACATCCGC 


TTsTGAAAAT 


GTCACATCTT 


CTGTGTCTAT 


ATCAAAATAA 


GTTCCCACAC 


21360 




TTATCGTAGA 


CACATATATT 


AACCTTGCAT 


GATGTTGTTG 


TGCCAAACGT 


ATGACATCAA 


21420
CACCTGCATG AATAATCGTA TCCATGTTTT CTGGTAAAAC AACATCATCC ATACACTCGA 21540

AATCACCAAC AATGACTTCA ATGTTTGATA ACATTATTTC AACCGTCTCT TCTGAAAAAT 21600 

AATCATTTAA ATTCGTCATC AACTTATACC ATGCTATTTC CTCATTATCA GCACGTATGA 21660

AACAATAAAT GCGATGACTG TATCCTTGTA GTACTTCAAT CAGATAAGCA CCTAAAAAAC 21720 

CTGTCGCGCC AGTCAATAGT GTATTTCCTA GAGGTCGATG ACTTAGACTA TCCTCTAAAA 21780 

TACCCAAGTT ATAACGAGAC ATAACAATCT TTTGTAATTC CGAAAGATTA TCCGGTAATG 21840 

CAACTAATGA TTGTTGATTT TGGTACATAT AATTAACAAT CTGTCGCACG GTTTTATATT 21900 

GGTATAATGT CTGCATTGAA ATATGATGGC CAAATCGTTT TAAATGCGAG ACAACTAACA 21960 

TCGCCTCTAA TGAGTTACCA CCAAGTTCAA AGAAATCATC GTCAACACCG ACATCATTTT 22020 

GTTTCAATAC CTCTCCAAAT ACATCAACAA ATGTCTGCTC AATTTCATTA GAGGGTTCGC 22080 

TATACACTTT ATTAGACTGT TGTATAGGTG ATGGATTTGG CAAACGCGTA GTATCCACCT 22140

TGTCATTCGT GGTTAATGGC ATACAATCGA TATGCGTTAT AGTCTTAGGA ATCATATACT 22200

TAGGCAGCTG ATCATTTAAA TATTGCTTCA AATCCTGTTC CACTTGTTGC TCTCCGACAT 22260

AATAAGCATT CAATATATCA TGCGTATCAA AGTGACTTAC TGTTACAACA CAATCAGATA 22320

TACCACGAAT AGCTAATATT GCATTTTCAA TTTCATCAAG TTCAATACGG TACCCGTTAA 22380

CTTTCACTTG TTTATCTATT CTTCCTAAAA ATTCAATTTG ACCATCAGAT GTATAACGTG 22440 

CTAAATCACC ACTATGATAC AACTTTCCTT TACCAAATGG ATTATTTTGC CATTTATCAG 22500

CCATTAATTC TGGACGATTA ATATATCCTA TCGCTAAACT ATCACCTGCA ATACACAACT 22560

CGCCTGGCAT ACCAATACCG CATAACAAAC CATCTGACAT AATATACACT TGGATGTTAG 22620 

ATAAGGGTTT GCCAATTGGA ATCGTCTCAG GTATCAAATC ACCACAATGA TGTGACCAAT 22680 

ACGATGTGAT GACTGTTGAC TCAGATGGTC CATAGGCATT GAAATACGTG CCACAATGCT 22740 

TCTCAATATA TTTAACAAAG GATGCCGTAC TAGTTGCCCC GCCTGTAATC AACTTTTCAA 22800 

TATAAAAGTC TTCCATAACA CTACACATCT GTAACGGAAT CGACGCAACC GTCACACGAT 22860

GCTTATTAAT GAGTTGTTGT AACTGTTCTG GATTAACACG TTCCTCTCTA TCTGGAATCA 22920 

CAAGCGTATG ACCATTTAAC AAACAACAAT AAATCTCCAT AACTGATGCA TCAAAAACAA 22980

TATTTGCATG TTGCAAAAAT ACTTCATTGT CGCCTAATTG CAATTCAGTT GACCATGCAT 23040 

GCACTAAATT CAACAAATTT CGTTGTCGTA TGGCAACCCC TTTAGGCATC CCGGTCGTAC 23100 

CAGATGTGTA AATAGCATAC ATCTCATTAT CTAACATCGC TGTGTTTTCA AGTTGATTGC 23160

CATGTAAATC ATCATATTGT TCATTTTCCT TTGATTCAAC AAAGCCTTTA GCATTTTCCA 23220
TAGCATCCTC CAAAATTGCA CCTTGTCGTT TATTCGGAAA ATCAATATCG ATAGGTATAT 23340

AAGATGCACC TACTTTAACT GTCGCCAACA TCGCCGCAAT CATTTCAAAA CTACGTTCTG 23400 

TAAACAAGGC AACCCGTTGA CCATTGCCCA CACCATTTGA TAGGAGCATG TGCGCAATGG 23460

CATCCACATA GTTGCGTAAT GTTTCATACG TCATTGTCAA ATCATTCATG ACTAGCGCAA 23520 

CATGATTACC TTGTCGTGAG ACAACTTCAT TAAAGTAACT TATGATAGAT TTATTTCCCG 23580

GGACATTAAG CATTCGATCG TTAACATGCG TATTGACCCA ATTTAGAAGT TCCTCCGTGC 23640 

CGTTTGGTAT ATCACAAATT TGTAGTGTAT CTTGATGCTT CAAAATATAA TCAATCATAA 23700 

TCATACATTG ATTACCCATG TGACGAACTG TTTCTGAGTG ATATAAATCG GTATTATACT 23760 

CGATATTGAT TGTATAGTCA TCGCGATCTT CTTCAATGAT GAAAGATAAA TCAAATTTCG 23820 

CCGTCACTGA TTTGGGTTGA ATGTGTGTTA ATTTACTATG CCCAAAATGA GCATGATTCG 23880 

TTTCATTGTT TTGTAGTACT AACATGACAT CAAATAATGG ATTCCGTGAG GCATCATGTG 23940

ATTGATCTAA GTCATTTACT AAACATTCGA ATGGGTATTC TTGATGCTCG TATGCCTCCA 24000 

AACTCATTTC CTTAACCTCT TGTAAAAACT GTGTCCACAT TTTATCAGGT GACGGTTGCC 24060 

CTCTATATAC CAACGTATTA GCAAACATGC CTAGCATTTG CTCCGTGCCT TTATGCATAC 24120

GCGCACTCAT CACACTACCG ACAACAACAT CATCTTTTCG AGCATATCTA CTTAACAACG 24180 

TCATGACCAC ACTCATAAAG AACATAAAAT CAGTAATTTG ATGCTTTTCT ACATACTTTT 24240 

GAAGTAGCTG TCTCATTTGT TGATTCATTG TAAATGACAT CATTGCTCCA TTTGTCGTTT 24300 

TAATATTTGG TCTAACATAG TCTGTCGGTA AGCTTAAAAT AGGTACTTCA TCTTTGAATT 24360 

GAGATAACCA ATATTGTCTA TGTTTCGTCA TATCACGATG CGACATCCAC TCACTATAGT 24420 

CTTTATATTG CAATTTAAGT GGTAACAATA ATTTATGTTG ATAAAGTGCG TTAAGATCAT 24480

TCATTAATTG TATATTACTC ATACCGTCAT TAATGATATG ATGCGTATCT ATAAAGAGGT 24540

ATGCATGTAA GGGACTTCTA ATGTATCTCA CTCTAATTTG ACTTGGCTTT TCCAAATTAA 24600

AAGGTGCTAC AAATTGGCGC ATGATTTCTT GTTCATCCGT AAAATGCGTG TTAACTTCTT 24660

CAAAGTCAAC TGCAACATCT GCCACAATAC GTTGTCGAAC CTCATCATCT ACAACAATAT 24720 

ATTGTGTTCG TAAAATCTCA TGTCGCGCTA TCAAACGCTG CACTGCTTGT CGCAATTGAG 24780

CTACATTAAG TTCTGATGAT AACCGCCATA AAAAAGGTAC GTTATACACC GTATCTTTAT 24840

GGTTTGATTT CCATAATAAA TACATACGCT TTTGTGCAGA GCTCAGCACA TAATCATCTT 24900 

TAACTATAGT TTCTGGAATC ACTTCATAGT TTTGTTCTTG AACCTTAGCA ATCGCTTGTG 24960

CTAGTTCAAA TACAGTTGGC TTTTGTAATA AATCACCAAT TTGTAATCGT TTCCCAGTAG 25020
AATTATCATG AATACCTACT TGATTCACAT GTAAAATATC TGCAAAAATT TGGCATAGCA 25140

AGTGTTCGGT ATCTGTACTC GGTGCTACAT AGGCATCCGT ATCGACATAG TCCATGATAG 25200 

GCAATGCCTT CTTATCTAAT TTCCCATTAA TAGTAATAGG AATTTGCTCA ATATGCATGA 25260 

AATTAACTGG TATCATGTAC TCCGGTAAGG TCATACGTAA TTGTGATTTA ATCTTATTAT 25320

GTGATAATGT ATGCATCGCT TCATAATAAG CAACGATATA CTGATCTTGA TCATGATTTT 25380

GAACAATAAC AACTGCTTTA TTAATACCTT GTATACGCTC GAGCGCATGC TCAACCTCTG 25440 

ACAACTCAAT CCTAAACCCT CGAATCTTAA CTTGTTTGTC CTTTCGATAT AAATAATCTA 25500 

TGTTGCCATC GGGTAACAAA CGAACGATAT CACCACTTCT ATACATCAGC TGaTTTATAT 25560 

TTGAATCTTT GATAAATTTA TCTGCTGTCA ATTCTGGCTG ATTTAAATAA CCTGCAGCTA 25620 

ACCCAAAGCC ACTTGTACAT AATTCTCCAG GAATACCAAC GCCACACCGA CGCTCGCCTT 25680 

GCATGATATA AACATGAGTA CCCAGAATCG GTTTACCAAT AGGAATACGA TTTGGAACTT 25740 

TGTTAGGTAT ATTATACGTC GTTGTAAATG TTGTATTTTC AGTTGGTCCA TAACCATTAA 25800 

TAATTTGAGG ATGCTTCGGT TTTTGATTAA GCAAATCCAC CCACTTAGCA TTCAATACTT 25860 

CTCCACCAAT TAATAAATAC TTTAACGGTA CCAATACTTC TATTCGTTCA CTAGCAATCT 25920

GATTAAATAA TGAGGAGGTT AACCACATAG TATTAACGTC ATTTTCATTG ATTAATTGTT 25980 

CTACCGCTAT TGGATTTAAT AATTGTTCTT TTTTAGCAAC AATCAGCTTT CCACCATTGA 26040 

GCAATGCACC ATATATTTCA AATGTTGCAG CATCAAAGGC TATAGTTCCT GATAACAAAA 26100

TCGTCGTCTC TTCATTTAAT GGTACATAAT GATTTTGATG GACCAAGCGA ACAATACCTC 26160 

GGTGCGGAAT TAGTGTCCCT TTAGGGTTAC CAGTTGTCCC CGACGTGTAA ATAACATAAG 26220 

CATGATCTTC TAACGTGTTA CATTTAGAAA GATTATCAAT ATTTTTCCAC GCTATCTTAT 26280 

TCAA&TCAAT GTGATTAATA TTTTGTTTAC CATTTTCATA TAAAGCTTGG TACGTTATTA 26340

CAACTTTAGG CGTTACATCT TTTAAAATGT ACTCCTGACG ATCACTTGGA TAGTTCGGAT 26400 

CAATTGGCAC GTAAGCCCCA CCAGCTTTCA ACACACCTAT CATCGCTATT ATCATCTCAA 26460 

TACTTTTTTC AGCTATGACA GCGACACGAT CATTAGGTTC AACACCATAC TGGTTTCTCA 26520 

AACGGTGTGC TAAATCATTC GCGCGTGCAT TCAATGTTTG ATATGTTATA AACAcTCCGT 26580

CAAATTGCAC AGCGACATGA TTCGGCGTTG CTTCAACTTG TTGCTCAAAT AAGGTAACAA 26640 

CTGTTTGCGC ATCATCTATC TCAGGCAAAC TTAAATTGAT ATCGTCATAT AATTGAATAT 26700 

CACGTTCTGT CATCAAATTA AGTTCATCTA CAGTTGTTCG TTTATTTCCA TTTTCTTCAG 26760 

TAATTTGCAA ATAAATATTT CGAACTAAGT CACTCAGCGT CTCGATTGAG AGCAAATCAT 26820 
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CTAAAGATGT 


ATGTGCATCA 


TGTATTTGAT 


GTACATCCTC 


AATAACATCA 


TTACAACAAG 


26940 


5 


ACATCATATG 


ATGATAACAA 


TGAAAAATAG 


TCTCTAGTGA 


AAGCGAAGAC 


TTCGCGCACT 


27000 




GTAATTGCGA 


CATATTTTGC 


AACACACATT 


TATTAAAATC 


TGTTGTAAAA 


CG TTGACATA 


27060 




CATCTTTTGC 


ATCGATTGTT 


AACGTTAACG 


GCACAATATT 


TCCGTGTAAA 


TCATTTGGTA 


27120 


10 


AATGTGATGG 


TACATGTATA 


CCTAATGTGA 


CATCATGTTG 


TTGACTCATT 


ATATGATTAG 


27180 




CTAAATACAC 


ACTAACAGCC 


AACGATGCCA 


TATCTATTGA 


TGTCATATCA 


TCAATCAAAT 


27240 




ACGTTTGATA 


TAAAGCTTGT 


TCAAATGGAT 


GCTTAATTGG 


AAAATAACTA 


TCAATATGGA 


27300 


15 


TGTCAGAGTT 


ATTCTCTAAC 


CGAAAATAGT 


TTGAGTCTAA 


TGCTATATGC 


GATGCATCTT 


27360 




GATTGTCTTT 


ATCATCATTT 


CTATTTATAT 


GTGCATGCTG 


TCGAGTATTG 


TTAATAACAG 


27420 




TATTGCCACG 


ATATGCATTG 


CATAAATCAT 


CAAGAAnAAT 


ATCAATTTGA CTATCATCga 


27480 


20 


AAATGGaCAC 


ATGAAAATCT 


aATAGTATAT 


ATGCAGCATC 


AGCGAACTGm 


AACAATTTAA 


27540 




CTTTGAATAA 


AGGTGAATCA 


TTAAAATGGT 


AAGTACTTAA 


TTCTTGCTTA 


AAAAAAGCTT 


27600 




CTAAATCATA 


GTTTGCGGAA 


GAAGATGGAA 


CTTGTTTTAT 


CTCAATAAAA 


GGCAGAAATT 


27660 


25 


CATGAAGTAT 


CATTGTTAAA 


TTGTCATCGG 


TAGTAACATC 


AAAAAAATGT 


CTTATAGATG 


27720 




CATGTTGTgC ACAATTGTCG 


ATAATGCATA 


CATCATTTTA 


GTAGCTTCAA 


CATTTTTAGC 


27780 


30 


GAGTTTAACC 


CAATACGCAT 


TACGGTGTGT 


CGTTGATTCT 


GTATTATTTT 


TGTATATACG 


27840 


AAAATATTCC 


TGTTGAAATC 


TCAAATTACC 


CATAATCATA 


AAAAGTCCTT 


CTTTCATATC 


27900 




ATAATACTCA 


TTACTTACTG 


AAATTGCATG ATGATATGAT 


AACCGACGAA 


ATGTTAATTA 


27960 


35 


ACTCGTTATG 


TAATGaTTAA 


TATaAAACAC 


CATTCGCAAC 


ATATGAGCGA 


TATATTCTAC 


28020 




CCTAAAATAC 


ATCTTGTATC 


ATCGTTACAA 


TTGGTATATT 


TTTCAATGTA 


AATTACATAC 


28080 




ATCTTCGATA AATAGCACAC TACAAATCGT TAATCACTTT CTGTTGTTCA 


CATCTCATTG 


28140 


40 


CAAACTCAAT 


ATTGTTGTTA 


CAAAATATCC 


ATGAAGCAAG 


TTTATATTAA 


ACAAACAACT 


28200 




CGCATAAAAC 


AATTGTT AT C 


CTTAAATTTT 


AACAAATTCT 


TAATAAATTT 


ATCTCTATTT 


28260 




TAATTACGAC 


CAAATTAATA 


GGTTTTCCAT 


ATAAAAAGAT 


GCATAAAATA 


AATATTTAAA 


28320 


45 


TAAATTCAAT 


TTGTATTCAC 


TTGTTTTTGT 


CCCCCAAATA 


CACCAGCAAC 


AAGCATGCTA 


28380 




GCACCAATTG 


TTAAAACGAT 


AAACATATAC 


AGTCCCATTT 


GTAATGACGT 


TAAGAAAACA 


28440 


SO 


CCCAACACAA 


TCCCTAACCT 


AGCTAGTGTT 


TCTGAAAAAT 


GAATACCTAA 


TGCATTAACT 


28500 


GCACTATATG 


TTCCTCTTTT 


AGCTTTAGGA 


ATAATTTTAA 


AGCGTTGTTC 


TGAAACTATA 


28560 




GGCGAATAAA 


TAATTTCACC 


TACAGTCGCA ATTATCATAA AAACAACTAA 


TAAGCCAAAC 


28620 
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is 



20 



25 



GCri n iti AA 


AATCTATTTT 


/»"» » •» m jv » TA /"XT» 

CAATACAACT 


1 IC-GAGATTiVy 


A A T A PTTf^ TArZ 


TAaAATGACG 


0 tin a n 


ACGACCGTAT 


m ft itiA ft rrvn ft 

TAATCATTAG 


CAAG ATTG CT 


TV 7v IS ■ 1 ■ 1 1 IV 

AACATCTTAG 


V^ALL X v? X MM 1 






CCTATACTTA 


TTGTTTCAAA 


C7I viA I 111 




r»7A ATAT ATG A 
VJvi X M X M X VJrt 


GGAGATTGAA 


*ooou 


AATTCACCCA 


my* 1 ! ft ft rr^/*^ ft 

TCATGATGAT 


ACTGAACCCC 


»v tv A TO TV ATA 
OxAAATCAA 1 A 


B. f IV *T* ZV T A A TT 






ft ft ft * yi m_ ft M tr tf v 

AAAACTAATT 


m ft #vt ft ft /■VTV^ ^^/*** 

TATAACTG CG 


JV TV JV FT" A I* TV 'I" 1 • 

AAATATAI 1 l- 


ATTATTTGTA 


AX X X X XVjMXM 


ACG ACTTG C!A 


o o q o n 


TGCCTCTTGT 


CATCACTTTG 


CTTTACTTGA 


III LUCjIL 1 I 


vj/\vjO X MMA„.\„ t\ 


AATATATAAA 


2904 0 


ATAAAGAGTA 


CAATTAAAAA 


T ATACAAG CT 


/""•/T A TTR A./""* A 


nnnvj X Mo X MM 


PATACTGTAG 


2 9100 


CCATACATCA 


ft ^^^^^^ft ^"l yo 1 1 ft ft 

AGCCACCTAA 


CAATGCCCCA 


TV rrt j» f~* /T* A f~*f~*f~* 

A TAGC- 1 Av_tA» 


ATA ^fnTTYIT 
A X MAvj X X X \j X 


C ATP P AATAfl 


29160 


CTAATCTTGT 


m ft ft m ft TV ft *T\^^ 

AAATATAATG 


TTCCACGTCT 


TCwlAAl X\» 


P A TPP B.T A 7X T 




29220 


ATAACTGGAA 


ATTGTAATCC 


CCAAACGATT 


OT-IV TV 7\«T*7A*TW2 

G 1 AAA 1 A 1 TjC* 


P A T & TYSO A R P 


A P A A AAAPCA 


29280 


ATAATTTGCC 


A CAATTG ATG 


qv^ » f^/^r* AAA rr-i 

TGACCCAAA I 


AvA»v-t- w\ Ibrt 




TATPAPPATP 


29340 


GTCGCTTGAT 


AAATAAGTAC 


TAGCAACTTT 


tTCGGAAATA 


TCTCAATAAG 


GTAACCAGAT 


29400 


ATAATGGACA 


ATGGAAATTT 


nAGAACCACT 


AAACCAACAA 


GATATATACC 


GACAATTGAT 


29460 


TGACTTAACA 


TATCTGTTAA 


ATATAGTGCT 


ATAAACGGTA 


TAAATGCTGT 


CGTAATAATT 


29520 


AGCTGTAAAA 


nATTGCTAAT 


CAATCGTACT 


TTCAA 






29555 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:



AAAAAAAAAA 


AAAAAnGGTG 


AATCTTTAAT 


TAAACACTAA 


TATTGTAAAA 


GATGTTAAGT 


60 


AAACGCTTAA 


TGACACTTAT 


TTTTTGAAAA 


TAATAGTAAT 


ATCATTTTGT 


TAAATGAAAG 


120 


AATAAAGCTA 


TAATmATTAT 


AGAATAACTA 


TTTAAAGGAG 


ATTATAAACA 


TGCCAATTAT 


180 


TACAGATGTT 


TACGCTCGCG 


AAGTCTTAGA 


CTCTCGTGGT 


AACCCAACTG 


TTGAAGTAGA 


240 


AGTATTAACT 


GAAAGTGGCG 


CATTTGGTCG


TGCATTAGTA 


CCATCAGGTG 


CTTCAACTGG 


300 


TGAACACGAA 


GCTGTTGAAT 


TACGTGATGG 


AGACAAATCA 


CGTTATTTAG 


GTAAAGGTGT 


360 


TACTAAAGCA


GTTGAAAACG 


TTAATGAAAT 


CATCGCACCA 


GAAATTATTG 


AAGGTGAATT 


420 


TTCAGTATTA 


GATCAAGTAT 


CTATTGATAA 


AATGATGATC 


GCATTAGACG 


GTACTCCAAA 


480
AGCTGACTTA




CACTTTACAA ATATTTAGGT 


GGATTTAATG 


GTAAGCAGTT


600 




ACCAGTACCA 


ATfi ATTI A a /""a 


TCGTTAATGG 


TGGTTCTCAC 


TCAGATGCTC 


CAATTGCATT 


660 


5 


CCAAGAATTC 


"Awii X X InL 


CTGTAGGTGC 


TACAACGTTC 


AAAGAATCAT 


TACGTTGGGG 


720 




TACTGAAATT 


Ttppa r* a n rr* 

X X \-V_rtA_>i>\L. 1 


TAAAATCAAT 


TTTAAGCAAA 


CGTGGTTTAG 


AAACTGCAGT 


780 




AGGTGACGAA 


VJV7 X VjrVj XXI d.7 


CTCCTAAATT 


TGAAGGTACT 


GAAGATGCTG 


TTGAAACAAT 


840 


10 


TATCCAAGCA 




CTGGTTACAA 


ACCAGGTGAA 


GAAGTATTCT 


TAGGATTTGA 


900 




CTGTGCATCA 


TP At"! A RTTPT 


ATGAAAATGG 


TGTATATGAC 


TACAGTAAGT 


TCGAAGGCGA 


960 


IS 


ACACGGTGCA 




CTGCAGAACA 


AGTTGACTAC 


TTAGAACAAT 


TAGTAGACAA 


1020 


ATATCCTATf* 


1 lAWvil X\j 


AAGACGGTAT 


GGACGAAAAC 


GACTGGGATG 


GTTGGAAACA 


1080 




ACTTACAGAA 


V- U X <rl X X L> 


ACCGTGTACA 


ATTAGTAGGT 


GACGATTTAT 


TCGTAACAAA 


1140 


20 


CACTGAAATT 


TT TiCZC A A A Zk r* 


GTATTGAAAA 


CGGAATTGGT 


AACTCAATCT 


TAATTAAAGT 


1200 




TAAfCAAATr* 




CTGAAACATT 


TGATGCAATC 


GAAATGGCTC 


AAAAAGCTGG 


1260 




TTAPiir , a^r , & 

a InL/iV^uUn 




ACCGTTCAGg 


aAACAGAAGA 


TACAACAATT 


GCTGATATTG 


1320 


25 


v- X v» 1 1 VjC 1 AC 


AAACGCTGGT 


CAAATTAAAA 


CTGGTTCATT 


ATCACGTACT GACCgTATTG 


1380 




CTAAATACAA 


TCAATTATTA 


CGTATCGAgA TGAATTATTT 


GAAACTGCTA 


AATATGACGG 


1440 




TATCAAATCA 


TTCTATAACT 


TAGATAAATA 


ATTTTCTnTA 


TAATCAAATG 


CTGACATAAT 


1500 


30 


TTTAGTTGAG 


GATTATTATG 


ACGGTATAAA 


TAAATAAAG 






1539




(2) INFORMATION FOR SEQ ID NO: 208: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:



CAATTTCTAT 


CTATCAATGA 


TGTGCATACT 


TCCAnTTAAA 


TTAAtCGAAA 


TGaATCAAGG 


60 


TATATCATTC 


CTGCCTCTTT 


ATATAACaAC 


AAATAGTGAT 


TACAATATTT 


CGGTTATTAA 


120 


CACGAAAATT 


TTACAAGCAC 


CTATTTCATT 


TACATATATA 


TACAGCAAAA 


AAGAAAGCCC 


180 


AGAAATATTG 


GTGTTTATTA 


AATCATTTAA 


AAAGTATATT 


GCCAATGAAC 


AATTATAATA 


240 


AATTTCAAAT 


CTAAAAAACC 


AAGAATGCGA 


TTAATCATCA 


CATTCTTGGT 


TCAATTTTAT 


300 


TCATGAATTT 


TTTCAACATT 


AAACGTTAAG 


TTATTGTCTG 


AATTTAAATT 


AACTTTAATC 


360
CGTTGTACAA AACGTTTTAA TGGTCTTGCA CCGTATTGAG GTTCATAAGC TTCTTGACCT 480

AGCCAAGCTT TAGCATCATC AGAAACTTCA ATTGAGATTC GTTGTTCTAA TAATCTTATA 540

TTTAATTGCG TTAAGATTTT ATCTACAATC ATACTCATGT CATCAATAGA TAATGGTTTA 600

AATAATACGA TATCATCCAT ACGATTCAAA ATTTCTGGTT TGAAATATGC ATTTAAACTT 660

GTCATAACAG CTTTTTCTGT TGATTCTGTA ATTTCACCAG TCTCTTTTAC GTTTTCTAAT 720

AAAACTTGAG ATCCAATATT ACTTGTCATA ATAATAATAG TATTTTTAAA ATCAACGCTA 780

CGTCCTTTAG AATCAGTTAA ACGGCtTCAT CTAAAATTTG CAATAATACA TTAAAGACGT 840

CAGTAT 846




(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:



nTGGGAACAG 


TAAGCCAGTA 


TTTTTAGAAA 


GTTGCCATAC 


ATGAGCGTCG


ATTTTTCCAA 


60 


TATGGCTATG 


ACACTAGAAC 


AATGGGAATT 


TGGAGGAAAA 


GTAAATGATT 


AAACCTAAAA 


120 


TAGCATTAAC 


CATTGCAGGT 


ACTGATcCaA 


CAGGTGGTGC 


CGGCGTAATG


GCTGATTTAA 


180 


AATCATTTCA 


TTCATGTGGT 


GTATATGGTA 


TGGGCGTCGT 


TACAAGTmTT 


GTTGCTCAAA 


240 


ATACATTGGG 


CGTACAACAT 


ATTCATAATT 


TAAATCATCA 


ATGGGTAGAT 


GAACAACTTG 


300 


ATAGTGTCTT 


CAATGATACC 


TTACCTCATG 


CTATTAAAAC 


GGGGATGATT 


GCTACAGCAG 


360 


ATACTATGGA 


AACGATTCGT 


CATTATTTAA 


TGCAACATGA 


ATCTATTCCA 


TATGTAATtG


420 


ATCCTGTTAT 


GTTGGCGAAA


rCggTGATTC ATaATGGwTA ATGACaCAAg


CaAAACTTGC 


480 


AGCATaCGTT 


ATTGCCATTA 


GCTGACGTAG 


TAACACCGAA 


TTTACCAGAA 


GCTGAAGAAA 


540 


TAACGGGACT 


AACCATTGAT 


AGTGAAGAAA 


AAATTATGCA 


GGCTGGCCGC 


ATCTTTATTA 


600 


ATGAGATTGG 


TAGTAAAGGT 


GTCATCATTA 


AAGGCGGTCA 


TTCAAATGAT 


ACTGATATAG 


660 


CAAAAGATTA 


TTTATTTACT 


AACGAAGGTG 


TTCAAACATT 


TGAAAATGAA 


CGATTTAAAA 


720 


CAAnACATAC 


GCATGGAACA 


GGGTGTACAT 


TTTCAGCAGT 


TATAACGGCA 


GAACTTGCAA 


780 


AAGGTAGACC 


ATTATTTGAG 


GCTGTACACA 


AGGCTAAAAA 


GTTTATTTCA 


ATGAGTATAC 


840 


AATATACGCC 


TGAAATCGGC 


CGTGGTAGAG 


GTCCAGTGAA 


TCATTTTGCA 


TATTTAAAGA 


900 
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TGTTATACAA ACGATGTAGT TAAAAATTTT ACAGCGAATG 


GTTTATTAAG 


TATTGGTGCT


1020 




AGCCCTGCAA TGAGTGAAGC TCCCGAAGAA GCTGAAGAAT 


TTTACAAAGT 


TGCACAAGCG 


1080


5 


CTATTAATCA ATATCGGTAC TTTAACAGCA GAAAATGAAC 


AAGATATTAT 


TGCGATTGCT 


1140 




CAAACGGCAA ATGAGGCAGG CTTACCTATT GTATTTGACC


CTGTAGCTGT 


TGGTGCTTCT 


1200 


10 


ACATATCGAA AGCAATTTTG TAAATTATTA TTGAAATCAG 


CGAAAGTATC 


AGTAATTAAA 


1260 


GGCAATGCAT CTGAAATATT AGCGTTGATT GATGATACAG 


CAACTATGAA 


AGGTACAGAT 


1320 




AGTGATGCTA ATCTTGATGC GGTTGCAATA GCGAAAAAGG 


tTACGCAACA 


TATAAAACTG 


1380 


15 


CAATAGTAAT CACAGGTAAA GAGGACGTTA TTGtTCmAGA 


TAATAAAGCC 


TTCGTATTAG 


1440 




CTAATGGATC TCCATTATTA GCACGAGTAA CTGGAGCTGG 


TTGTTTATTA 


GGAGGCGTTA 


1500 




TTGCTGGATT TTTATTTAGA GAAACAGAAC CAGACATAGA 


AGCGTTAATT 


GAAGCGGTAA 


1560 


20 


GCgkATTTAA TATTGCTGCT GAGGTAGCTG CTGAAAATGA


AAATTGTGGT 


GGTCCTGGTA 


1620 




CGTTTTCACC ATTGTTGCTT GATACGTTAT ATCATTTAAA 


TGAAACAACC 


TATC 


1674 




(2) INFORMATION FOR SEQ ID NO: 210: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2232 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:


ATGAGTTGCC GATGAATTTA GCACCACCAA CGATTGCnTT 


TGATACTGTG 


TCCCAACCAG 


60 





CTTGTTTAGC ATATTTAATA CCTTCACGTA AAGGATCGTT


ATCATATGGA


GCAATACGAA


120


40 


ATACGTTATG GTATTTCGTG TTTGAGTTAG TTACAACTTT 


GTnTTGCACT 


ACATCTGCAC 


180 


CTTTCGCTAA TTGAGAAGTA CCGTTACCTG TTTCTAATAG 


GGCATGTGAG 


ATAAGATAAA 


240 




CTTCATTAAT GCCATACATT TGAGCAGCTT TGTTAAATGC 


AGCACCTTGG 


TTTTCTAATA 


300 


45 


CACCTTTACC TTTTAAGAAT TGATTAATTT TATCAATAGA 


AATATTTTGT 


GGTTGGTCTA 


360 




AGCGTAAGAA TTGATATTTT AATGCTGGAT CTTGAGCTAA 


ACGCTTCGTA 


TCCATTGCAT


420 




GCTTAACATC ATTAAATTTA GCATCTGTCC ACTTACCTGG 


TACACGTTGT 


ACTTGTGGTT 


480 




TATATTGTAA ACCAGCTTGT ATTTGAGCAA CTTGGTTTAA 


TGTCATACCT 


GTTTGATTAT 


540 




ACTTAATTAA TTCTTTAGCT AAATCAGTTG ATTTAATCCA 


TGCCAATTTA


CCGTTAGATA 


600 




ATTTACCATA GTACCAAGTT TGTCCATTAA TGACTTGTTC 


TTTAACAACT 


GCGAATGGTT 


660
AACCATTACC ATTTTTAATT ACATAAGTGT AGTTATAATC TTTGGcAGCT GATGTAGTTG 780

GTTTCACAGC AGTTGGTGCA GTTAAATCTT TTGCATTTAC CCAACCAGTG CGGTTATTAA 840

TAGTACCGTA TAAATAAACA TCTTTGCCTA CAGATACTTG TTTCGTTGCA TTAAATGTAC 900 

CTTGAGCAAT GTTATTGCCT GTTAAAATGA CTTGGTTTTT AGTACCCCAA GGAACCATTG 960

ATAAGCCGTT ATTTGATTTA TTAACAGTAT ATTTTTGAGT CGTTTTAACT TCTTTGCCTA 1020

AGTTTTGAAC ATTTAAGTCT TTTACATTGA ACCAACCTAA TGGGATGTTA TGGCTTGTAT 1080

TGTTTAATAA TACATACGTT TCATTACCAT GAGCACGCTC TTTTGTTACA TAGAACGTAC 1140 

GGTCTGCATA TTTCGCACCG TTTTTCGCTG TTTTTTCATA AACAGAAGCA CGAATACCAG 1200

TGTTGTTTGG TTTAACTTGA GCAATCTTGC TAACTGTTTG AGTCGTTTGT GGTTTAGTAA 1260 

CAGTATAAGC TTTTACAGCT GTTTTTGGTT GTGCTACTGC TTTTTTAGGT GCAGCAGGTA 1320 

CAGCTAAATA TGCTTTACTT ACCCAACCAG ATTTACCATT TACAGTTCCA AATAAATAGA 1380 

TAGATTTATC AATTTGTTGT TGCTTAGTCG CTTTAAAAGT TTGGTTACCT GTACCAGAAA 1440

CTGCACCAGC TTCTTGTTTA TAAGTGCCCC AAGGTACTGA ATATAATTTA GTGCCTGGgT 1500 

TTACTGTATA TGTTTGCATT ACATTTACAG GTGATTTTGC ATtGtTATAA ATACGTCACC 1560 

TTGTTTAACC CAACCAATTA AAGTTGGACT ATTGTAATCT TTAACTAAGT AGAATTTGTT 1620

TCCACCTAAA CTTGCTTCTT TTGTTACAGC AAATGTTTTT TGAACTTCTT TCGTTGGCTT 1680

ACCAGTTTTG TCATAAACTG TAGTGAATAA GCCATTGTTT TTAGCATTAA TTTGAGCAAC 1740

ACCGTTTAAT GATGAAACTG TTAATTTATT ATTTGTTGTA GGTGTTGATG GCTTAGGTGT 1800

TGGTGTAGGC GTAGGTTTAG CAGTATCAAC TAAATATGCT TTACTTACCC AACCAGATTT 1860

ACCATTCACA GAGCCATATA AATAAATTGA TTTATCAATT TGTTGTTGCT TTGAAGCCTT 1920

AAATGTTTGG TTTCCAGAGC CAGACACACT ACCAGCAACT TGTTTAGATG TACCCCAAGG 1980 

TACTGTATAA AGTTTCGTAC CAGGTTTGAT TGAATATGAT TGATTTACAT TTACAGGTGA 2040

TTTAGCTGTG TTGTAAACCA CATCGCCTTC TTTAACCCAA CCAAATTTAT TACCAGAATT 2100 

GTAATCTTGA ACAAGATAGA ATTTTTGATT ACCTAATGTA GCTGTTTTAG ATACAGCAAA 2160

TGTTTTTTGA ACTTCATTAG TTGCTTTACC AGTTTTGTCG TATACAGTAG TATATAAACC 2220 

ACTATTTGTT GG 2232 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

GATTTAAATA AAATTAATGG ATATCGTGAT CGTACGATGT TAGAACTTCT GTACGCAACG 60 


GGAATGCGTG TATCTGAATT GATACATTTA GAGTTAGAAA ACGTGAACTT AATAATGGGA 120 

TTTGTACGCG TATTTGGTAA AGGCGATAAA GAAAGAATTG TACCATTAGG CGACGCAGTC 180 

ATTGAGTACT TAACTACTTA TATTGAAACG ATTAGACCGC AACTTTTAAA AAAGACTGTT 240

ACTGAAGTCT TATTTTTAAA TATGCATGGT AAACCTTTAT CACGACAAGC AATATGGAAA 300

ATGATTAAAC AAAATGGTGT AAAGGCAAAC ATTAAAAAGA CGTTAACGCC ACATACGTTA 360 

CGCCACTCTT TTGCGACACA TTTATTGGAA AATGGCGCAG ATTTAAGAGC AGTGCAAGAG 420 

ATCTtAGGtC ACTCTGaCmT ATCTACTACC CmaCTCTATA CmCATGTTTC GrAATCTCAA 480

ATTAGAAAAA TGTATAACCA ATTTCATCCT AGAGCATAAA GTGAACAATA ACTCAAAAGT 540

CACAATACAC ATGACTAAAA ATGTCTGTGC TATTGTGGCT TTTTTAAATT GGTTGATTAA 600 

TTACGTCTAT GTTTTCTTAA TTGAATCGCT TCTTCTTTTG CTGCAATCAC TTCTGAACGA 660 

TCACGGCGCA TGTGATGGTC TACAATAAAA GGATCTGTTG CTGTTTCCTG ATTATAATCA 720 

TAGTCTGGAT AGTTGGCCTT GATGATGCGT TCAAAGACTG GAGTTATTGG TAATATAACA 780

GATGAAAAAG GCTTTGCTGC ATTCAATTTT GCAATCTGTT GCTCAATTAA CAACTGATAA 840

TCATTTAAAT TAAGGTATAA CGCATCTCTA TCTTTAGCAT TTTGTATTAT TTCTTTAGAT 900

TTATTAAAAG ACTTATAGGC GCCTTTTAAA TTATTGCGGC GATAATGGTA ACAAGCAGTT 960 

GCAAACAAGA TTAAACTAAC AACTGCATCT TGCTTACTGT AGTTATTTTC AGCTTTCCAT 1020 

GCATCTTCTA AAATGTCATG ACATAGGAAA TAATGTTGCT TAGTATGAAA TTGATAATAG 1080 

~AAACTTATCA~ GTGCCfTGfTC~CAff^GTTA TCACCCCAAT TTAAAAGTJ^ G^TTATTTTCA 1140 

TGCTATAATA TTTTAGAGAA TTATGCACAT ATGACGCAAT ACGAGGTAGA TATTATGTAT 1200 

GAAGTTAAAT TAGATGCTTT CAATGGACCA TTAGATTTAT TGCTGCATCT TATCCAAAAA 1260

TTTGAAATAG ATATTTATGA TATTCCTATG CAAGCATTAA CAGAGCAGTA TATGCAGTAC 1320 

GTTCATGCAA TGAAACAGCT TGAAATTAAT ATTGCAAGTG AATACCTAGT ATTAGCGTCA 1380

GAACTCTTAA TGATTAAAAG TAAGATGCTA TTACCACAAT CAACATCAGA TATGGATGTT 1440 

GATGATGACC CACGGGAAGA TTTAGTtGGG CGTTTAATAG rATATCaAAA TTATArAGAA 1500 


TATACTGCtA TTTTAAATGA CATGAAAGAA GAAAGAGATT TTTATTTTAC CAAAAAGACC 1560 

GACAGATTTA TCtCATTTGG AAAcAGATGA ATCyTGGGAT CCaAATCATA CGATTGATTT 1620 

ATCTGTTGAA ATCCGAAAAG AGACATTTAC CATTCAACAA GCTACAGAAC AAGTGACATC 1740

GAGATTGAAA GATAAAGATC ATTTTAACTT CTTTAGTCTG TTTACGTTTT CTGAGCCAAT 1800 

TGAACAAGTA GTCACTCACT TTTTAGCTAT TTTAGAGATG TCAAAAGCAG GAATAATTAA 1860

TATTGAGCAA CAACGTAATT TTGAAGATAT TAACATTATT AGAGGAGTGA ACTACCATTT 1920

TGGATAATCA TGGTATATTA GAGTCGCTTT TATTTACAGC TGGCGATGAA GGTTTAGATG 1980

AAAAACAACT ATTAGAAATA TTAGATATGT CGAAAGACCA ACTCGTTGAA TTAATTGAAA 2040

ATTATTCATC ACATGGATTA ATGATACAAC GATTTGGAAT GA 2082

(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4219 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 


TCTATTCTCG TTCTTCCAAG ACCCTGaATT AGAAGTTAAG AAAATCGAAG AAGATGAGAA 60 

AGAATCTATT AAAAAAGCTC AAAAAGGTAT TTATAAAGAC CCTAGAGACA TCAATGATGA 120 

CGAACAAGAT GATGATACAA AAGATACTGT TGATAAAAAG GAATGATTGT AATTGCCTAA 180

CAAAAACACT CAAGAATATT GGGAAGAACG CGGACGCAAA GCAATCGAGA ATGAGTTGAA 240

GCGTGATAAA ACTAAAGCTG AAGAAATAGA ACGTATATTG AATATGATGA TTAAGCGCAT 300 

TGAAAAAGAG ATCaATGCGT TTATTGTCAA GTACGGAGAT TTTGCAGGCG TTACATTACA 360

AGAAGCACAA AAGATTATTG ATGAGTTCGA TGTAAAAGCG TTTCAAGAAG AAGCAAAAAG 420

ATTGdlCGAA AACAAGGAGT TTAGCGATAG AGCAAATGAA GAATTAAAGA AGTATAACAC 480

GAAAATGTAT GTATCTAGAG AACAGATGTT AAAGATTCAA ATAGAATTCT TAATTGCTTA 540

TGCAACAGCT CAAACAGAAT TATCGATGAG GGAATATTTC GAATCAACAG CTTATCGTGT 600

GTTCAGTGAT CAAGCGGGTA TTTTAGGTGA AGGTGTACAA GTAGCTAAAG AAGTTATAGA 660

TACAATCGTT GATACACAAT TTCATGGTGT CGTTTGGTCA GAGCGATTAT GGACTAATAC 720

CGAAGCAATG AAACAAGAAG TAGAAGAAAT AATTGCTAAT GTAGTTATTA GAGGTCGACA 780

TCCTAATGAA TATGTTAAAG ATATGCGCAA CACTTAAATA AATTCGAAGG CACAGCACGA 840

CAAAAGACCG CAGCAATTAA ATCATTGCTT TATACGGAAT CGGCACGTGT TCACGCACAA 900

TCAAGCATTG ACAGCATGAA AGAAATTTCA CCGGAAGgAT ATTATATGTA TATTGCAAAA 960 

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GACGCTAAAA TTGGTGTTAA TTTCTATCCT ATGCATATCA ATTGTCGTTC 


nun X XV7V»ww. 1 


1080 




TTACTACCTA AATCTATGTG 


' GCCGAAAAAA CCAAGCAAGA 


. AACGAAAAAC 


A A A A*F A nTTr 1 


114 0 


5 


GGAGGGAAAG 


l TGAAAAGCGG 


TGATTGATTT 


AAAAGTGAAG 


TTTTTTAAAG 


G PA AGTr A fJT 


1200 




TTTGTATGAC 


AGTAAATTAA 


ATGTTTGGAG 


GATACTAATA 


TGAGTAATAC 


TG A PAA AT A P 


1260 


10 


CTTAGAGACA 


TAGCAAGAGA 


ATTAAAAGGT 


ATACGTAAAG 


AGTTACAAAA 


GPGAAAPGAA 


1320 


ACAGTTATTA 


TTGATGCAAA 


CTTAGACAGT 


TTAAGGTCGG 


CAGTATTAGC 


PGATAA AP.A A 


13 8 0 




AAATCGAAAT 


ATAATGAACC 


TCTCTTTTAA 


TAGCTAGCAC 


TTAATTGTGT 


x VJVjrv_- x n. X X X X 


144 0 


I o 


TTATGTCCAA 


AACGTGCTGA 


TGACATAAAA 


AGCACGCATG 


GAAAAACAGT 


PGA PAG APT A 


1500 




TAAATGGAGG 


TATATCTCAT 


GGAAGAAAAT 


AAACTTAAGT 


TTAATTTGCA 


O X X X X X X VjI_I/\ 


1560 




GACCAATCAG 


ATGATCCGGA 


CGAACCAGGC 


GGAGATGGTA 


AAAAAGGAAA 


X WW X VJ/\ X f\f\Kj 


1620 


20 


AAAGAAAATG 


ACGAAGGTAC 


TGAAATAACT 


TTCACGCCAG 


AGCAACAAAA 


CI A A A f5T*TY2 A T 


1680 




GAAATACTTG 


AACGTCGTGT 


AGCCCACGAA 


AAGAAAAAAG 


CTGATGAGTA 


TP.P A A A A P A A 


174 0 


25 


AAAGCAGCAG 


AAGCTGCTAA 


AGAAGCTGCT 


AAATTAGCGA 


AAATGAACAA 


fifiRTfa. A A A A 


1 8 00 


GATGAATATG 


AACG CGAACA 


AATGGAAAAA 


GAACTGGAAC 


AATTACGTTC 


An A aHRSPRJ\ 


1860 




TTAAACGAAA 


TGCGTTCAGA 


AGCACGAAAA 


ATGTTGAGTG 


AAGCGGaAGT 


TY* A TTP A T/"* A 
IvjKI X wV\XL*A 


1920 


30 


GATGrGGTTG 


TCAATTTAGT 


TGTAACAGAT 


ACTGCTGAAC 


AAACTAAATT 


A A t/":ttp a b 

v?M/\ loll lj A/\ 


1 9 80 




GCTTTTTCTA 


ATGCAGTAAA 


AAAAGCGGTT 


AATGAAG CGG 


TTAAGGTTAA 


Pf2PTAf2APA A 


204 0 




TCGCCATTGA 


CTGGTGGAGA 


TTCATTTAAT 


CACTCGACTA 


AAAATAAACC 


VjV — rl/vrvrlw X 1 A 


2100 


35 


GCTGAAATAG 


CTAGACAAAA 


AaGAATTATT 


AAAAATTAAC 


GGAGGCATTT 


AAATPP.A APA 






AACACAAAAA 


TTAAAATTAA 


ATTTGCAACA 


TTTTGCAAGT 


AACAATGTTA 


A APPAPA APT 






ATTTAACCCT 


GACAATGTAA 


TGATGCATGA 


AAAGAAAGAT 


GGCACGTTGT 


TAA APflAPTT 




40 


TACAACACCT 


ATCTTACAAG 


AGGTTATGGA 


AAACTCTAAA 


ATCATGCAAT 


TAGGTA Af2TA 






CGAACCAATG 


GAAGGTACTG 


AGAAGAAGTT 


TACTTTTTGG 


GCTGATAAAC 


PAfiP.Tp.PTTA 


4& 4 U U 


45 


CTGGGTAGGT 


GAAGGTCAAA 


AAATCGAAAC 


GTCTAAGGCT 


ACTTGGGTTA 


ATGPTAPAAT 

/AX UL lALA/ii 


£*x D u 




GAGAGCGTTT 


AAATTAGGGG 


TTATCTTACC 


AGTAACAAAA 


f2 A A 1" 1 V "I"! A 
vjnrtl X \_ X X 


A TT A P A PTT A 
nX XxAw/Aw X X J"\ 






TTCACAATTC TTTGAAGAAA TGAAACCTAT GATTGCTGAA GCTTTCTATA AAAAGTTTGA 2580

CGAGGCAGGT ATTTTGAATC AAGGTAACAA TCCGTTCGGT AAATCAATTG CACAATCAAT 2640

TGAAAAAACT AATAAGGTTA TTAAAGGTGA CTTCACACAA GATAACATTA TTGATTTAGA 2700

GGCATTGCTT GAAGATGACG AATTAGAAGC AAATGCATTT ATCTCAAAAA CACAAAACAG 2760
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TGATTCGTTA GACGGTCTAC CTGTGGTTAA CCTTAAATCA AGCAACTTAA AACGTGGTGA 2880 

ATTAATCACT GGTGACTTCG ACAAATTGAT TTATGGTATC CCTCAATTAA TCGAATACAA 2940 

AATCGATGAA ACTGCACAAT TATCTACAGT TAAAAACGAA GATGGCACAC CTGTAAACTT 3000 

GTTTGAACAA GACATGGTGG CATTACGTGC AACTATGCAT GTAGCATTGC ATATTGCTGA 3060 

TGATAAAGCG TTTGCTAAGT TAGTTCCTGC TGACAAAAGA ACAGATTCAG TTCCAGGAGA 3120 

AGTTTAATAA ATAATTAGGA GTGGTAACAT GCCCGAAATC ATTGGAATTG TTAAAGTAGA 3180 

TTTTACAGAT TTAGAAGATA ACAGACATGT CTATATGAAA GGGCATGTCT ACCCTCGTAA 3240

AGGTTATAAT CCTACAGATG AACGTATCAA AGCTTTAGCT AGTGTTGAAA ATAAACGCAA 3300

CAAACAAATG ATTTACATTG TAAATGACAA ATTAACCAAA AAAGAACTTG TCGAAATAGC 3360

AAGTGTTGCT GGCTTACAAG TTGATGAAAA ACAAACAAAA GCTGAAATTA TCAATGCTTT 3420

TGAGTCACTA GAGTAGGTGG TTATATGACT ACGCTAGCTG ATGTAAAAAA ACGTATTGGT 3480

CTTAAAGATG AAAAGCAAGA TGAACAATTA GAAGAAATCA TAAAAAGTTG TGAAAGCCAG 3540

TTGTTATCAA TGTTACCTAT TGAAGTTGAA CAAATACCGG AAAGgTTTAG TTACATGATT 3600

AAAGAAGTTG CAGTTAAACG CTACAACAGG ATTGGTGCTG AAGtATGACA TCAGAAGCGG 3660

TTGACGGACG TAGCAATGCG TATGAATTGA ACGATTtCAA GGAGTATGAA GCTATTATTG 3720

ATAATTACTT TAATGCTAGA ACGAGAACTA AAAAAGGAAG GGCTGTGTTC TTTTGAGATA 3780

TGAAGATAGA GTTATTTTTC AATTAGAACA AGTAGCAACT TACAATCCTA AAACTAGCAA 3840

AAAAGAAAAC ACACTAATCA CTTATGATGC GATACCATGC AATATTAACC CCATTTCTAG 3900

AGCAAGAAAG CAACTTGAAT TTGGTGATGT AAAAAACGAT GTAAGTGTTC TGAGGATAAA 3960

AGAATCAATA TCTTACCCTG TTAGCCACGT GTTGGTTAAT GGCATTCGCT ACAAGATAGT 4020

TGATACAAGG ATATACAGAC ACGAAACGTC ATATTATATC GAAGAGGTCA ATTGATGAAT 4080

ATAGATGGAT TAGACGCACT GTTAAACCAA TTTCACGATA TGAAAACCAA CATTGATGAT 4140

GATGTAGATG ATATTTTACA GGAAAACGCC AAAGAATATG TAGTACGAGC TAAATTGAAA 4200 

GCTAGAGAAG TAATGAATA 4219 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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GCTTACAAGT ATATTCATAA TTACATATTC AAGGTCCTTG CATGTGGTAT TTTGCTATGG 60 

yCtTTaACTA CAACGGGGTC TAAGACTGCG TTTATCATAT TAATCGTCTT AGCCATTtAT 120

TyCTTTATkA AAAAGTTATT TAGTAGAAAT GCGGTAAGTG TTGTGAGTAT GTCAGTGATT 180

ATGCTGATAT TACTTTGTTT TACCTTTTAT AATATCAACT ACTATTTATT CCAATTAAGC 240

GACCTTGATG CCTTACCGTC ATTAGATCGA ATGGCGTCTA TTTTTGAAGA GGGCTTTGCA 300

TCATTAAATG ATAGTGGGTC TGAGCGAAGT GTTGTATGGA TAAATGCCAT TTCAGTAATT 360

AAATATACAC TAGGTTTTGG TGTCGGATTA GTGGATTATG TACATATTGG CTCGCAAATT 420

AATGGTATTT TACTTGTTGC CCATAATACA TATTTGGAGA TCTTTGCGGA ATGGGGCATT 480

TTATTCGGTG CATTATTTAT CATATTTATG CTTTATTTAC TGTTTGAATT ATTTAGATTT 540

AACATTTCTG GGAAAAATGT AACAGCAATT GTTGTAATGT TGACGATGCT GATTTACTTT 600

TTAACAGTAT CATTTAATAA CTCAAGATAT GTCGCTTTTA TTTTAGGAAT TATCGTCTTT 660

ATTGTTCAAT ATGAAAAGAT GGAAAGGGAT CGTAATGAAG AGTGATTCAC TAAAAGAAAA 720

TATTATTTAT CAAGGGCTAT ACCAATTGAT TAGAACGATG ACACCACTGA TTACAATACC 780

CATTATTTCA CGTGCATTTG GTCCCAGTGG TGTGGGTATT GTTTCATTTT CTTTCAATAT 840

CGTGCAATAC TTTTTGATGA TTGCAAGTGT TGGCGTTCAG TTATATTTTA ATAGAGTTAT 900

CGCGAAGTCC GTTAACGACA AACGGCAATT GTCACAGCAG TTTTGGGATA TCTTTGTCAG 960

TAAATTATTT TTAGCGTTAA CAGTTTTTGC GATGTATATG GTCGTAATTA CTATATTTAT 1020

TGATGATTAC TATCTTATTT TCCTACTACA AGGAATCTAT ATTATAGGTG CAGCACTCGA 1080

TATTTCATGG TTTTATGCTG GAACTGAAAA GTTTAAAATT CCTAGCCTCA GTAATATTGT 1140

TQ^TCTGGT ATTGTATTAA GTGTAGTTGT TATTTTTGTC AAAGATCAAT CAGATTTATC 1200

ATTGTATGTA TTTACTATTG CTATTGTGAC GGTATTAAAC CAATTACCTT TGTTTATCTA 1260

TTTAAAACGA TACATTAGCT TTGTTTCGGT TAATTGGATA CACGTCTGGC AATTGTTTCG 1320

TTCGTCATTt AGCATACTTA TTACCAAATG GACAGCTCAA CTTATATACT AGTATTTCTT 1380

GCGTTGTTCT TGGTTTAGTA GGTACATACC AACAAGTTGG TATCTTTTCT AACGCATTTA 1440

ATATTTTAAC GGTCGCAATC ATAATGATTA ATACATTTGA TCTTGTAATG ATTCCGCGTA 1500

TTACCAAAAT GTCTATCCAG CAATCACATA GTTTAACTAA AACGTTAGCT AATAATATGA 1560

ATATTCAATT GATATTAaCA ATACCTATGG TCTTTgGTTT AATTGCaATT ATGCCATCAT 1620

TTTATTTATG GTTCtTTGGT GAGGAATTCG CATCAACTGT CCCATTGATG ACCATTTTAG 1680

CGATACTTGT ATTAATCATT CCTTTAAATA tGTTGaTAAg CaGGCAATAT TTAtTAAtAG 1740
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TATGTAyTAT TTTGATATAT TTTTATGGAA TTTACGGTGC TGCTATTGCG CGTTTAATTA i860 

CAGAGTTTTT CTTGCTCATT TGGCGATTTA TTGATATTAC TAAAATCAAT GTGAAGTTGA 1920 

ATATTGTAAG TACGATTCAA TGTGTCATTG CTGCTGTTAT GATGTTTATT GTGCTTGGTG 1980

TGGTCAATCA TTATTTGCC 1999 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7769 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TCATTATTAA GACTATTATA TATAATGAAT TTTAACTGGT TTATTAAACG AGAACGTCGG 60

GAATTAAGTA ACTACAATAA AAATAAGATA TGACAATAAG GAGACTACAC GCGTGATCAT 120

TGCCATAATT ATATTGATAT TTATTTCGTT TTTCTTTTCA GGAAGCGAGA CGGCATTAAC 180

GGCTGCCAAT AAAACAAAAT TTAAAACTGA AGCTGACAAA GGTGATAAAA AAGCAAAAGG 240

CATTGTAAAG TTACTTGAAA AACCAAGTGA GTTTATTACA ACGATTCTAA TTGGGAATAA 300

TGTCGCGAAT ATTTTATTAC CAACACTTGT TACAATTATG GCTTTACGTT GGGGGATTAG 360

CGTTGGTATT GCATCAGCTG TTTTAACAGT TGTTATCATT TTGATCTCCG AAGTGATTCC 420

CAAGTCTGTC GCTGCAACAT TTCCAGATAA AATAACAAGG CTTGTATATC CAATTATTAA 480

TATTTGTGTC ATTGTGTTCC GTCCTATCAC ATTACTTTTA AATAAGTTGA CGGACAGTAT 540

TAATCGAAGT TTATCTAAGG GCCAACCTCA AGAACATCAA TTTTCAAAAG AAGAATTTAA 600

AACAATGTTA GCAATTGCTG GACATGAAGG TGCTTTAAAT GAAATTGAGA CGAGTAGGTT 660

GGAAGGTGTC ATTAATTTTG AAAATTTAAA AGTAAAAGAT GTAGATACAA CACCTAGAAT 720

TAATGTGACG GCATTTGCTT CAAATGCGaC ATACGAAGAA GTTTATGAAA CGGTTATGAA 780

TAAGCCATAC ACTAGATATC CAGTGTACGA GGGAGATATT GATAACATTA TTGGGGTGTT 840

TCATTCTAAA TATCTGTTGG CTTGGAGTAA TAAAAAAGAA AATCAAATTA CAAACTATTC 900

AGCTAAGCCA TTATTTGTGA ATGAACACAA TAAAGCTGAA TGGGTATTAC GTAAGATGAC 960

TATTTCTAGA AAACATTTAG CAATTGTGTT GGACGAATTT GGTGGTACTG AAGCGATAGT 1020

GTCACATGAA GACTTAATTG AAGAATTATT AGGTATGGAA ATTGAAGATG AGATGGATAA 1080

AAAGGAAAAA GAAAAACTTT CTCAACAGCA AATTCAATTT CAACAACGGA AAAATCGCAA 1140
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GTATTGAATA TCCAATTATA CAAGCAGGTA TGGCAGGAAG TACGACACCG AAATTAGTTG 1260 

CATCAGTAAG TAACAGTGGT GGGTTAGGCA CAATAGGCGC AGGTTACTTT AATACGCAGC 1320

AATTGGAAGA TGAAATAGAT TATGTACGCC AATTAACGTC AAATTCTTTT GGCGTAAATG 1380

TCTTTGTACC AAGTCAACAA TCATATACCA GTAGTCAAAT TGAAAATATG AATGCATGGT 1440

TAAAACCTTA TCGACGCGCA TTACATTTAG AAGAGCCGGT TGTAAAAATT ACCGAAGAAC 1500

AACAATTTAA GTGTCATATT GATACGATAA TTAAAAAGCA AGTGCCTGTA TGTTGTTTTA 1560

CTTTTGGAAT TCCAAGCGAA CAGATTATAA GCAGGTTGAA AGCAGCGAAT GTCAAACTTA 1620

TAGGTACAGC AACAAGTGTT GATGAAGCTA TTGCGAATGA AAAAGCGGGT ATGGATGCTA 1680

TCGTTGCTCA AGGTAGTGAA GCAGGTGGAC ATOGTGGTTC ATTTTTAAAA CCTAAAAATC 1740

AATTACCTAT GGTTGGAACA ATATCTTTAG TGCCACAAAT TGTAGATGTC GTTTCAATTC 1800

CGGTCATTGC CGCTGGTGGA ATTATGGATG GTAGAGGAGT TTTGGCAAGT ATTGTCTTAG 1860

GTGCAGAAGG GGTACAAATG GGCACCGCAT TTTTAACATC ACAAGACAGT AATGCATCAG 1920

AACTACTGCG AGATGCAATT ATAAATAGTA AAGAAACAGA TACAGTCATT ACAAAAGCGT 1980

TTAGTGGAAA GCTTGCACGC GGTATCAACA ATAGGTTTAT CGAAGAAATG TCCCAATACG 2040

AAGGCGATAT CCCAGATTAT CCAATACAAA ATGAGCTAAC AAGTAGCATA AGAAAAGCCG 2100

CAGCAAACAT CGGCGACAAA GAGTTAATAC ATATGTGGAG TGGACAAAGC CCGCGACTAG 2160

CAACAACGCA TCCCGCCAAC ACCATCATGT CCAATATAAT CAATCAAATT AATCAAATCA 2220

TGCAATATAA ATAATCGACC GCAATCCACA AAAGCACAAG CACCCCCAAA CATTATTTTA 2280

GTGCTTGCCA TTTTTGTGGA TTGCGTTTCT ATTTTACCAA TTTAATCAAA CGAAAACATC 2340

AAGCTGAAGA TCGCCGAAAG ATTTTAATCA AGCAAAAACA TCAAAC^AAA^GTTCGCTC 1 2400

ATGATTATGA TAAAAGTTAT ATGGTATGAT GACATTGGTG ATATATATGA TAAACATCGG 2460

ATTAACAGGT TGGGGTGATC ACTATTCATT ATATGAAGAT TTAGAACGCC AAACCGATAA 2520

ACTTAAAACA TATGCTGGAC ATTTTCCGGT TGTCGAATTA GATGCGACAT ACTATGCGAT 2580

ACAACCGGAA AGAAATATAT TGAAATGGAT AAAAGAAACG CCTGATACAT TTGAATTTGT 2640

GGTCAAAATT CATCAAGCAC TcACATTGCA TGCAGACTAC AAAACATTTG CAGATACAAG 2700

GCAAGAACTA TTTGATCAAT TTAAGAATAT GTTAGAGCCC TTACATACAC AGAAAAAATT 2760

AGCAATGGTA TTGGTTCAAT TTCCGCCATG GTTTGACTGC AATGCACAAA ATATCAAATA 2820

TATTTTGTAT GTAAGACAGC AATTACAAGC ATTTCCAATG TGTGTAGAAT TTAGGCATCA 2880

ATCATGGTTT AGTGATGCAT TTAAAGAACA AACATTGGCA TTTTTAACAG AACATCAAAT 2940
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AATCACAAAT 


GAAATTGCGT 


TTGTACGTTA 


TCATGGACGT 


AATCATTACG 


GTTGGACTAA 






GAAAGATATG 


TCAGATCAAG 


AATGGCGCGA 


TGTACGCTAT 


TTATATGATT 


ATAATGAGCA 




5 


AGAATTAATA 


GACTTGGCAC 


AAAAGGCACA 


AAT ATT AG CA 


CAAAAAGCTA 


AGAAAGTTTA 


JloU 




CGTCATATTT 


AACAATAATT 


CTGGTGGTCA 


TGCAGCAAAT 


AATGCCAAAA 


CATATCAGCG 




10 


ATTATTGAAT 


ATAGAATATG 


AAGGGTTAGC 


ACCACAACAA 


TTAAAATTAT 


TTTAAGAGG C 


J J U \J 


GACGACTATG 


TTATTAACAA 


TTACATTATT 


AGTTTTAATC 


GGAGGTTTGT 


CAGCGATTAT 


T ri 

J J O w 




AGGGTCTATC 


GTAGGCATTG 


GAGGCGGTAT 


TATTATCGTT 


CCAACAATGG 


TTTACCTCGG 


J *± ^ u 




TGTTGAACAT 


GGATTACTAC 


ATAATATTAC 


AACACAAGTA 


GCGATAGGGA 


CGTCTTCAGT 


*?4 ft ft 




CATTCTAATT 


GTGACAGGAC 


TTTCTTCATC 


ACTTGGATAT 


TTAAAAACAA 


AACAAGTTGA 






TATTAAAAAT GGTTCCATCT TTTTATTTGG ACTATTACCA GGTTCATTGC TTGGGTCCTT 3600


20 


CATTAGTAGA 


TATTTAACAT 


TTGAGTCATT 


TAATTTATAT 


TTTGGTATCT 


TTTTAATTTT 


J D O \J 




CGTAG CCATT 


TTATTAATGG 


TAAGAAATAA 


GATTAAACCG 


TTTAAAATTT 


TP. GATAAAP P 




26 


CAAGTATGAA 


AAGACTTATG 


TAGACGCTAA 


AGGTAAAACA 


TATCATTATA 


aTGTTPPAP P 


j / ou 


ATTGTTTGCT 


TTTATTACAA 


CGTTTTTAAT 


TGGTATATTG 


ACAGGTTTAT 


TTGGTATTfl^ 

X X Nj^J X *A A A WW 






AGGTGGCGCA 


CTAATGACGC 


CACTAATGCT 


TATTGTATTT 


AGATTTCCAC 


CTCATGTAGP 


J3UU 


30 


TGTTGGAACA 


AGTATGATGA 


TGATTTTCTT 


TTCAAGTGTC 


ATGAGTTCTA 


TAGGG PAPAT 


~\ q £ri 

JJOU 




TGCTCAAGGT 


CACGTAGCTT 


GGGGTTATGC 


AATCATnTTA 


ATTATTTCTA 


GTTATTTTG^ 


4 n ^ n 




TGCGAAAATC GGTGTCAAAG TGAATCAATC AATTAAGTCA GATACGGTAG TAACATTATT 4080

GAGAACAGTA ATGTTGTTAA TGGGTATATA TTTAATTATT CGTGCGTTGA TTTAATACAA 4140

CTTTAAAAGG AGGACGTCAA TTTGAGGCTT ACAATTTATC ATACGAACGA TATTCATAGT 4200

CATTTACATG AATACGAACG CATTAAAGCA TATATGGCAG AACATCGGCC ACGACTTAAT 4260

CATCCTTCTT TATATGTTGA TCTAGGTGAT CATGTAGATT TATCCGCACC TATAACTGAA 4320

GCAACTTTAG GTAAAAAGAA TGTGGCATTA CTAAATGAAG CAAAATGTGA TGTTGCAACA 4380

ATCGGTAATA ATGAAGGGAT GACCATTTCA TACGAAGCTT TAAATCACCT TTACGACGAA 4440

GCAAAATTTA TAGTGACATG TAGCAATGTT ATAGATGAAT CAGGTCATTT ACCAAATAAT 4500




*» X X x 1 \. i. X 




titj A CAT AG A C 


GGTGTGAAAA 


TACTATTCGT 


TGCAGCGACA 


4560 


SO 


GCACCTTTTA CCCCATTTTA TCGTGCACTA AATTGGATTG TTACCGATCC ACTTGAATCT 4620

ATAAAAGAAG AAATTGAACT TCAACGAGGT AAATTTGATG TATTAATCGT GCTAAGTCAT 4680

TGTGGCATTT TCTTCGATGA AACATTATGC CAAGAATTGC CTGAAATTGA TGTCATTTTT 4740
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GCAgCTGGAA AGTATGGTAA TTATCTTGGA 


, GAGGTTAATT 


' TAACTTTTCA 

x nn\^. x x x x \J-/A 


OfJPAP ATA A H 


a a t? r\ 
4 86 0 




GT AG TACATA AAACTGCAAA GATTATTCCT 


TTAGAAACAT 


TACCTGAAGT 


TGAAAfTTrA 


A CX O f\ 

h y 2. o 


5 


TTTGAAGAAG 


AAGGAAAAAC 


GTTAATGTCC 


AATTCAGTAA 


TTCAACATCC 


AGTAGTGPTT 

«r\VJ X fWS X X X 






AAGCGTAGTA 


TGAATCACAT 


AACTGAAGCT 


GCATACTTAT 


TAGCTCAAAG 


X VJ XXX VJ X V3AVv3 


c n f\ 


10 


TATACACATG 


CACAATGTGC 


CATCATCAAT 


GCTGGCTTAC 


TCGTTAAAGA 


TATTGTAAAA 


jiUU 


GATGAAGTGA 


CAGAATATGA 


CATTCATCAA 


ATGTTACCGC 


ATCCGATTAA 


TATGGTAAGG 


CI 

OlbU 




GTTAGACTTT 


TTGGTGTGAA 


ATTAAAAGAG 


ATTATAGCTA 


AAAGTAATAA 


ACAAGAATAT 


coon 


15 


ATGTATGAAC 


ATGCACAAGG 


TTTGGGTTTC 


AGAGGGAATA 


TATTTGGAGG 


AT AT A TTPTT 


coon 
5«£oO 




TATAATTTAG 


GGTACATTCA 


TTCTACAGGG 


OGTTACTATC 


TGAATGGAGA 


AG AAA TfY^ A A 






GACGACAAAG 


AATATGTACT 


AGGTACGATA 


GATATGTATA 


CGTTCGGTCG 


x xnx x lV<Vw\ 




20 


ACATTGAAAG 


AATTACCAAA 


AGAGTATTTA 


ATGCCAGAGT 


TTTTAAGAGA 


TATATTTi A A 

XrtXrtX X X4*4A#r\ 






GAAAAATTAT 


TGGAATATTA 


AAAAGTAAGA 


TTATTGGATT 


TTCATTTGTf* 

A X ^— rf* XXX VJ X \™. 


ATflA iTTTPf 
r\ X Vxrvri X x x ^— .Vj 




25 


ATATAATGTT 


TAAAGATACA 


CTTAACAGGA 


GGGTATGTGT 


TGTTATGG CG 

X X X *Ti X VJ 


apa A A a a arr 1 


5580 


AGGAAATATT 


ACGTAAACCG 


GATTGGTTGA 


AAATAAAATT 


AAATACCAAC 


GAAAAPTATA 


CiTx A 

3o4 U 




CAGGACTTAA 


GAAGATGATG 


AGGGAAAAAA 


ATCTTAATAC 


TGTATGTGAA 

X w X f\ X VJ X VJXAi*\ 


f3A Af2fTA A 


57 00 


30 


GTCCTAATAT 


ACATGAATGT 


TGGGGTGCAC 


GTCGTACAGC 


GACATTTATG 


AX X X X*r»v7\j x vi 


C "7 *c /\ 

5760 




CCGTATGTAC 


AAGAG CTTGT 


CGTTTTTGTG 


CGGTTAAGAC 


AGGTTTACCT 


AATGAACTTG 


5820 




ATTTAAATGA 


GCCTGAACGT 


GTAGCTGAAT 


CAGTTGAATT 


AATGAATTTG 




5ooU 


35 


TTATCACTGC 


TGTTGCGCGT 


GATGATTTAA 


GAGATGCTGG 


TTCAAATGTT 








CAGTACGTAA AGTTAGAGAA AGAAATCCAT TTACAACGAT 


TGAAATTTTA 

X\MVU^X X X x*r^ 


fVATPAfiATA 


OvUU 




TGGGCGGGGA 


CTATGATGCG 


TTAGAAACAT 


TAATGGCGTC 


AAGACCTGAC 


AX X X l/VUW.^ 


OUoU 


40 


ATAATATTGA 


AACTGTTCGT 


CGCTTAACAC 


CGAGAGTTCG 


TGCGCGTGCG 


A ("TTA PY3 A P A 

«*V»- X X /^>—VJiT^V^/A 






GAACATTAGA 


GTTTTTACGT 


CGTTCAAAAG 


AATTACAACC 


GGATATCCCA 


ACTAAATCAA 

** w x ****** x 


fii fin 


45 


GTATTATGGT 


TGGATTAGGT 


GAAACTATAG 


AAGAAATTTA 


TGAAACGATG 


GATGATTTA C 
wn x un xxx *r*.\— 


O *& *± v 


GTGCGAATGA 


TGTAGATATT 


TTAACGATTG 


GTCAATATTT 


ACAACCTTCA 


PGT A A A P ATT 


V ^ V V 




TAAAGGTTCA AAAATATTAC ACGCCTTTAG AGTTTGGTAA ATTAAGAAAA GTGGCAATGG 6360

ATAAAGGGTT TAAACATTGC CAAGCTGGAC CTTTAGTACG TAGTTCTTAT CATGCGGATG 6420

AGCAAGTAAA TGAAGCTGCT AAAGAAAAGC AACGCCAAGG TGAGGCACAG TTAAATAGTT 6480

AATATTTAAC CATTAATAAG GCATAAAGGC TTAGTTTGTA CAAAACGAAC GTGTCATAGA 6540
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10 






AGGTGAAGAA TTTGATAAAA GTAGATCAAC ATTACTTTGA 
AATGTTTTAA TGAAGAACAA TTTATTGCTA GGTATTCAGA 
ACATAGTTGG TGACTATGGT TACGATCAAT TACGATTAAA 
ATAAAAAAGC AGAGATGAGT AAACGTTTTT CAAATATTCA 
GTAACTTTGG TTGTCCTTAC TTTGTATTAA GACATTTGTC 
TAATCGAAGA AGTTCATCCG TCTGATGTGA TAGATGACGA 
AGATTAAGCC AACCATTCAA GATACTGAAC ATTAATAAAA 
GGGAATCATG CAATTCAAGC ATGGACCTGT AATCTAGTTA 
ATGACTTCAT TTAAATACTC AGTAATTTCA TCGCCTTCTT 
TGAGCGATAT AGCCTTCTTC TTTTAAATCA TCAGTACCGA 
TG CAT ATT AA GTACGAGTGT CTT AC CAT AA TGTCTATTTG 
TATCGACTAT GCTCGCCAAC AAAACCAACA AACTGAACTT 
TATAAATACA TATCAATCAT TTTGTAGCGA CTCCTTTTAA 
CGACAAATGA AGTATACTGC AAAATTATGA TAATATATAA 
TATTTTGTAG ACAAAGATAA ACTAACTCAG AAATTAGCCT 
GATTATCATG AGAGCAAGCA CAATCATTAT GCATTTGAAC 
GAATCATCGG TAGATATAGG GAATATGATT ATCGATGCAT 
AATTATAAAG ATGTGATTGA TATATTAGAA CTAGAAAATG 
CAGG CG ATT A ATAAAACTGT CGGTATTCGT AAACAATTTA 

gatgttgaga ttatcatgcc aatgtttga 
(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:
ACCGCCACCC ATTAATGATT GCTTAAAATC AATAGTCGTA CCATTTAATA CGGGTGCATC 
TTTTTTGTCT ACTAATACTT TTAATCCAAA GTATTCTAAG ACTTCATCAT TTTCACCAGG 
CGCTTCTTCT GCACCCATAC CGTATGTTAA ACCAGTGCAC CCGCCACCAT TCACTTTAAT 



ATTAATAGAA 
TATTTTAGAT 
AGGTTTTTAC 
AGATTACATA 
TAAACAAGAG 
CAATAAACTT 
CCCTTAGCTA 
GGGGTTTTTA 
CAGCATTTAC 
TAATACCGAA 
TATGGACTAA 
GACTCTCTTC 
AAGTAGTAAA 
GTGAGAGGTG 
ATTTACAAGC 
GCATTGCTCA 
TTATTTTAAG 
TTATTACTAA 
CATATGATTA 



AATTATCGCG 
AAATATGATT 
AAAGATTCTA 
TTTGAATATT 
GTTAAAAAGT 
CAAGATGTGA 
GATTGAAAAT 
TCTTTAATGA 
ACCTAAAATA 
TTTATTTGTT 
CATCAAATCA 
GTTGTCATCA 
GTTAGTATAA 
ACAAGGAATG 
ATTAACTGAT 
AATGTTGATA 
GGATCCTGGT 
AGAAACACAG 
CACAGCCTTA 



6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7769 



60 
120 
180 
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TGCTTCTGTT AATATAACTG TTGGCATGAT AACTCCTCCT TAAAAAATCC AAGTTTCTTT 
TATATGTGCA TATATATTTT GTAATAATTC TTCCGGCGAA TCACCTTCAA CAATATCACC 
ATTTACTAAA GCATACAACC CGGCTGAACA TATACCACAA TGTGTCAGGC AACCATACTC 
TAACACATCG ACATCTGGGT CATTTTCCAG TTGATTAAAA ACATAATCTC CACCTTTTGC 
CATGTTAGAG AGACAAAATT CTACGATCGG ATTCATACTT CACCTTCTTA TTTCATTTGT 
TACAATATTA TAG CATTTT A AAACTGGTAT TTTAACATGA TGTGCTCAAT TAGCAACAAC 
TGATGTTTCT TATCCCAGTT ATGTAATAGT GCCTTAGTTA GTAC 
(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 
GAATGATGAA AGGAATAGAA AAGAAAAGAT AAATAATGTA ATAGATTTAT CCGAGAAAAT 
TGAAAGAACA AAAGATATGC CAATCAAGAA TACTATAACT ACTCAATTAG GAAATAAACT 
TATTGGCACA AAAAAAGCTC GTTTTGATGA TAAGAAAGTA GTGTCGTTTG GAGCATTTGA 
AGATGAATAA AATAAATGAT AGAGATTTAA CAGAATTGAG TAGCTATAGG GTTTATCAAG 
ACATCAATAA AGATAATGAC TTTACAGTTA ACGAAAAACG ATTTAAGCAG GCAGATGTAT 
TTGAAGATTT ATATAGAGAG AAACTAAAAG ACACAAATAA ATTAAGAGAG TATAATTATT 
TACAAAATGA ^^^TTTTAA^ A^2Q(^TJ^AA^ TAGGTGATGA GATATGCTTA AAAAAGCAAA
ATTTSTCTTA ATGGCAACGA TACTACTATC AGGATGTTCA ACTACCAATA ACGAATCCAA 
CAAAGAAACA AAATCTGTAC CAGAAGAAAT GGATGCTTCA AAATATGTAG GACAAGGATT 
CCAACCACCT GCAGAAAAAG ATGCGATTGA ATTTGCAAAG AAGCATAAAG ATAAAATTGC 
TAAGCGAGGC GAACAATTTT TTATGGATAA CTTCGGTCTA AAAGTTAAAG CTACAAATGT 
TATAGGTAGT GGCGATGGTG TAGAAGTATT CGTGCATTGT GATGACCACG AyATCGTATT 
TAATGCGAGT ATTCCATTTG ATAAATCAAT wATTGAsAGT GATAGCTCAT TAAGAAGTrA 
GGAyAAAGGy GATGATATGA GTACTTTAGT TGGTGCAGTA CTCAGTGGGT TTGAATATCG 
AGCACAAAAA GAAAAATATG ATAAATTATA TAAATTTTTC AAAGATAATG AAGAGAAATA 
TCAATATACA GGATTTACAA AAGAAGCAAT TAATAAGACG CAAAATAGTG GTTATGAAAA 
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ACCATTGTTA AACAAAAGTG ACAGTGAATT TTCAAAAGAA TTGTCAAATG TTAAGAAGCA 


1080 


5 


ATTAAAAGAT AAGTCTAAAG TTTCGGTAAC TACTACTCTA TTTAGTAAAA AAAAGAACTA 1140

TACTAAAAAA AGTAACAGTG AAAATGTAAT AAAAATGGCA GAAGAAATAA AAAAAGATAA 1200

AGAGATACCA AACGGTATAG AGCTTAGTAT AAAATTTTCG GACAATAAAA TAAATACGGT 1260

TAAACCAAAT TTTAACGGTG aAAGCACTTC AGAATATGGT GTGTTTGATC AAGAATAAAA 1320

TTAATGATGa AAATTTAACG GAGAATAGTG TATATTGAGT AGATCmAGAA TAAAAAGATA 1380

ATTCTACTAT TGTTGTGAAG GGAAATAAGT AGAAGATTTT AAGTGTAATT TCTGGTGATT 1440

TAAATAATAA TATAnATGGn AGTACTGATA TAAnACTTTT TAACCTACTA GATTCTTATA 1500

ATTTGCTTTC CATTTTATGA CGATTTTTAC TCCAATTGAG TGATAGAATC CAAAAAAGCC 1560

ATCTCCAAAA ATTAATCC 1578


(2) INFORMATION FOR SEQ ID NO: 217: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5137 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:






TGTTTTCCTT GGGTTAAAAC ATGCTTGCTA TGCGTTTGTA AATATGACTT GCTGTTTTnA 60

CCTGnATACC CGTCACACCA TGGAAGTAAA AATGTTTCTT GCTCTTGGCT TACAATTTTA 120

GCTTTAATCG CTTCATATGC TTTATATTGG TCTTCTGTTA ATTGCTGTTT TGATTCTTGT 180

TCGAAAACAC GATCTTTAAA TGGGTCTCTT TCAACAACCG CGTCATATTT TTCAACATAA 240

CCTTFTTTGA TAAGTCCATC TAAACTGGAT TTTGAAAAGC CCATATCCTC AATATCAGTT 300

AAAAATATTG TTTTATGTTG TTCTTCAGAC AAGTAAGCAT ACAAATCGTA TTGTTTAATA 360

ACTTTCTCCA ACTTAGCTAA TACTTCATCA GGATGATACC CTTCAATGAC ACGAACAGCA 420

CGCTTGGTTT TTTTAGTTAT ATTTTGTGTG AGAATCGTTT TTTCTTCAAC GATATCATCT 480

TTTAACAACT TCATAAGCAA TTGAATATCA TTATTTTTTT GCGCATCTTT ATAATAATAG 540




~ m . mrl . « » mmm rrwrVT TV TV »T TV n R <Tf2 A GCTCTATGTC ATCTTTCAT C 

TAACCATGCT TATCAAATTT TTGTAA I AAA bL i\*aavjvj i vjv^ i\.iniui«„ niV * 111 


600 


50 


TTAAATGCTT TTTTATACTT CGCTTTAATA GCACTCGGAA GCATCACTTC TAGCATAGAA 660

ATACGTTTAA TGACATGAGT TGAACCCATC CACTCACTTA AAGCTATTAA TTCTGATGTT 720

AATTCTGGTT GTATATCTTT CACTTCTATG ArnTlTlTA ACTTCGAAAC GTCAAGTTGT 780
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ACAATTACAC 


GCACACCAGG 


TTGGATGACA 


GATTCGAGTT 


GTTCGGGAAT 


TATATAATCA 


900 


5 


AATTTATAGT CAACGCTCTT CGACGCGACA TCGACTATGA CTTTCGCTAT CATTATTGCC 960

ACCTAGTTTC TAGTTCATCT AAAATTTGTG CAGCTAATAC tacttttttt CCTTTCTTGA 1020

TATTTACTTT TTCATTATTT TTAAAATGCA TTGTCAATTC ATTATCATCA GAACTAAATC 1080

CGATAGACAT ATCCCCAACA TTATTTGAAA TAATCACATC TGCATTTTTC TTGCGTAATT 1140

TTTGTTGTGC ATAATTTTCA ATATCTTCAG TCTCTGCTGC AAAGCCTATT AAATACTGTG 1200

ATGTTTTATG TTCACCTAAA TATTTAAGAA TGTCTTTAGT ACGTTTAAAA GATACTGACA 1260

AATCACCATC CTGCTTTTTC ATCTTATGTT CTAATACATC AACCGGTGTA TAGTCAGATA 1320

CGGCTGCTGC TTTTACAACA ATATCTTGTT CGTCAAATCG GCTTGTCACT TGTTCAAACA 1380

TTTCTTCAGC ACTTTGAACA TGAATAACTT CAATATCTTT TGGATCCTCT AGTGTTGTAG 1440

GACCAGCAAC TAACGTCACG ATAGCTCCTC GATTTCGCAA TGCTTCAGCT ATTGCATAGC 1500

CCATTTTTCC AGAAGAACGA TTGGATACAA ATCTGACTGG ATCGATAACT TCAATAGTTG 1560

GTCCTGCTGT AACCAATGCG CGTTTATCTT GAAATGAACT ATTAGCTAAA CGATTACTAT 1620

TTTGAAAATG AGCATCAATT ACAGAAACGA TTTGAAGCGG TTCTTCCATA CGTCCTTTAG 1680

CAACATAACC ACATGCTAGA AATCCGCTTC CTGGTTCGAT AAAATGATAC CCATCTTCTT 1740

TTAAAATATT AATATTTTGC TGCGTACGTT TATTTTCATA CATATGCACA TTCATAGCAG 1800

GCGCAATAAA TTTCGGTGTC TCTGTTGCTA GCAACGTTGA TGTCACCAAA TCATCAGCAA 1860

TACCTACACT CAATTTTGCA ATTGTATTTG CCGTTGCAGG TGCAACAATG ATTGCATCTG 1920

CCCAATCACC TAATGCAATA TGCTGTATTT CTGAAGGATT TTCTTCTATA AAAGTATCTG 1980

TATAAACAGC ATTTCGACTT ATTGCTTGAA ATGCTAATGG TGTGAGAAAT TTTTGTGCGT 2040

GATT6GTTAA CATAACGCGA ACTTCATACC CAGATTGTGT TAACTTACTT GTCAAATCAA 2100

TTGCTTTATA TGCCGCAATG CCACCTGTAA CGGCTAATAA TATTTTCTTC ATATTCAATC 2160

TCCCTTAAAT ATCACTATGA CATTTACGCT TTACATCATC ATATGCGCAC AAATGCTCAT 2220

tacitittta TAGATACAAA TTTAGTATTA TTATAACATC AATCATTGGA TAAACTAAAA 2280

AAACACACCT ACATAGGTGC GTTTGATTTG GATATGCCTT GACGTATTTG ATGTACGTCT 2340

AGCTTCACAT ATTTTTAATG GTCGAAACTA TTCTTTACCA TAATAATCAC TTGAAATAAC 2400

AGGGCGAATT TTACCGTCAG CAATTTCTTC TAACGCTCTA CCAACTGGTT TAAATGAATG 2460

ATATTCACTT AATAATTCAG TTTCAGGTTG TTCATCAATT TCACGCGCTC TTTTCGCTGC 2520

AGTTGTTGCA ATTAAATACT TTGATTTAAT TTGTGaCGTT aATTGGTTtA AAgGTGGATT 2580
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TTTArGTGcT CAGCTTCTAC AATACATTGA ATTCtATTCy TCGcAAGtTC TACTTCAtCA 2700 

TTAACTACAA cGTAAyCGTA TAAATTCATC ATTTCrACTT CTkTACGCGC yTCGTTAATA 2760

CGACTTTGTA TTTTCTCATC AGATTCTGTT CCTCTACCTA CTAATCGCTC TCTCAAGTGT 2820

TCTAAACTTG GAGGTGCTAA GAAAATAAAT AGCGCATCTG GAAATTTCTT TCTAACTTGC 2880

TTTGCACCTT CTACTTCAAT TTCTAAAAAT ACATCATGAC CTtCGTCCAT TGTATCTTTA 2940

ACATATTGAA CTGGTGTACC ATAATAGTTG CCTACATATT CAGCATATTC TATAAATTGG 3000

TCATCTTTGA TTAAAGCTTC AAACGCATCC CTAGTTTTAA AAAAGTAATC TACGCCATCA 3060

ACTTCACCTT CACGCATTTG ACGTGTTGTC ATTGAAATAG AATACTTATA TGATGTACTT 3120

GGATCTTCAA ATATnCGTnT TCTAACAGTA CCTTTACCTA CTCCAGATGG TCCTGATAAA 3180

ACGATTAACA ATCCTTTTTC ATTATCCATG CCTTACGACC TCTCTAAGCT AATCTTCTAT 3240

TATTTAAATA TGATATCACA TTGTTCTTTA TATTGTATAG CATATTTGAA ATTGCATGCC 3300

ATAATTTCTA TTAAGTCTAA CAATATCGTT ATATTGCACG ATTAATTTTA ATTAAATAAA 3360

TTGAATTGCA AACTTTTAGA TAATGTAAAA TGTATGGCAT AATGTATGGT TCAATAACTA 3420

TACTGAAAAG TTACAATCAT GTTAAAATGA AACGAATGAT ATGAAGAAGG TGGAAGATAA 3480

ATTATGGCTT ATGATGGCTT ATTTACAAAG AAAATGGTTG AGTCTCTACA ATTTTTAACA 3540

ACAGGACGTG TTCACAAAAT CAATCAACCT GATAATGACA CGATACTAAT GGTTGTACGT 3600

CAAAATAGAC AAAACCATCA ATTGTTATTG TCAATCCATC CAAACTTTTC AAGATTACAA 3660

TTGACTACTA AAAAATATGA TAATCCATTT AATCCACCCA TGTTTGCGCG TGTTTTTAGA 3720

AAACACTTAG AAGGTGGTAT TATCGAATCG ATTAAGCAAA TTGGTAATGA TCGTCGCATT 3780

GAAATCGATA TAAAGAGTAA AGATGAAATT GGCGATACTA TTTACCGCAC TGTCATCCTT 3840

GAGATTATGG GTAAACATAG TAACTTAATT TTAGTAGATG AAAATCGCAA AATAATTGAA 3900

GGATTTAAAC ACTTAACACC AAATACGAAT CACTATCGTA CAGTAATGCC AGGATTTAAT 3960

TATGAAGCAC CACCTACTCA GCACAAAATA AATCCGTATG ATATTACAGG TGCAGAGGTG 4020

TTGAAATATA TCGATTTTAA CGCAGGTAAT ATTGCTAAAC AATTATTGAA TCAGTTTGAA 4080

GGATTTAGCC CTTTAATTAC GAATGAAATC GTTAGTCGTC GTCAATTTAT GACTTCATCA 4140

ACATTACCAG AAGCATTTGA CGAAGTAATG GCAGAAACCA AGTTACCACC TACTCCTATT 4200

TTTCATAAAA ATCATGAAAC AGGTAAAGAG GATTTCTATT TTATAAAGTT AAATCAATTT 4260

AATGATGATA CAGTTACATA CGATTCATTA AATGATTTGC TTGATCGTTT TTATGATGCG 4320

CGTGGCGAAC GTGAACGCGT TAAACAACGT GCGAATGATT TAGTTCGATT TGTTCAACAG 4380
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ATAAAGATAC TGAACAGTTA TATGGTGAAT TGATCACTGC TAATATATAT CGAATTAAGC 4500 

AAGGCGATAA AGAAGTGACG GCATTGAATT ATTATACGAA TGAAGAAGTT GTCATTCCTT 4560

TAAATCCTAC AAAATCCCCA TCAGCAAATG CTCAATATTA TTATAAACAA TATAAyCGTA 4620

TGAAAACGAG AGAmCGTGAA TTACAACATC AAATTCAATT GACGAAAGAC AATATAGATT 4680

ATTTTTCAAC AATCGAACAA CAATTACATC ATATTTCTGT CCATGACATT GATGAAATTA 4740

GAGATGAATT AGCAGAACAA GGCTTTATGA AACAGCGTAA AAATCAAACT AAGAAAAAGA 4800

AAGCGCAGAT TCAATTACAA CATTATGTAT CAACTGATGG CGACGATATA TATGTTGGTA 4860

AGAATAACAA GCAAAATGAT TATTTAACAA ATAAAAAAGC TAAAAAAACT CACACATGGT 4920

tACACACAAA AGATATTCCT GGTTCACATG TCGTTATATT TAATGATGCA CCAAGTGATA 4980

CGACAATCAA GGAAGCGGCT ATGTTAGCAG GATACTTTTC AAAAGCTGGT AATTCTGGAC 5040

AAATACCTGT TGATTATACA TTAATTAAAA ATGTGCATAA ACCATCaGGT GCAAAGCCTG 5100

GGTTTGTAAC ATATGACAAT CAAAAAACTT TGTATGC 5137
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2267 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:
GTTTTATCGC AGCAGTAAAG CTATCAATCG GCGGTTCAAT TGATGATGCA TTAGCAGAAA 60

TCAriACAATC ATTTTAGTTA AAATTTACTA ATAATGAaAA ATGTAAACCT TTTTCAAATG 120

AAACTTTATa AaAAATATGA TAGTATATAT GTAAATGTTT AATAAAATCT GGAGAAATAG 180

GAGGACATTG CCATGCAACA CCTTATAAAA AAACATGTAT TGAATGGCGA GTTTGATTTA 240

GTACGACAAT TGATGTCCGA AACAGATTTT ATGGAATTTG AAGAAGCATA TATTTCAAGT 300

GCGCATGAAG TAGAAAGTAT GATGTTTTAT ACATGTATTT TAGATATGAT TAAGTACGAA 360

GAATCATCTG AAATGCATGA CTTAGCATTT TTATTGCTTG TGTATCCACT AAGTGAATAT 420

GAAGGTGCTT TGGATTCTGC TTATTATCAT GCAGACGCTT CCATAAAACT TACTGACGGC 480

AAAGAAGTTA AAAGTTTGTT ACAAATGTTA TTATTGCATG CGATACCAAC ACCTGTTATT 540

TCAGATAAGA AGGCTTTTGA TATCGCCAAG CAAATTTTAA AATTAGATCC TAATAATAAT 600

GTTGCTCGTA ACGTCTTAAA AGACACTGCC AAACGTATGc gACAaCGTTG TTGTTGATAT 660
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AGTTTTAACA 


TTTGGTTGGG 


TTGGG CATAT 


GTTCCAGCCT 


TTTTTAATAC 


TTAAAAACTA 


780 


5 


ACGAAgTATA CTTGTGTGCA CAAATGGTTT TTATACAACA TTTTATAAAT TTATACATTT 840

TAATAAAGAA CATACGATAG ATGGTTTAAA CCTTGTTAAC TGAGAAATTT TGATATGTAT 900

TCTTCGAAAT TTAACTAAAT ATACGAAATT CAAGAAGCAC AATAATTAAT CATTTTTCCT 960

ATACAAAAGT TCGTATGACT GCATTATAAA AGCATAAATT TATAATTTTT TTAAATGTCA 1020

TTGAACGTGA TAATGTGAAT GGATTGAGCA ATTTTGAAAA AGTGAAAAAT AACCTATGCG 1080

ACTTGCAATT AATTTTCAGT ACGTTATAAT GCACACTGTG CAAAATTAAG GAGGTCTATT 1140

ATTCACATGA TGATGAaTAA AGAAGCAACA AAAATTGGAT TTGCCTACGT CGGCATTGTA 1200

GTGGGCGCAG gATTTTCAAC TGGACAAGAA GTTATGCAAT TTTTCACTAA ATATGGCTTG 1260

TGGGCTTATT TAGGTGTTAT TATATCTGGT TTTATTTTAG CTTTTATTGG GCGCCAAGTA 1320

GCAAAAATTG GTACTGCCTT TGAAGCGACA AATCATGAAT CAACATTACA ATACGTATTC 1380

GGTGAAAAGT TTAGTAAAGT CTTTGaTTAT ATTTTAATCT TCTTCTTATT TGGTATAGCT 1440

GTAACCATGC tAGCTGGTGC AGGCGCAACA TTTGAAGAAA GTTATAACAT ACCTACATGG 1500

CTAGGTGCTT TaATTATGaC ATTAGCGATT TATATTACGT TGCkATTAGA CTTTAATAAA 1560

ATAGTACGTG CACTAGGTAT CGTTACACCA TTTTTAATTG TTTTAGTTGT ATTAATCGCT 1620

GGCGTTTATT tATTTAAAGG TCATGtTTCA TTAGCAGAAG TTAACCAAGT AGTGCCtGAA 1680

GCAAGTATTT GGAAGGGAAT CTGGTTTGGT ACAATATATG GTGGATTAGC TTTTTCTGTA 1740

GGTTTTAGTA CCATCGTAGC AATCnGTGGG GATACTGAAA AGCGTACAGT GTCAGGTGCA 1800

GGCGCGATGT ATGGTGGTAT TATCTATACT GTATTACTAG CATTGATCAA CTTTGcATTG 1860

CAAGTGaATA TCCAACTATT AAAAATGCCT CAATTCCTAC ATTGACGTTA GCAAATAATA 1920

TCCATCCTTT AATAGCAACA GTGkTATCTG TTATTATGCT GGCGGkTATG TATAATACTA 1980

TTCTAGGACT AATGTATTCA TTTGCAGCAC GTTTTACAGA ACCATACAGT AAAAATTATC 2040

ATATCTTTAT TATTATAATG ATGGTAGCAG GTTATTTATT AAGTTnCGTA GGATTTGCTG 2100

AATTAATTAA TAAGTTATAT ACnATTTATG GGATATGTAG GCTTATTnTA TTGTAGTAGC 2160

TGTAATTATn AAATATTTCC AAACGTAAAA ATGGCGGATA AAAAACATAT TGCTTTAATA 2220

TCATATGGAG GGGATATCCG AAACTTTACA ATTTGAATCA CTTTGGT 2267


50 


(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

GTCAATGTAA CCTAATAGTT TATGTCTATC TTGTGTACCA ACTACTACAT CGACACCAGG 60

AATTTCCATA ATTTCAGCTG ATGAAGTTTG CGCATAACAA CCTGTTACAC AGATTACAGC 120

ATCAGGATTT TGTCTTATTG CACGTCTAAT TATTTGACGA CTTTTTTTAT CACCCGTATT 180

CGTTACTGTA CAAGTATTAA TAACAAATAC ATCAGCATTC GCTTCAAAGT CAACGCGCTC 240

ATAGTTTGCT TCTTTAAATA ATTGCCAGAT TGCTTCAGTT TCATAATGGT TTACTTTACA 300

ACCTAATGTG TGaACGCAAC TGTTGACATA AATATTCACC CCATTAATTC TTTTTCATAA 360

CTTATTGCAC TTAACGCATA CAATGGCGCA GTTTCTGCCC GTAAAATTCT CGGCCCAAGA 420

CCAACAACTG TACTAGTATT ACTAAATAAT GAAATTTCAT TTTCTGACAA ACCACCCTCA 480

GGaCCAAAAA TCATCAACAC TTTATCCTGA GCATTGAATT GTTGTAAAGT TTGCTTGAAA 540

TTGCTTAACT CACCATCTTT TGCTTCCTCT TCATATGCAA TAAGAATATA GTCATAATTA 600

TCAATAGTAT CACAAATTAA TTTTAAATTC GACTCGAATT GAATAGATGG AATCACTAAA 660

CGATAGCTTT GTTCAGCAGC TTCTTTAATT ATTTTTTGCC AACGCTCTAT CTTTTTGGCA 720

ACTTTTGCCT CGTTTAATTT AACAATTGAA CGTTCCATGC TCACAGCTAT AAATGATGAA 780

GCACCCAATT CAGTAGCTTT TTGTAGCAAC CACTCATATT TGTCAGCTTT GATTAGTCCA 840

CTGCAAATCG TAACATCAAC TGGCAATTCT GTATTAATAT TTTGTTTTTC TTTTAAATCA 900

ACTTCAATTT TATCACTTGT TATGTCAGCA ATTTCACATA AATAAACTGT TTGATCATTA 960

AAAGTTAAAA TAATTTTACT ACCAACATCA TATCTCATTA CATTTGTTAT ATGATGAATA 1020

TCTTCTTTTT TTGTAATAAA AAAACGCTGA CTTACATCAG CGTTTTGGnT CTATGAAATA 1080

ACGTTGCACA TTATTCACTC ACTTTCTGGC CAACAAGACA AACCCAACCG TTGTCATGTT 1140

GTTCTGAAAT AATTTTAAAA CCTACACGCT CCATATGTGA CTGTATACCT TCATACTTCT 1200

CTTTTATAAT ACCAGAAGTA ATAAAATAAC CGCCTTCATT TAGAGTATTA TAAGCATCTT 1260

CAATCATTTC ATCAATAATA TGCGCTAAAA TATTTGCTAT TACAATATCA AATTTTTCTG 1320

TTTCGTCTTT CAATAAGTTA CCTGGAACAG CTTCAATTAA CGTTTCACAA TGATTTCTTC 1380

TGAAGTTTTC TTTAGCTACA CTCACTGCCA TTTCATCAAT ATCCAACGCT TTAATACGTT 1440

TTACACCGAT TAGATGACTT GCAATACTTA ATATACCTGA GCCAGTACCA ACATCAATTA 1500

CTGAATGCTG TGGCAATACA TATGTTTCTA TTGCCTTCAA ACACATACTT GTAGTCGGAT 1560

GATCACCTGT TCCAAAAGCC ATACCTGGGT CGAGCTCAAT GCAAAGCTCT TCATCCGCTT 1620
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GGAAATAGTT TTTCCATTCA 


TTTTCCCAAT CCGTCTCTGC AATAATTTGC TCACTGAATT 1740

GAACGTTATG TTGATCAAGT TCATCTAAAT TTAATAACTC ATCTTTAATT TGCTGTCGCA 1800

ACTTATCATC ATAAGTCATT TCATTAAAAT AGGCTTTCAA TCTTACTCCC TTATCTGGAT 1860

AATCCTCTTT TTTCAAAGCG TAAATTTCAC CGTATTTATC TTCTGGTTGG TTAATTAAAT 1920

CATCTGAATC TTCTATCACG ACACCATTTG ATCCATGATT TTCAAGTATA TTGGTAGCCA 1980

ATTCTACTGC TTCATGATTA ATAATAATTG AAAGCTCTGT CCAGTTCATA CTTTATTCTC 2040

CCTTAAAGAA TCTTTTTGCT CTATCTTTAA AATTCGAAGG TTGTTCATTA ATTTCTTCAC 2100

CATTTAATTG GGCAAATTCT TTCATTAGTT CTTTTTGTCT ATCTGTTAAT TTAGTAGGCG 2160

TTACTACTTT AATATCAACA TATAAATCTC CGTATCCATA GCCATGAACA 1TTTTTATAC 2220

CCTTTTCTTT TAAGCGGAAT TGCTTACCTG TTTGTGTACC AGCAGGGATT GTTAACATAA 2280

CTTCATTATT TAATGTTGGT ATTTTTATTT CATCGCCTAA AGCTGCTTGT GGGAAGCTAA 2340




CATTTAATTT GTAATAAATA TCATCACCAT CACGTTTAAA TGTTTCAGAT GGTTTAACTC 2400

TAAATACTAC GTATAAATCA CCAGCAGGTC CTCCATTCAC GCCTGGAGAG CCTTCACCAG 2460

CTAATCTAAT TTGTTGTTCA TTGTCGACAC CTTCAGGTAC TTTCACTTCT AATTTAACTG 2520

TTTTATTTTC AGTACCTTTT CCGTGACATG TTGGACAAGC TTCTTCAAAT TCTTGACCAC 2580

TTCCATTACA TTTAGGACAA ACTTGTTCAG TACGAACTCT ACCTAAAATT GTGTTTTGTT 2640

CTACAGCTAC ATGACCAGCG CCATTACAGT AACTACAAGT CTTTTTACTT GTTCCAGGCT 2700

TTGCACCATC ACCATGACAT GTTTCGCATG TTACATCTTT ACGGATTGAA ATTTCTTTTG 2760

TTGTACCAAA TACCGCTTCT TCAAATGTTA ATGTCATTGT ATACTGAAGA TCATCACCTT 2820

TTTGCGGTGC ATTTGGATCT CTTTGTCTGC CGCCACCGAA GAAAGAGCTA AAGATATCTT 2880

CAAAgCCGCC GCCACCGAAG CCACTAAAAC CGCCAAAGTC AGAGCCATTG AATCCTTGTC 2940


CACCAAAACC TTGTGGACCA TCATGTCCAA ATTGATCATA GcTTGCGCGT TTATTATCAT 3000

CACTTAAAAC TTCATAGGCT TCAGAAATTT CTTTAAACTT TTCATCTGCA CCTTCTTCTT 3060

TGTTAATATC TGGATGATAT 1 i-i-l-lCGAAA GCTTTCGATA CGCTTTTTTG ATTTCATCTT 3120

TTGAAGCATC CTTACTAATG CCTAAAACTT CATAATAATC TCTTTTGGCC ACAGCTATCT 3180

CTCCTTTTCT TAATTAACTC ATATAGTTTA ACGTAATATG TCATACTATC CAAATAAAAA 3240

GCCAAAGCCA ATGTTCTATT GACTTTGACT TTTCAGATCA TGACAACATT CTAATTGTAT 3300

TGTTTAATTA TTTTTTGTCG TCGTCTTTTA CTTCTTTAAA TTCAGCATCT TCTACAGTAC 3360

TATCATTGTT TTGACCAGCA TTAGCACCTT GTGCTTGTTG TTGCTGTTGA GCCGCTTGCT 3420
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TATCTTCTAT 


ATCTTGACCT TCTAAAGCAG TTTTAAGAGC GTCTTTTTTC TCTTCAGCAG 3540

ATTTTTTATC TTCTTCACCG ATATTTTCGC CTAAATCAGT TAAAGTTTTT TCAACTTGGA 3600

ATACTAGACT GTCAGCTTCG TTTCTTAAGT CTACTTCTTC ACGACGTTTT TTATCTGCTT 3660

CAGCGTTAAC TTCAGCATCT TTTACCATAC GGTCGATTTC TTCGTCTGAT AATGAAGAAC 3720

TTGATTGAAT TGTAATTCTT TGTTCTTTAT TTGTACCTAA GTC"iTTTGCA GTTACATTTA 3780

CAATACCGTT TTTATCGATA TCAAACGTTA CTTCAATTTG AGGTTTACCA CGTTCAGCTG 3840

GTGGAATATC AGTCAATTGG AATCTACCAA GTGTTTTATT ATCCGCAGCC ATTGGACGTT 3900

CACCTTGTAA TACGTGTACA TCTACTGATG GTTGATTATC TACTGCTGTT GAATAGATTT 3960

GAGATTTAGA TGTAGGAATC GTAGTGTTAC GTTCAATTAA CGTATTCATA CGTCCACCTA 4020

AAATTTCAAT ACCTAAAGAT AGTGGTGTTA CGTCTAATAA TACTACGTCT TTAACGTCAC 4080

CTGTGATAAC GCCACCTTGG ATTGCAGCTC CCATTGCCAC TACTTCGTCC GGGTTTACTC 4140




CTTTGTTAGG CTCTTTACCG ATTTCTTTTT TGACAGCTTC TTGTACTGCT GGAATACGAG 4200

TTGATCCACC AACTAAGATA ACTTCATCGA TATCTGAGTT TGTTAAGCCA GCGTCTTTCA 4260

TTGCTTGGCG TGTAGGTTCC ATTGTTCTTC TAATTAATGA ATCTGATAAT TCTTCAAATT 4320

TAGAACGAGT TAAGTTTACT TCTAAGTGTA ATGGACCGTT TTCACCAGCT GAGATAAATG 4380

GTAATGAGAT TTGAGTTTGT GATACACCTG ATAAGTCTTT TTTAGCTTTT TCAGCAGCAT 4440

CTTTCAAACG TTGTAATGCC ATTTTATCTT GAGATAAGTC TACGCCATTT TCTTTTTTGA 4500

ATTCTGCAAC TAGGTAGTCA ATAATTACTT GGTCAAAATC ATCACCGCCA AGTTTGTTGT 4560

CACCGGCTGT TGATAGTACT TCGAATACAC CGTCACCTAA TTCTAGGATA GATACGTCAA 4620

ATGTACCGCC ACCTAAGTCA AAAACAAGAA CTTTTTCATC TTTATCAGTT TTGTCTAAAC 4680

CATATGCTAA TGCTGCAGCT GTTGGTTCAT TAATGATACG CTCAACTTCT AAACCAGCAA 4740


TTTTACCAGC ATCTTTAGTT GCTTGACGTT CAGCATCGTT AAAGTATGCA GGTACTGTAA 4800

TTACAGCTTT GTCAACTTTC TCACCTAAaA TAgTTTCAGC TGTATfTTTT AAGTTTTGTA 4860

AAATCATAGC TGAGATTTCT TGTGGTGTGT ATGATTTACC TTCAATATCT ACTTTATAAT 4920

CAGTACCCAT ATGACGTTTA ATAGATTGAA CAGTGTTTGG GTTTGTAATA GCTTGACGTT 4980

TTGCTACTTC aCCAACTTGA GTTTCTCCAT TTTTGAAAGC TACAACAGAT GGTGTTGTAC 5040

GTGAACCTTC AGGGTTTTGA ATTACTTTTG GCTCATCGCC TTCTAATAcT GTnACACATG 5100

AATTTGTTGT ACCTAAGTCT ATACCAATAA TTTTACTCAT AATAAAATTC CTCCATTTAA 5160

TCATTAAATT AATTTAATTT TAAACAATGT CTTTTCGCCA AATTTAAGTT ATTGGTTTAC 5220
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AGTGATTTCG CCAGATTCAA AATCAGGGTT AT CATCTTG A ACTACAGCTT GGTGAATATT 5340 

TGGATCAAAT GCTTCACCTT CAGTTTTAAT AACTTCAAGA CCATTATCTT TTAGTGCGTT 5400

AATCAAACTT TCATGCACCA TTTGTACACC TTTTTGAAGA GATTTAAAAG TCTCATCATC 5460

ACCTTCAATT TGAAGTGCAC GTTCTATATT GTCTATTGCT GGTAAAATAT CTGTTAACAC 5520

ACGTTGTGCT TGATATGTTT TGTTTATTTC ATTTTCTTTT TGAATTCTAC GCTTATAATT 5580

TTCAAACTCA GCGTAGAGCC TTAAATATTT CTCTTCGTTT TCATCTGCTA ATTGTTGAAG 5640

TTCATTAATT TTTTGATCTT TTGGATCTAT TTCTTCAATA ACATTCTCGT CAGACGTTTC 5700

TTCTATTGCT TCATCTTGTA AATGACCTTT ACTTTCTTCA GCTTGTTCAA CTGAATCATC 5760

AATATTTTGT TTGACGTTTG TTTCTTCAAC TGTTGATTCA GTGTTTTTTT CAACTGATTC 5820

GTCTTTATTT GTCATTTTCT GTCCTCCAAT ACTTTCTAAT CCATCATTAC CAAATTCTAT 5880

TTAATAATTG AATGACATTT TGATAATGCA TAGCTGTAGG TCCAATCACA GCGATTTGAC 5940

CTTTTAACGT TTCATCAAAA TGATATTGAC TTGTTACAAT TGAAATATCA CTTAAGCTGT 6000

CATCAATTTC ATTACCAATT TTTACATTAA TATTTGGTGA AGATATATCT TGTAATAATT 6060

CTGCAATTCT ATTTGATTCT ATATATTGTA GAATGGGCTG AATTGAAGAT ACATTACTTT 6120

CATTCAATGC ATCAATAAGT TTAACCTTTC CACCCATATA AATGCTATTA CTTTGATTAG 6180

AAATATGATT ATTCATCGTA TTTAACAATT TATTGATAAA AATTTCTTCC TGCTCTGATT 6240

GAaCAAAAGA GACAATATCA TCTTGTAAAT TCTGATTAAA CTCAGTTAGT TTGTTTGTAA 6300

CAAAATTTGA TATTGTATTT AGTTTGTCAT TATTAA 6336
(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13059 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

TTCATGATTA TTATCTGTTG TAGACACTGC TGGATCTTCC GATGTATCTT TCGATGCATC 60 

TTTCGATTTG TGTATTTGCT GATTCAAATG GTCTAGGTCT TCTAACGCCT TATTTACCAT 120 

TGCTTCATCA TTTTTATCAT CTTTTTCTCC ATGTTTTGTT GTAGCCGTTT GTGACATATC 180 

ATTTTTCATT GCATTAAGAT CGTCCTCGCC ACTTTGTTGA CCCCTATCAA CATTTGAAGA 240

AACCTCATTT AAATCTTTAA GCAATTGATC TAATTTACTG TCTATATCAC TTTGACCGTT 300 



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TTCATCTATT TGCGATGCTG TTTTCGCTTC ATTTAGTTGT GCTTTATAAT GTTCTTTAGA 4 20 

TGAAGCCGAT AACTGTTTTA ATTGCTCAAT TTGACGAATT GCCTTGTCAA CTTTGTCTAA 480

TAAATCTTGC TTAGATAATA TCTCTTTTGT AATTTCAGTA TCCTTTTCAG ATGCAGCTTG 540

GGCATCGTAC GGCAAGATAT TCGTTAAAAT GATACTTGTC GCCATCATTG TCGAACACGA 600

TAACTTTACA TATAATTGAA ACGGTTTCCC TCGATATTTA GCCATCAACA TACTCCTTCC 660

TCACTTACTT CCTTCAAAGA ATTACATACT ATTATATACC TGTTTACAAG AAATTTACAC 720

TTATCTATCT AGTTATTGTT GTTAGTAATT ATCTACTTAT TACTTAGCTT ATATTTAAGT 780

AAACAAAACA AGCATGACGT AATATCATAT TGTCCATGTC GCTAACATCA TATTACGTCA 840

AATCTTTTAT ATTAAATGAT GTTTTATTTT AGACTGCTTT TTCCTTTTAG CTTTCGAGCG 900

CCTGTTTAAA AACTTGCTCG AATTGTTCAC GCGAGATTTC GTGTGCATGT GCTTTTTGTG 960

CTAATAAAGC ATCTCGAAAC TGTTGTTGAT CTTTCAAACT TTCTAACATT TGTATTAATT 1020

GGTCTTTACT TTCCATTGTT ATCTCATCAT TATGCTCAAA TAAGTGCTCT GATAATGTTA 1080

CTTTAGCATG GTGTGCGGTT TGACGATAAC CTAAAATCAA CAACTCATAG TCAAACGCTT 1140

GTTCCACCGC ATTTAAAATT TCATTACCCT CATTGATATC AAGATAAATA TCACATAACT 1200

GGTATAGTTC ATTTACCCTG TCAATATTAA TAGATGGGTA TAAATGCACA TTAGCATATT 1260

GATCAAGTTG CATTAGCTTA TCAGACATCT CTGTAATAGC AGCGATGTGn AACTTAAAAT 1320

CTGGTAAAGt TyCAACCAAT ACCTTGATGT TACGAatTGa TCCgAGTTAG TTAATATTAC 1380

AATTTCTTTA GTATATCTAT TACGACTACG ATAGTTATAT AGATATCCGC CTTGTAAAAT 1440

ACGAGATTGA- ACCTTTGCGT^CTGC^WTT^ GAGCATCGTT TCATATTCGT TTTTATCTGG 1500

AATAATAATA TTACAATGTC GTTTCATATC ACCTTTACAC ATCAATTGCA TATTTCCCGG 1560

GACATTACCA TTACAGTGTT CTTGCCATAC CAAAACATCA CTACCTTTTG ATGGCAAATT 1620

ATATAACACT GAAAATGGTA GGGCTAGTGA GTTAATAACG AAATGATGTT CCGTAATTTC 1680

AAGTTGCTTG ATAAAAAATA ATGCGAATGC GAGCTTTGAA GGGAAAAAGT AAGACTTCCC 1740

TTGCCAATCC AATATGACAT CAGATGTTAC AAAATTTTCA TAAATCACTT CTTTACCTTC 1800

TGCTGTCATA TATTTCTTCA AGATCGCTTT ACGATTTAAA TCGTAAACAG TTTGTGCAAA 1860

TTTAATACCA TTCTTAGAAT AATAATCGAC AAATCGGACA CGTTGTTGGT CATCAAACCA 1920

TTCGACACGA CTAACAATTC TAGGGCGCTC TCCACTTTGA yAAAATATTT TACCTCGyAG 1980

ACGTCCCATA TCATTaATTG TAGCCGAATT GTTGTTACCT TTAATTTCCC AAAAAGCTGG 2040

TACAGTAACC TGATTAAAAA ATCGTGGTTT CATATTTTCT GTATTATGAT TATCTGCAAA 2100
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TAAATCTTCT TCCAACTTAC m ^ AAC7TTCGTG AATGATCGTT 

s AATAA^AA TOtmoa CCTCTTGaAC TAATGTTTCC CATTTTAAAA 

TAATA TCTTC AGTCATAAA* CTTCfiTAAGA GATGTCATGT TO 

CACC^ ACASTC0CAT GAOTsOTr TCcoATaAM ra 

„ gg^caaaa MccAmc cccatct ^ M 

GGTTACCATA ATTCACATTT AATCCAATCA TACCTAG.CC ^AGCCTACC GCTTCCATTA 
™CCC AAAACCTTCG c™„g aamam _ 
TTTCATCAAG ^AACA^C CCTAGTAAAC GAATATAATC ^CGCOG 
TAATTTTACG CAGTCGC^C 

CCACT^ TTTAGCCACG ATAACCCTT ^ACAAGCC, ATCAATATGT TTCTCATTCG 
CTAAACGAGA TGCACTAATC ATCGCATA.G GCTTTCTTGA TAATGTTGGA TATGATAATO 
CATCAATGCT .CCCACAGGA ATAG.ATAAA CACX^CG AXAACC^A TATTGCTCAA 
ATTGTCGACA AACCATATGA X^GAA^ 

TAGCTTTTGA AAA^A, TCATAATAAT TGTTCCATAG TATATGCTGC TCACTCATCA 
TATTATTACT ATAA^c. GCATGAATCA CAACACCAAC TTTACTATCA „ 
GCAAAACAGC CTGACCAATA TCAGAAGCGC GGTCTAATAT GACAATATOG TCTCGGGTTA 
AA^AATcg XTGTAAAAAG TATGCAA.AA mmm GTTATACAAC ACCGCATCTT 
"AACACATA TATAGAGCTG TCTCCATCAA TATATTCGTT ATAAGCGATG GAACCATCTT 
GATTATAAAA TATAA^ ™a T C AGC.GG.GCA TAATACXCAG 

AAAATATGCG COTATAACTA TAAAAATCTT TACGTACTAA CATACTATTA ATTACAAATT 
-CACGATC CACAATArCT TTTTGTTCAT TTTGCAGATA ACATGTTACA AA^A^ 
TCCCATTAAA ATATAGGOGG ACXATCTTAC amm TCTAAAACTA A^CA^AC 
CAAGCTCACG TTCAATGTCA XCXAACG^ ACG^^G TGCTAAAGAA ATATCACTAA 
AATACTGATA CAACCAAATA ACTTCTTGAT C^AAACCC AA^^GC GTTAATGTCT 
GTATGTTCTC TGACTGTATA AAA.CXAAAA ACACAAATTT AGTGTCTTGA 
TCAATAA^T AGCACGGTAA GCTXCXGCAT ATTCAACACC GCTACTCGCC CAGCCTATAC 
CAAAGTTTAT ATTATATATT GTCATGCGCT ACCCCn^C ATTTATGGAA AATGTATAAC 
*™CC .C^ATCAA A TCT AA,CA T GC^ACAA A TOTOTO a CCAX^ 
TTXGATATXT CG^aa CTTCAAATGA ATCTAAGGCA ACXCXATGGT ATTCAAAAAT 
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GACTTGTTCT AACCAACATG AATCAATTGC TTTCAAAAAG ACTTTTTGAA CGAAAATATT 4 020 

ATAATAATAT GCACTTTGCA TGTTTTTACG ATTCAAAGCT AATTGCTTTT CAAATTGCTC 4080 

TAATAAAAAT GTCACTACTG CTTGCTTATC TTTAAAATTA ACACAAGCCA CATCTTTATT 414 0 

AAATTGGAAA CTTAAATTTT GATAAATATA CTCGACAACA CGCGATTTTG TTAGCACCTT 4200 

TTCCTCATTT ACAAACATTT CAAATACATC TTTAGCTAAC GCTTTAAAAT CTTGATTCTC 4260 

AGCATCATCT ATTTCTAAAA CTCGATTGCG TTCCTCGTAT ACAAGATCTC GCTGTATACT 4320 

AATGCTTTTT TCAAATTCAT TAGCCATTTC ACGAGCTTTA ACCCCTTGTT CTTCCGAGAt 43 80 

15 aCGcTGCGCT TTAACTACAA TTTGCTTAAC TTTGCGATTA AACAAATTAC TTTGCGATAA 444 0 

TCGTTGTGCA TCTAATGAAT ATAATTGATT ATTTTCCGCT AAATTACTAT CGCTCCATCG 4500 

CTTAACTAAA TAATCATCTA GTGAAATATA TATACAAGAT GATCCCGGAT CCCCTTGTCT 4 560 

ACCAGAACGA CCACGTAATT GCCTGTCTAC ACGGCTATTT TCCATATGTT CATGAATAAT 462 0 

AACAGCTAAT CCACCTAATG CTTCGACACC TTCAC CAAGT TTAATATCTG TGCCTCGACC 4680 

TGCCATACTA GTCGCAACAG TCATGGAACC AATTTGCCCT GCTTCAGCTA TCATCTGCGC 474 0 

TTCTTTTGCA ACATTTTGCG CAATGAGTAA ATTATTAGGA ATATC CATTT GGAATAATAC 4 800 

TTTCGAAAAG TATTCAGCCG CTTCAGCAGT TCTCGTTATG AGTAAAACCG GTCGCCCCGT 4 86 0 

TTCATGAAGT TCAACTATAT CATGAATCAT CGCGATGTTT TTCTCATCAA CTGAACGAAA 4 92 0 

CACTTTATCT GGTTCATCGA TACGTTGAAT CGCTTTATCA GTTGGTACTT GTACGACTAT 4 980 

TTTTGAATAC AAATCAAAGA ACTCTGATTC GCCTAATTTT CCTGTAGCTG TCATACCTGA 5040 

55 AAATGATTCA AAAAGTTTAA ATAAATTCTG GAAGGTAATT GTTGC CATAA CACTTTTATC 5100 

TGTTCAAACC TCCATACCTT CTCTCGCTC^ ~ SiSB ~ 

AGTTCCCGGT AACATACGAC CTGTAATACG GTCAATTAAA ACAATATCAC CATTATATAC 5220 

AAAGTAATCG ACATTAGATT CAAACAAATA TTGTGCGCGC AGTGCTAAAT TAATATTACG 528 0 

CACTAGGACC ATCGCTTGTT CGCTATATAA ATCTTCAACA TTAAAGTATG ATTGTGCCGC 534 0 

TTCAATACCT TGATTTAACA GC CAT ATTTC TTTTTTGGTC TTCTTCATTT TAAAATGCAC 5400 

GTCTTCAATC AATGTATCTA CAAACTCTTT CACAATATGA AATAGATTTG ATTGTAATCT 5460 
TGGTGCACCC GAAATAACTA ATGGTGTTTG AGCAGCATCT AAAATGATTG AATCCACTTC 5520 
ATCAATAATA CCGTAATTTA ATTGTGGTAA AAATTTCCCT TCCGCACTAT CAGCCAAATT 5580 
ATCAATTAAA TAATCAAAAC CGAGACGTCC ATTAGTTGTA TATATAATAT CATGTTCATA 5640 
TATATTACGT TTTTCCCCTT TTTGATACTC ATAATCCACA ATATCAACAA AACCTAATGA 5700 
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TAATCATTCG TTGTAATTAA ATATGTTCCT TTTCCCGAAA GAGCATTTAA ATATAAAGGC 
ATCGTTGCCG TTAATGTTTT ACCTTCGCCT GTTTGCATCT CCGCAATGTT ACCTTCATGC 
AATACAATCG CTCCGATTAA CTGAACTTCT TTAGGATACA TACCTAATAC TCTCCAGCTC 
GCTTCACGTG CCACTGCATA AGCTTCAGGT AACAATG TAT CTAGTGTATC AACTCCTGAT 



5820 
5880 
5940 
6000 

GCTAAACGTT CTTTAAATTC TATTGTCTTT TGTTTTAACG CATCATCAGA ATATGATTTA 6060

ACTTCATCGC TCCATGTATT GaTGsGTTcA CTATTTTTCT AATCGACTTT AGTCTTAATT 6120 

CGTTTATCGT AACATCTAGT TTATGTTTCA TTTACTTCCC CACCATTCAG TTTCGATACA 6180 

TCTAAGTAAT CTAAAAATCG TACTGGATTC ATTAAACGTG ACATATAATT TAGATGTTTG 6240 

TCTTGCTCTT CTTTAAAATA AACCTCGACA TTTGTATCTT TTAGTTCATG ATTTCCTGGG 6300 

ACATGTTCTG TAAGCCATCC TTTTAAATCA TCATCTTCAT GGCTTGTACG ATACACTTTG 6360 

CAACCCAAAT GCTGAGCGAC ATAAGTTGCA AAAACATTTG ACTTTGACCC ATAACTAATC 6420 

AAATTAATAG CCTTTAGGGT ATCTTGACTT TGCAAATCAT TCTTTAGTTG CTTAATATTT 6480 

CCCTCGATAT TGTCGTCCAT CCAACGTTCA ACGAG CCAAA CATGACCAAA CAGTTTCAAA 654 0 

AAATCATTCG AAATAGTTGG ATAGGTGTCA GATGGTTCTG CAATAATGAC ATTGATCATA 6600 

TCATTTCCAT ATTGGTCATC GCCTATCTTC GTCACCCGCA TGCTTTTATA CTCTAAATCA 6660 

^ TATTGATGCG TCATCTCTGT GATTGTTAAA CATCTAAATA TAAGACTCGT CGATGCTGCA 6720 

TTCATCATTT TTATTTTATA AGCATAGGCk TCATCAGGAT ATTGAATCGT AATACTATTT 6780 

GACTTTACAA TCTCAGTACT TAG TTTTGTG CCATTTTTAT TATAAAAAAT GATGATAAAA 6840 

3S TACACTGAAC CAGCAGGCGT TGCATCAAAA TCAAAATGCA ATTTATAATG CTGTCCTCTA 6900 

CGCAAAATTG GkAAACTTGG CGcACTTTTA TATTTTGAAA ATTGCTTTAA CATCAACCAC 6960 

TCATGAATCG GTAATCCAGA GGGCATCAAA GGATTTATAA AAGTCACTTC ACCATTTGAA 7020 

AATGATACTT TAGAGCCATA CATAAATGTA GTTTGTGAAA TATAATTCCA AGTAACTTTA 7080 

AATGTTTTGT TTTTCAGCAT GTTGAACTCT CCCAAACTTG TCTTCCAAAA TAATGTTGTA 7140 

AAAATTAACA AACCAACTTG CAATGGTAGG TGAATCATCA TTATGTCGCC CAGGAATACT 7200 

GCGATTCATC ACTCTTGCTT GGTGTGCTGT CAATACAGGT AATAG CTCTT GAAATG CATG 7260 

TGGATCATAA TCATCATGTT GCATATATGC TATGGCAAAA ACAGTTTGTG ACAATGATTy 7320 

CTTTTGAAAT GTTTG CCAAA ATTTTTGATT TAATG CCTGT ATCGACGCTT GAGATGTATC 7380 

ACCTTCATTA GACACCAGGA CGTCTAATGC TGTACCGAAC TCTTCTGGTC TAAGTAATCG 7440 
CATATGTTCA GCAATCGTTC CAATATTAAC AAGTGGTTTA CCAACAATAA TTGCCTGAGG 
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TAATTCATGT GATTTAAAAT TCAGCTTTTC TAATGTCTCG TCAATAACAT TGATAATACC 
TTGTTCATAT TCAGATGAAC CGATATAAAA ACTACCACCT TCAACACGAG GATCGCCGAT 
AAGTAAAAAC GGTGCATTCA TACGTTTCAT CATATAATAT CCTTCGAAAC CTTCCGCTGT 
TCGATAACCA CTAAAATATA CGTTTAGTGG CGGTTTCATA TCACCAGGGT GGAAATAATA 
AATAAATTCC TGTCGTTGAC TATCTACGAA ACGACTACCA CCAAGTAAAA ATTGACCCAT 
GTCTAATCTA GACCATCGTT TGTGTATAGG TCCTAAATGT ACCGTCCCGT TCCCACGCGC 
CTTAACAGTT ACACTTATAT AAGCATCAAA TGGTTTCGCA GGTATCTCTA AAGGACTGTC 
TAACAT AT CA TCAGTCAATA CGATTTGTTC AATTAATGCA CCATCAGCGC CAGTCTGAAT 
CAATCTAAAT GTATATTGCA ACTCGACCGC ACCATCAATA TCAAATTCTG GC CAT ATTTG 
AATGACTTTA TCTTTATCGT AAACGAGATT ATTTTGCCAA GATGCGATAG GTTTAAATTC 
TTTCCCAAAT TCTCCACTCA ATGTGAGCTC TGAATTACCT TGGTAAACGA CATCTCCTTT 
AAAATTCGGA TGCACAAGTG CTAACTTAGG AGAAACCTTA TCTCCATACT GTCCTGAGAA 
GCTAACTGCC TCTAATTTAT TATTACGTTC TTCAATATTC CGGTAATGTA ATGGTTGAAC 
AACGTATTTT TGGACATTTT CGTCTTGTTC ATATTCAACT G AC CAAAATG ATTCATCAAC 
ATACGTATTG TATGGTTCGC TTATCATTTG TAATAAATTC GTTAATGTCT CCGAGTATGG 
TGCTTGAATA TAGATAAAAT CAAAGCGCCC TTCTGCTTCA ACAATCGCTT CAATAGCCTC 
TACATAACCA CTATCAAATT CAAACAATCC AATATCGAAG TAATCCCAAC TCACACCTTT 
TTTGTGTTGA AAAATAGGTT CTAAATCGTC TCCTCCAATT TGCAAAACTC TAAATTTACG 
TGGCATCATT TTCACCTTCT ATTAACTCAT CGAGCTGATT AATAATATTC TTAGAAGCAT 
ATGCATCTAT TAATTTTAAA -GAATAGGCGI--ACGCATAATT _CCAATTTTTC_ A^ATAAAATA 
AATAATAATT TAACGCATCA TCTAATTCAT CAACTGTATT TATAATACGG CCATTGTCAT 
AATCAGAGAC GTAATCTGTT TGTTGACCAT TAATTTGTGG AATCCCAGCG CTAATTGCAC 
TAATTTGTAA ATACAAGTCA GGTTCTTTTG ACATATCTAT CACAAGTCGC AACGTCCGCA 
ATGCTTCTAC AACATCATGT TCAGCATGTA TCGTCTTAAC AG CAATGATG TCATCTTGAT 
CTTCAGGTGT CATTAATGCT GAAACATTAA CATCCGCATT CTGTTTAGCT TGGTATTCCT 
CATTTACCGA CGTAATACAT TCACGAAGCC ACATCGGTAT GTCATTTTGA TGGCGCGATA 
ATAAAATTAA ACGGTAATAA TCTTCCTGTG CGATATAATC CACAAGTCGT TGCATCATTT 



35 



40 



GTTGCAAATC AGCGTCACTC ATACCATCTA TCCATACACC TATAAATGTT TCCATCAATT 
GACTACTTAT ATTAGGTGAT TGTCTCGTTT CAAATGGTGT GATTCGAATC ATTGTATTCT 
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TTAAATGGGC 
ATTCTTTACG ATAGATTGAT ATTCCTCATC TGACACAGTT TCATTTCTAT 9420
TTTTAAAAAA TGAATAACTT AATGATTTCG CTGGAATATG ATTGGCTATT TGTCGATTGT 9480
GCCTAGCATC TGAAGCCACA ATCACATGAT CATCTTCATG TATTTGTTGT GCAATCATTG 9540
CTTGAAATTT TTCTTCAATT AGTTGAGCCA TATTGTTATA TTCTGTTTGT TGATAGTGAT 9600
GTTGATATCT TTTTGAAACA GTGACTCTGC CATTTTTCAA ATCTTCATGA AGTACACAAT 9660
CTCCATTAAT CGTTAAATAT TCTTGGTAAG AAGCCTCTCC CTGATCATCA AAATAACGTA 9720
TCGCTGATAA ATAACCTCTG TCATCAAAAA TATAACGCCG TTGTAACTGA TCTCTTTCAA 9780
ATTCTTCAAA CCAAATTGAA TACCCTTCTT GACTAAAATA AATATTTGTA TAGGTCTGTT 9840
CACTCGTCAC ACATTTTAAT AAATACGGTG TGTACACAAA CTCAACATCA TCCGGCCATT 9900
TTAAGTGATG ATAATTAATC GCTTGTGGCG CATGGTGACT GAATCCTTGA ATTTCATCAA 9960
ACACAGACGA ATACTTTGTC TCATATAAGT CATATCGATG TAAAAATGTT CTTAAATTTG 10020
GTGCATGATT GAGAACAATC AGTTGATAAT CTAAGTCATT TTCAAGGTGC ATTCCCATTA 10080
AACTAATCAT ATCGTCAAAT TCCGTCTTAT TTTGTAGTTG ATAATACGGC ACAGTCGTGT 10140
CTTGCCACCA TCGTTGGTCA TCGTACCAAG CTGGAATAAA GTATTTCATA ATTACCTCCT 10200
TACCAATACT GGTTTAAAAA TGGCTTATAT TTATCAAAAT ATAAATATGT ACGAATTGTT 10260
TCTGCAATAT TAATACTGAT GTAAACTAAT ACAATCAGTT GTACTGAGaA ATAAATTTCA 10320
GTAGATAAAT GCGGTACAAA CAATGTGAAA TAAAGCGGTA TACCAATAAT GACTGTAACT 10380
AATGCCAATC CAAACCAACA TACGCGTCGT GCTTGATAAT TTAAATAACG TTCTGTATCC 10440
TTACCAGGTT TaACTCCTGA AAAATAATTG CCACTCTTTA AGAAATCTTT GGATTTTTGT 10500
TTAGTATTGA TTAAAAATCT CGATAAAAAA TAACCCAATA ACATTTGAAT CACTAAATAT 10560
ACTGAAATAC CTACTGGACT ATCAAATGTC AGCATTGGCA TGTCATCTGA TATGCTTTTA 10620
TTAAACATAG ATAAAATAAA ATGAATGCCA CTTTTTAAGA AAACAAAAGC TGAAATACTC 10680
ATCATTAAAG TAATACTGCC TGCAGGGTTA ACTTTCCAAG ATAAATAAGA TTTCATATTT 10740
GTTGCGGAAA CGTTCATTAA ATCGATATAT GGTATTCTCA CTTCTACTAA TTCAATAAAT 10800
AATAAGATAA ACAATGTGAT TATCACAAGG ATGATTAACA ACGCAATCAC AATATGACTT 10860
GCATCTATAT ATTCCATTTT TTGATGCATC ATTGATTTAA TAATACTAAC CATTACAATC 10920
GGCATTGGTC CTGCGATGCC GTAGCGACTA TTTTTGTCAG CTAACCAAAC TAATAACATC 10980
GTTCCAGTAA CCAAAATCAA TATTGTTAAG TAAATATTGT CTTGATGAAC ACGTTCTTTC 11040
GAAACATATT CATGAATCAC AAAATAACTT TGAATAACAC TTAAAATTAA TGTTAAGATG 11100
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GAAATCAGCA 
TCAAGATAAT CATTGATGTT AACCACGGAC CTAAtCCTAA AGTGAAAATG 11220
TTTAAAGTAT TAACGTCTCC ACCCATATTA GAAATAGCTA TTTTAAAAAA TGACTCATGT 11280
TTTACTTGCA TATCGTTaTA GGAAACGATG GAAATGTTTG TGCCTAATAT ATAAATAaAC 11340
AAGATAAAAC ATGTGTATAG CATACGTTTA TATATAATTT TATATTCGTA TTGTTGTAAA 11400
AGTTTTAACA TGTTGCACCT CTTTTATATC AAAAACATTA AAAAGACTAA GGGTTCATCA 11460
CTAATTATTA AAATCCTATA TCGATTTTTC TAGTGATTGG TGCCTCAGTC TTTTTAATTT 11520
TAGCCAGCTA TAAATTCAAT TTATGCTTGA GAATCATCTT GATCATTTTC ATCTTTCTTT 11580
TTCTTTCTCT TCATTAAACC TAAACCAACT AATAATGTCA TAACGCCACC TAGTAATCCA 11640
TTTTGTTTTA TTGAGTCACC TGTATCTGGC AATCTTTTTT CACTTTGTGC TGGTGTGCCA 11700
TTATGTTTAG TCACTTCAGA TGTTGCACTT AATGTAGACT GAGATTCACT CGTGCTCGTT 11760
GTTGCTTCAC TTGATAAGCG AGATGTGCTC GTGCTGTGAG TATGATGCAT ACTCATTGAG 11820
TCTGACGGAT GCATTGAGTT AGATTCAGAT GTACTTGTTG AGCCGGACAT ACTTGTTGAT 11880
GTTGAGTCAG AAATGCTTTG TGAACCAGAC ATAGATGTAC TCAGTGATTC GGATGTGCTT 11940
GTCGAATCGG ATGTGCTCAA TGACGTTGAT GTGCTTGTTG ACACTGATTC TGAGTCACTA 12000
ATTGATGTTG AGTCGGATTT GTCTTGTGAC ATTGAAACAC TCGATGAATT AGATTCACTC 12060
ATTGATGTTG AGTCAGATAC GCTCGTTGAA CCTGAACCAG ACGTACTTAA TGATTCAGAT 12120
ATGCTTGTTG AAGTTGAACC ACTTGTTGAG TCCGATGTAC TTGTCGATGT CGAGTCTGAA 12180
TCTGATGTAC TCAATGATTC TGAGTCACTG ATAGAAGTTG AATCACTTGT AGATTCTGAT 12240
TCTACTGTAC TTTGTGAACC ACTGATACTT ATTGAAGTAG AATCACTGAT ACTGTCTGAT 12300
GTTGATAATG ATGTCGACAC CGATGTGCTT TGTGATGACG ATGTACTAGC ACTCATTGAC 12360
ATTGATGTTG ATATCGATGT ACTTAAGGAA CCAGATGCAC TTGTACTTGT TGACTGGCTT 12420
TGTGACATTG AATCACTTAA TGATGTAGAT GTGCTTGTTG AGCTCGAGTC ACTTACACTT 12480
GTTGAACCTG ATATTGAGTC ACTTAAACTT GTCGATGTTG AAACTGAtwC GCTTCCGCTC 12540
ATTGAGTCAG ATGTTGAAAG TGATGTACTC GTTGAATTTG ATCCACTGAT GCTAGACGAA 12600
TCACTTGTAG ACATTGAGTC GCTTTCTGAT GCACTGATGC TCATAGAGTC AAATTGACTA 12660
TTACTTGTTG AGCTTGACTG CGAATCGCTC ACACTTGTTG ACGTTGATTC TGATCCACTC 12720
ATACTTTGCG AGCTACTCAA TGATTTTGAA TCACTTAATG AATCCGAAGT GCTAAGACTT 12780
GTGGAACCAC TTAAAGATAT TGATCCACTT AATGAGTCGG AGTCACTTGT ACTAGTAGAA 12840
TCACTCATTG ATATTGAATC ACTTAGcGAG GTAGACTyGC tTACGCTTTC TGAACCACTT 12900
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™""" C «™™™ ™T« TGACGTCGAT ' „. 

ACACTTAATG ATOACGAATC °" T »«•

13059

(2) INFORMATION FOR SEQ ID NO: 221:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10758 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:
*~C TTACCTAftAA AACCGGGnAA ACCCTCCAAA ACCCATTAAA AGGnTGGnTA 60 
CCCTITAAAA TGGTAGCATT TAACOGCCAC CCGCCAAGGT GGGIGGTITA j 

CTACACCATG CAGATTCTGT AGTTGAGGGA TATTTTAACG AAAGCTTATT 
AGCAACTGAT AAAAAAATAC GTCCTAAGGC ATATATTGCT TCATGGAAGG ACA!^ 

~aa atagaattta aaattaaaaa aggtattaaa tggcatgatg gtaatgaI 
«~»™» gattggattt attcaattga agtcttagct aacaaggact acgaa^g" 
~ca agtgtagaaa atatccaagg tgcgaaagat tatcatgaag gaZZa 
™agc ggattgaaga aaatagatga ctacactatg caggttacat tZZ 
acaagaaaat tacttaacag gatttattac tggacctita xtaagtaaaa aatatttT 
— acca attaaagatt tagcgaaatc agataaaatc cgaaaatatc cta™ 
tggaccgtat aaagttaaga aaatogttcc aggtgaggct ottcaactcg „aZZa 

TGATTATTGG CAAGGTAAGC CTGCACTAGA CAAAATCAAT TTAAAAGTTA ^ "° 

rrBB - _ wwaatcAAT TTAAAAGTTA TTGATCAAGC 72 0 

--TTATT AAGGCAATGG AAAAAGGCGA TATTGATGTT GCGAATGATG CTACCGGTGC 7 
"-CAAAA GATGCTAAGT CATCTAATGC TGGTCTCAAG GTATTATCTG CGCCAAGCTT 
-CTACGGT TTAATAGGCT COTATCTCAT GATTACGATA AAAAAGCTAA TAA^ 
AAAGTGAGAC CAAAATATGA AGACAAAGAA TTACGTAAAG CAATGCTTTA TGCAAT^AT 
AGAGAAaAAT GGATCAAAGC GTOTTCAAT GGTIAOGCTA GTGAAATCaA TAGTTTTGTA Z 

:~ c a ~ ~ ct -™ «~ *~ 

~TA AAAAAATCTT AGATAAGTTA GGTTATAAAG ATAGAGATGG TGACGGATTT 
AGAGAAGATC CTAAAGGTAA TAAATITGAG ATTAACTTTA AACATAATTC AGGTTCTAAT 
CCTACTTTTG AACCAAGAAC TGCTGCGATA AAAGATITCT GGGAAAAAGT TGGCTTGAAA 
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AATACGATTC CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC 1380 

GACAGACCTT TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA 1440 

GAAACAGAGC GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT 1500 

GTTTATCAAT TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT 1560 

CTTCGCGTCT TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT 1620 

ACTAAAGATT CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 1680 

ACGTACATTG AGTCATTATT AAAAGAACAC AAATTGTTAT AACGAAAACC ATTAATAGAT 174 0 

TTTTATTTGG TGATTTCAAA TCATGAGACT GGGACAGAAA TGATGTTTTC ATAAAAATTA 1800 

TTTCGTTGTT CCACTCTCAT GATTTTTTTG ATGAAACATA ATTACATGAT TGATTGCATC 1860 

ATTTTGTTAA ACAAGTGATT GCAAACCTGC CATTTCACAC TGAAAATTTA CATAATAAGT 1920 

20 GACGATATTT TACAAGTCAT ATACAAATAA CATATATTGT TAAATAATTT TACCTAATCT 1980 

TAACATTAAA TTTACAATTA TAAGCGATAA TCTAAATATA AAGCTTATTT GAGGTGAAAT 2040 

AATGGAAATG TCGGTTACAG AAGTCATTTT CTCCTTTTTA GGTGGTTTAG GTATTTTCCT 2100 

25 TTACGGCTTA AAAATCATGG GAGACGGGCT TCAAGCATCA GCAGGAGACA GGCTACGAGA 2160 

TATTTTAAAC AAATTTACAT CAAATCCAGT ATTAGGTGTT ATTGCAGGTA TCGTTGTAAC 2220 

TATTTTAATA CAAAGTAGTT CAGGTACGAC AGTTATCACA ATCGGACTGG TAACAGCTGG 2280 

30 

ATTTATGACA TTGAAACAAG CCATTGGAGT GATAATGGGT GCTAATATCG GAACAACGGT 2340 

AACTGCATTT ATTATCGGTA TAGATTTAGG CGAATATGCA ATGCCAATTT TAGCATTAGG 2400 

TGCATTCTTA ATCTTTTTCT TTAAACGCTC TAAAATCAAT AACATTGGCC G CAT ACTATT 2460 

35 

CGGTTTCGGT TCACTATTCT TCGGTCTAGA^ ATTTATGGGT GATGCGGTTA- AACCTTTAGC 2520 

ATCA3TAGAT GGATTTAAGC AATTAATGCT TGATATGTCT ACAAATCCAA TACTCGCTGT 2580 

40 CATTGTCGGC GCAGGGTTAA CAGCACTAGT TCAAAGTTCA AGTGCGACGA TTGGTATTTT 264 0 

ACAAGAATTT TATCAACAAG ATTTAATTAG CTTAAACGCA GCAATCCCTG TGTTACTAGG 2700 

CGATAACATT GGTACCACGA TTACAGCTAT CTTAGCTAGT TTAGCCGGCT CAATCGCTGC 2760 

AAAACGTGCG GCGCTTGTAC ACGTCATCTT TAACTTAATC GGGGTAATTA TCTTCACAAT 2820 

TTTCTTGCCA GTTGTGATTC ATTTGATTAG TTTGTTACAA GATTTATGGC ACTTAAAACC 2880 

AGCGATGACG ATTGCAGTAT CACATGGTAT CTTCAACATA ACAAATACTT TGATTCAATT 2940 

ACCATTTGTA GCAGGTTTAG CATGGATTGT TACAAAGCTT GTCCCAGGTA AAGATATTGC 3000 

TGATGACTAT AAACCTCAGC ACTTAAACAA AGATCTTGTT TATCACGCAC CTGGTGTTGC 3060 
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AGACATTCGC GAAATTACAA AAGACGATAA AAAATTGATC AAAAAGCTTG AACAAAAGCA 3180 

TCAAGCTGTT GAAACAATCA ATGATAGCAT TCGAAATTAT TTAGTTAGAA TTTCTACAAA 324 0 

AGCCATTACG AAGGCAGACG TTGAGCGTTT AGCAGTTATG TTTGATGTCA ATCGCTCTAT 3300 

TTTAAAAGTA GCAGAGCTAA CAGAAGAGTA TGTCGCTCAA TTAAAACGCC AACATGATGA 3360 

AGATATTCGC ATTACAGAAG ATGCACAACG CGGTATGGAT AAATTATTCA ACCATGTTGC 3420 

TGAGTCATTT GATAAAGCCA TCGACATGTT AGATGTTTAT GACAAAACGA AAAAAGATGA 34 80 

AATTGTAGAA CGTAGTAGAG AATCATTTAA TATTGAACAT AAACTACGCA AAGGTCATAT 354 0 

7S TAAACGCCTT AATCGTGGTG AATGTACAAC AAAAGGCGGA TTACTATATA TCGATATGAT 3600 

TGGTGTTCTT GAACGTATCG GTTATCATTC ACGAAATGTT TCTGAAGCAC TTGTTGGCCT 3660 

TAACGATGAT GTACCTACAG ATGAAGAAAT TGCAACAACT GAAATTTAAT TTTTACTGTC 3720 

20 TTATTTATAT TCATATTTTT TTAAAATTAG AGATTCAGAT GCATGTAAAA AGCCAATCCA 3780 

ACATTCATGG GTTGGCTTTT TTGTTTAGCA AAATTTATTA TCTTAAATCG GCTATAAACA 3840 

CTGATATAAT AATGCTTCAT TAGTATG CGG TAAGCATGAC GGACACTGTT CTCGGAGTCT 3900 

GACCCCGAAA CGTTTAATAT ACACTTTTAC ACGTCGCCTT CATTGAAGCG AATTGCCATA 3960 

ACCTTCACAT TATATATAGT TCTTTCCATA TAAATGTCCA AATTTTTAGA ACAACGCAAT 4020 

AAATAACCAT CCACCTAACT TATCAAAAAT TTAAGTGGAT GGTTTTTCAT TTTCATTTAT 4080 

ATTTATATTA GTGTTAATCC AATCATAGAT TTATCTATAT GCACTGCTCT ATACATTTCC 4140 

TCATTTAATT TGCTTTACTT TCATTTATAT CATTATCAAA ACACTTGGCG TGTCATCGTT 4200 

ATTATTTCGC ATCTTTGACA CGTTTATCAT CATTAGGAAT CGCGAATAAA ATTGCGATAA 4260 

ATGCCATGAT TCCCATTAAT ACGTTAACCC AAAGTGCAAT CATCGCACCT GTATGAATGC 4320 

TCGTTGCAGC AACTGCACCA GCATATACAG CACCACTAAT TGCGACACCG AATGCGCCAC 4380 

40 CAAGTGATGA AGCCATTTTA TAAATACCTG AAGCAACGCC AACTTTATCT AACGGTGCAT 444 0 

TCGAAATAGC TGTATCTGTA GAAGGTGTTG CATAAATACC TAAGCCTAGT CCGAAACATA 4 500 

AATATCCTAC GACACAACTG ATAACATAAA ATATGCCTGG TAAGAATACT AATGAAATAA 4560 

GTGCAATACC AATGACCACA ATGAATGTAC CTAATAACAT TGGTCGCTTA GAACCCATTT 4 620 

TTTGTAATAA TTTTTCACCA ACTCGAATCA TCAATAACAC CATGATTAAA TAAGTAATTG 468 0 

ATAAGTATCC TGCCTGCAAT GCTGTATAAC CTAAACCTTG TTGCACGAAT GTATTCGCTA 474 0 

CAATTAATGT ACCTGCAAAA CCGTTTAATA AGAAGTTCGA AATCGTTGCA CCTGTATATG 4800 

GTTTATTTTC AAATAATTTA AAATCAATAA GTGGATTATC TACTTTTTTC TCAACATTTA 4860 
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AACCAAGTGC TGCACCTTTA GTAATGACAA CGTTTAAACT TAGCAACATA ACTACTAGAA 4980 

CAATTAGCCC TGCAACGTCA AATTTATGTG TATTGGTAAT TTCTGATTTC GTTTCAGGCG 504 0 

TCCCTTTGAT GAGTAACATT GAAAGTACGG CAACGATAAT TGAGAAGATG AAAATCCATC 5100 

TCCAACCCAT AGTTGTCGCA ACTGCACCAC CGAAGAGTGA ACAGATACCA CTGCCACCCC 5160 

AAGAACCGAT AGACCAATAA CTTAAGGCAC GCTGACGTTC AGCACCCTGA TAATAAGTTT 5220 

TCATAATGGC CAATGTAGAA GGCATAATAC ACGCTGCTGA TACACCTTGT ATAACACGAC 5280 

CTAAAATTAA TAATGCCGGT AAATTCGTAA TAATAATTAA TGCTGAACCA ATAATACTTA 5340 

ATAATAAACC GATATTCGTC ATTTTCACGC GCCCAATTTT ATCTGCCAGA CCACCTGCTC 5400 

CAACAACAAA CATGCCTGAA AATAGTGCAG TTAGACTGAC CGCAATACTA ATTGTCCCCA 5460 

TGTCTGTACC AAAACTTTGT TGTAAATTCG GTACAACATT TACAAGTGAT TGTGCAAACA 5520 

20 ACCAAAATGT AATAACACCT AATACAATAC CTAAGATTAA CTTGTTGCCC CCGCGATACG 5580 

TTTCATTCAT GTTAGTTATC TCCTTTAAGG TAATCTAAAA CAACTGTCCC TACTGCTTCT 5640 

GCAGAAATAA GTAATGATTT TTCTGAAATG TTAAATTTAG GATGATGATG TGGGTAAATT 5700 

TCACCATTTT CCACCGCTGC ACCTGTATAA ATAAAGGCAC TTGGGCGTTC TTTAGCATAA 5760 

TATGCAAAGT CTTCTGAAGG TGGTTGTGGT TCACACATTT CAACACCAAA ATCAAGGTTT 5820 

GCTTCTTTCA ACGTCTTAGC CACGTACTCA GTAAACTCTG GATCATTATA TAATGCTGGA 5880 

TAATCATCGT TATATTCTAA GGTGCAAGTt ACACCATACA TATCCTCTAA TCCTTTTGAT 594 0 

AAACGTTTAA TTTCTTTTTC AATTGTTGCT TTTGTAGCAT CTGTTAATCC ACGTACATCA 6000 

CCTTCAATTT CAACAACATC TTTAATGACA TTGAATTGAC CTTTACCGTC AAATGAACCG 6060 

ATTGTGACAA CACCGGTTTC AAATGGACTT AGTOCTCTAC3 ATACAACTGT TTGTAACGCT 6120 

GTGACGAAGT AGCTACCTGC AACAATGGCA TCATTGGCCA TATGTGGTGA TGAACCATGA 6180 

CCACCTTTAC CTTGAACTTT CAATTTGAAG AATGCGCGTC CTGTTTGAAC ATAACCAGGT 6240 

CTGTAATACA CTTTACCTGT TTTCATTGTG CTCATGACGT GTACACCTAA TACATGATCA 6300 

ACACCGTCTA ATACACCATT TTCAATCATT GTTTTAGCAC CACCTGGTGG TACTTCTTCA 6360 

GCTGGTTGAT GTATCACAAC GACTTTTCCT GTAAAACTAT CTTTCATTTC AGCAAGCGTC 6420 

TCTGCTAATA CAAGCATGTA TGCTGTATGT GCATCGTGAC CACATGCGTG CATAACACCT 6480 

TTATTTTGTG ATGCAAAAGA TAATCCTGTA TCTTCAGTAA TGGGTAATGC GTCAAAGTCT 6540 

GCACGGATTG CTAATGTTTT ACCAGGTTTC CCTGAATCAA TCGTTACTTT AATTCCACGT 6600 

GGTCCGACAT TCGTTTCTAC TTCCACATCT TTACCTTTGT AAAATTCAGC GATGTATTTC 6660 

ss 
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ATCATTTTGC CTTCTTTAGA rrrra R ^ 

UMUA aahAAAGTT TCAATTAATT rTTY-AT-r^ ~ _ 

^ UAArr GTTGATTCAT ATCCTTCATC 

«™X» CA^AAA, GATTAATCAT ^A. GCCAACAACA GAGATCTTAA 
A^r TAGCTTOMi TATAAAAAAT CACAAATTAT GTATATCAAA 

«™»», TTATCAACTT GGGTAACGTT TTACTTCAAT .AAC^AA.c 

«~ GTTTAATGCC AGTGTGTCCG K a™ A^AC. 

TAATCTATAG AGTCCTTGTC CA TC CCA TCT 
ATCGTATGAT G^GCGCCA AAGATTAAAT AACGCACCAT ™ ATC 

™ OT TATACATACC X^CCAGG GCATTAATGT TCAAATGAAT CA^CCGTC 
GCAOGCATAT TC A™ GTCCAACGCT AAGTACOGAT T AAAA TCOT TCCatcat „ 

ATCMraCC ~* 

™AA A^GAA, AGAA^a T aa™ amn ^ ATGACTGACG 
-AAA-T AATTCAACCA AXACCCGA.A CCC^ TAGAACTTAA TTTTAATAAT 

T ~ TC ™ — — 

AATCTATTCC AATTGAGTAA AA^AG^ga CGC^AGGC GA^CCA^ CA^AGTCG 

~ T TC g™ aaxaa^ ^cgg ™ to 

™™ AATCAACAGC GTCATTTGAA 

*»» CTTOCOiio 

«™ TAGGGAATAA TGCTTTAAAA TAATCAATCA 

CATACAAGTT GAAC^AAA .C^a* GAC^ACATG CAGGGCAACG 
™^TAA AAATTTTAAA ^CA^A TAATCACGTG AGTCATACR3 ATCCAAATGC 
«~~AAAC TAAAG^A. ATCTAATAAA AAGTCTAAAC ACAAATCAAT J^Z 
^TTCGAAA TTTCTGCATC ATACGTGAAT G^CA, GATATAXGGA 

ATGCTAATAC ^CA^ ^ 
«™»^ G^A^AG CAAAX^c ^ 

T ™Z °™ T ocsatctot ™— — » 

TCTTCAATAA AATGATTCAT T^CAATG TAGTCATTTA AAAGTAATAT CAATTCACGG 

I" roATTOCT0G ™™»» —CCA 

TATTGTTTAG GTGTTTGATG CGTAAAGTGT lBHlmnc TC0CAMGCI 
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TTCGCATGCT 
GTAAACTTTT 
CTTAAATCTT 
TTTAAGATAC 
AACTGAATCA 
GAAACCAGTC 
GTTAAGTGAA 
AATTGGATAT 
TTAATGATGA 
GCCTCnCCTT 
CTCTCAGTTT 
TGAATACGTT 
TGTGTGGATG 
TTTTCACCTT 
TTTAAGTCTC 
CTTTTTCGTA 
CTCGGTACCT 
CTGCCATATT 
ATACCTTTAG 
CTCTTAGGTC 
GTCTTAGTGT 
GGTTGAACAA 
TCATCACGAT 
AATGTTTTTT
AAACTAATGA 
ATTTCACAAT 
CATGATTATT 
TCTTTTTTCT 
TTCCCTGTAC 



CAATTCGCGT 
TAGACAGATG 
CATGAAAATG 
TTTGGTTCGA 
CAAGTTGTTG 
TTGCCATTAT 
AATATAGACA 
AACATATGAT 
AAATGTCGCC 
TAATAACAAA 
CTACTCGAAT 
TATTTTACAT 
ATTATTTATC 
TTTTCTCCAC 
CCgAACCTTT 
ATTGTTTTAC 
CTGGATTATA 
GCGCGAAGAA 
TTGGAATGAT 
TACCATCTTC 
TTCTATTCAT 
CCATTTCAGA 
AACCTTCTTT 
CAAAGCTTTT 
CTAATATCAA 
CCTATTCTTC 
TATCCTCACT 
CTACAAACGT 
CyTTCAACAA 



CGTATTTAAG 
GCTCTCTGAC 
TAACTCGATA 
ATGATATGTA 
CTCAGTCAAT 
ATTTCTCAGT 
ATGCACATCA 
GCCATCTTGT 
ACTGTTGCAT 
ACCAATCATT 
TAAATAATCA 
TATTTTACAG 
CTCACTCGGT 
AAAAGTAAAT 
CaACaATAAC 
ATTATAGTCA 
TGATATATCG 
CTTAAAATTC 
TTTATTGTCT 
ATCATGAAGT 
ATATAGAACC 
ACCAATAATC 
ATCGTATAAA 
CTTAACTTCC 
AAAACTAATA 
TTATTATCTT 
TGGTTTAAAA 
AAATTCAATG 
CArCTTCGGy 



AAATGATGGA 
CACCCAACGT 
TAGTCGCATA 
cGCGGGACAT 
TTAGACAACT 
TGATGTATAT 
TCAAACTTGT 
TGAAGTTGAA 
TGCGTCATAT 
AAACTATTGA 
CGTTGCATAC 
CAACATATTT 
TCAAGATGTA 
TCAATGTCTT 
TTTGGTGCTT 
TCATTAGTTA 
CCGTCTTTGT 
TCGATTTCTT 
ACCATTTTAA 
GTTTCACTCA 
ATACCTTTTG 
CATGTACCTT 
TCCTCTAGAT 
GCTTCTTTAC 
TACAATACCA 
TCCTGGATTG 
ATTAACCCAT 
TCTTTATATC 
GCTTTATTTG 



ATCCTACACC 
ATTCGCTTAA 
CTTGATTCAC 
AATGAATCAT 
CATTATGTCG 
TCTTTGTTGT 
CTGCTAAATA 
ATCGATACAA 
TATTTTCATC 
GCCTTTTGAA 
TATCCCTCAA 
GAATTTCATA 
GACTATCAGT 
TATATCCAAC 
TATTCGTTGG 
ATTGATATTT 
ACTTCGACAA 
TTTTTATATT 
CGGGATATTC 
CTATATACTT 
ATTTCATACG 
TATCATTTTT 
TTTTAATCGG 
CTATGCCACA 
ATTTGTTTAA 
ATTTCATATT 
CACTAAAGTA 
CAACTGATGA 
TTGGTATGTC 



TAGCGATTCT 

TTCTGAAAGG 

TTTATCATCA 

ATGCATAAGC 

GATATGTGTT 

GGTCGCATCT 

TTTCATTTGG 

GTCGCGGTGG 

ATAAATGTGT 

ATCTGACATA 

TTCAGTAATA 

TTGAATCGTG 

AAAAAAAGTA 

TGrTGaACCT 

TATTTTATAT 

TGCTGAATAA 

ATCTTTAAAG 

TTCGTCTTTG 

TTTATCTTTA 

CCCGGTTGTA

TTCCCCTTTA 

ATCAAATTCG 

ATACATACTC 

ACCAGCAGTG 

TCGTTTCATA 

TTGATCGAGT 

AATGTTCTCT 

ACCTTTTAAA 

ATATCTTTTA 



8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260
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ACCTCTGGAT TATATGATAT ATCTCCATCT TTATAATTCA TTAAATCTTT AAAATTGCTA 10380 

TATTGCGCAA AAAACTTAAA GTTTTCGATT TCTTTTTTTA TGTtTTCTTC TTTAACTTCC 10440 

TCAGTAGAAA TGAATTTATT ATTAATCATT TTAACTGGAT ATTTTTTTTG ATTATCCTGA 10500 

GCTACTTCGT ATTTCTCCGT CTTTAtTTCA TTAGTATAGT AAAAtCCTTT TGCACTTCTT 10560 

GTATTTCTAT CTATCTTCAA AAGCATGCCT TTTATTTTTA GAGCTTCTCC TTTATTTTGA 10620 

ATTGCCATTT GAGAATTTAC AATCCATGTT CCCTTATCAT TTTTATCAAA TTGATCATCA 10680 

CGATATCCTT CTTTATCGTA TAAATCCTCT AGATTTTTAA TCGGATACAT ACTCAATGTT 10740 

TTTTCAAAAC TTTTCTTT 10758 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:
nTaTCaACTT TGGaATTTAA AgTCAATAAC TTTTTTAAAA ACTTTTTGTG TTCACAACCC 60 

GCTTCTTTTT CAACGCGTTT ATTGCTTAAC ACAAGAACTT ATTTTACCAG CATTCCAAAA 120 

CAAATCAACA TAAAAACGTA CAAAATAAAA GTAATTTTGT ACGTTTAGCA TATATTATAC 180 

CTATTTATTT GTAGCAGCTA TAACTTTTTG TGCAATCGAG CTATAAATTT TACCTAGACG 240 

ATCATCTGAT TGATATATTG ACGGTGCAAA ATCTTTTGGA TTCCAAGATG GTTGCTCTAA 300

AGGTAATTCC CCAAGTAATT GAGTATTAAG TTCATCAGCT AACTTAGTAC CGCCACCTTT 360 

GCCAAAGACA TATTCTTTAT TACCCGTCTC TTTACTTTCA AAATAACTCA TGTTTTCAAT 420 

TACGCCAAGA ATAGAATGAT CCGTATGTTT TGCCATCGCA CCTGCGCGAg CTGCAACAAA 480

TGCTGCTGTA GGATGAGGTG TCGTTACAAT AATTTCCTTA CTTGAAGGTA ACATCGTATG 540

AACATCTAAA GCTACATCTC CTGTTCCAGG TGGAAGATCG AGTATTAAAT ATTCAATGTC 600

TCCCCATTTA ACTTCTGTAA AGAAATTCGT CAACATTTTA CCTAACATTG GCCCTCTCCA 660 

TATAACTGGC GCATTTTCTT CCACAAAAAA GGCCATTGAT ATAACTTTAA CGCCATGACG 720

TTCAACTGGA ATTACTTCCT TCCCTTTAAT TCCAGGCTTT TCATCAATAC CCATCATATC 780 

TGGTACACTA AATCCATATA TATCGGCATC TACTAATCCG ACTTTTTTCC CTTCACGAGC 840 

TAAGGCAACG GCTAAATTTA CTGCAACAGT AGATTTACCG ACACCACCTT TACCGGAGGC 900 
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_ —tvotcc ggcaatgttt caaatcgtat 

ATTTTCTTCT m^m. .«»™C «TX*«« 
ACCGACCGTT TTCGCACCGT TTTCTTTTAA TGCATTAACA ACAGCCATC 
aTTGCGtGCA CCACCTAATT GTGCCATTG 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3997 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:
remB m AAAAAATGA T TGTCTAGTTT GTArcTCTCT GAAGA.TTGG 
^CCGATAA COGTA.AATG ATTA.CGACT TAAAGTTTAT GTGGCATTTT 
^CAGG. GAGTTAGA.G «"«™» GATAGATTAT TGCTTATAAT 

ttgaatcata tcxttcattt cactgattag tgcttcttta ggattagc^ 
RTOT caaat gcgagcxctg ccattctatc aatigactca TTTAATTCTT 

ACCTTGTGAT TTCAAATTCA ^CAATTCC GACTGATTGA CCTAATTCGT 
AgCTAATGAT TCTACGAGTG C^CIGTCGT AnACCTTTT AATCCTAAGA 
ATCfGCATAA TCTGTATCTG CTCTGAAGAA CTCATATTTA GGGAATAATG 

GGATAATATG GGGTAGTAAT ATCGCATTCG 
CGGGTCTTTG GCATTATAAC OOATAKUU" 

^ACCA TAXXCGCCAC CAATTTTATG CGCAATTGAG ^CAA^C 

mttgcaaat~ gccataccag ccaaagtt^sa - tgcgttatgc -attttctctc- 
^ccrrr tcaacagatc attttaaata ttcgaacgtc aatttaatog 
^ccxctt gigtagtc aagccattac agatacata. ga.icca.xg 

T ACAXCCA« CCTGTATCTG C^AACCCT ^CACA CCATCACAA 
^^CA AOGTCAGGTG X.AAAGCAAA AXCAGCCAAC GGA^ 
ACTATCTGTG ATAACTCCAA ATGGTGTTAC TTCTGAACCT GTACCTGATG 
ACAAATGAAC GTCGCATTTT CAGOCATGCC TATTTTATAA GTACGTTTAC 

^ ttagcaccga agaatga^ oxcaggg- tcaaagaaca 
tccatcgcig aaccaccacc aagtgcaatg 

AACCATCATT TCCAGACCTT ^ACXG. ATTAGTTGAT GGGTTCGGTT 



1020 
1080 
1109 



CAATAAATAA 
TTACTTTTGT 
CATATGATGT 
TTGTACATTG 
CTTCAGACAC 
AGACAGCTTT 
ATTTGGCAAT 
CATGTTTTTG 
CTCTACCATG 
CTAAGAATGC 
JTKAAACTTT 
CTTGTAGACT 
CATGCGTTAG 
ATTGAGGGTC 
CATTTGTTTC 
TCGTAGGGAT 
CGATGTCTAG 
TCCACATTGC 
GTTGGAAATC 
CGACTTCGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200
ATAACCGAAT TCTACCATAC CAGGGTCACA GACAATCATC ACTTTTTCAA TCTTGTCCAT 1320

TGTTGTTAGA CTCATGATTG CATTTTCTTC AAAATAAATT TGAGCAGGCA CCTTGAAAAT 1380 

TTGAGTATTA TTACGTCGTT TAGCAATCGT TTTAATGTTT AATAAATCTG TCGCACTAAC 1440 

ATTATGTGAA ATTGAGTTTC TACCGTAGaA CCACAACCTA ATGTTAAAGA CGGAATCAAT 1500 

TCGTTATACA TATCACCAAT ACCTCCAACC GCTGATGGTG TATTTACAAG TACACGACAA 1560 

GCTTTCATTC TTAGTCCAAA ATCTTTTTGT AATGTTTCAT CTTCTGTATG GATAACGGCT 1620 

GTGTGTCCTA ATCCACCAAA ATGTAGTGTG TCTTCACAAA TTTGAAATGC TTGTTTTGTA 1680 

GATTGGGCTT TTACTAAGGC TAATACTGGA GATAATTTTT CACGAGATAA CGGATAGTCT 1740

GAACCTACAC CGCTAATTTC GGCTATGATA AGTTTTGTAT TTTOGGGGAC AGGTATACCT 1800 

GCTAATTCAG CTATTTCAAC TGCAGATTTA CCGACAATAT CAGGCTTAAT ACCTGTTTTT 1860 

TGTTCATTcA TAATTGCATT TTCTAAGCGT TGTAATTCAT CTTTTTTAAC AAAGTATGCT 1920

TGATGTGCTT TAAATTCATT AGTAACATCT TTATAAATTT CTTTATCAAT GACTACAACT 1980 

TGTTCAGAAG CACAAATCAT ACCATTATCA AATGTTTTTG AACCAATGAT ATCATTTACT 2040 

GCACGTTTAA TGTGTGCTGT TTTTTCAATG TAAGACGGCA CGTTACCTGG TCCCACACCT 2100 

AATGCCGGTT TGCCAGTTGA ATATGCAGAC TTAACCATGC CCGAACCACC TGTTGCTAGA 2160 

ACTAATGCAA TACCTTTGTG ATTCATTAAT TGTTTTGTTG CTTCGATAGA AGGCACTTCA 2220

ATCCACTGAA TAATATCTTT AGGTGCACCT GCCTTCATTG CCGCTTCTAA TACAACTTCT 2280 

GCTGCACGCT TCGACGATTC TTGTGCACTT GGATGGAATG CAAAAATGAT TGGATTTCCT 2340

GTCTTAATTG CAATCATCGC TTTAAAAATA GTTGTCGACG TAGGATTTGT TGTTGGCGTA 2400

ACACCACAAA TAACACCAAT TGGTTCCGCT ACATACGTTA ATCCTTTTTC TTTATCTTCA 2460 

CCAATAATCC CTACTGTCTT ATTGTCTTTT ATTGAATTCC ATATATATTC AGAAGCGTAT 2520 

AAATTTTTAA TCGCTTTATC TTCGTATATA CCTCTTCCAG TTTCTTCATG TGCTAATTTT 2580

GCTAGCACCA TATGTTGATC AACAGCTGCT AAGCTCaTTT GATGAACAAT ATGATCAATT 2640

TCTTCTTGTG ACTTTTTAGA TAATGCTTCT AATGCTTTTT TCCCTTTGTC AGCTAGAGCA 2700 

TCAATCATAA TTGCCACTTC TTGTTCTTTC GATCCACGAT TTTCTTTTTC AGGTATAGTT 2760 

AACATATACA ACCACTCCTT TATACTTTGT GAATTATTTC ACAAACATTA TAGTACATGT 2820 

CTCTCAGGAT ATAAAGAAAA TTCTATACAA AAAAGTTTAA TTTCGAATAT TATTTGAACA 2880 

AATATCAAAT TTTAAAATAA ATGTTTTCAT GAAATCATTG TTATTTCGGT GTTTTTAGAA 2940 

TGATTTTATA ATCATAATTT TTTCAATGAC ATAATTTATT CATAATTATA TATTTAATTC 3000 
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TCCTTGTCGA TACCTATCAA CAGATGTTAC AAATAAAAAC CaCCCGTGTG AACGGGTGGT 
TTGTTCTGCG gCTATAAGCC TTCCTTACTG GCCaGCCCTA AAAGGGCACT GACAAGTCAG 
CCAACTGCAC TACTATTCCA GCAATCCTAA AGGTTTACTC TTTTTTCTTT CTTTnTTAT 

rrrrcrcTCC agtgaaagga tctaaatatt cttccattga aatttcatct gcaacgatat 

CCTCTTGTAA TTGATTACGA ATATAATTTT CAATCACTTT TTTATTTCTA CCTACTGTAT 
CCACATAAAA TCCTTTACAC CAAAACTTTC TATTTCCATA TCTATACTTT AAGTTAGCAT
GTCTATCAAA TATCATTAAA TTACTTTTCC TTTTAAATAG CCAACAAATG ATGATACCCC 
AAGTTTGGGT GGTATACTTA CTAACATATG GATATGATCT TTACATACCT CTGCTTCAAT 
TATCTCTACA CCTTTTCTTT CATATAATTG ACGTAATATA ATCCCTATAT CTTTTTTTAT
TTTTCCATAT ATCGCTTGTC TTCTGTATTT AGGTGCAAAG ACAATATGTT ACTTACAATT 
CCATTTCGTA TGTGCTAAAC TGTTTGTGTC AGATGACATT AAATAGCATC TCCTCGTGTT
GATTATTTTG GTTGGCTGAC CAATATTTAC TCTAACATGT AGAGATGCAT TTTTTTGACA 
ATGGTAGAAC CTTTTCTGGG GAGTGGGACA GAAATGATAT TTTCGCAAAA TTTATTTCGC 
CGTCCCACCC CAACTTGCAT TGTCTGTAGA AATTGGGAAT CCAATTTCTC TTTGTTGGGG
CCCCGCCCCA ACTCGCATTG CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC 
CTGACTaGAA TTGAAAAAAG CTTaTTaCAA GCGCATT 
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:
GnGCGAGACA AACACACtTA TTGGTGCCAT TATmCcTAGA ATGaATTCaT ATGCAGTAGA 
TGAaaCAATC AAAGGATTGG CAAAACAATG CCAAAAATAT GAATCaCAAT TAATTTTAAA 
TTACACAGGT TTAAATATCG AAGCAGAAAT ACAAGCGCTT GAAACATTAG CACGCaGTAA 
AGTAGATGGT ATTGTTTTAA TGGCTACAGA CATAACAGAG AGACATATTG AAGTCATTAA 
TAAAATGAAT GTACCAATCG TTATTGTTGG TCAACAACAT GAACAACTTC ATAGTATTGT 
GCATGATGAT TATAAAGCAG GTCAAATTAT AGGCGAATGG ATTGGTCAAC AGGGATATCA 
ACAAGTTGAA GTGTTTAGTG TAAGTGAAAA AGATATTGCA GTTGGTATAC ATAGAAAACG 
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3660 

3720 

3780 

3840

3900 

3960

3997 



60 
120 
180 
240 
300 
360 
420 
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TACTTATGTG GAAGCACAAA 

TGCGGTTGTT GGAGCAACTG ATACGATTGC ATTAGCTCCC 
AAAAGATGTT ATGAAACCAC ATCAAATATA TOCTTTTOOr 
AGTGTCTCCA TCGATAAAAA CAATTCATTA TAATTATTTT 
GGaAGAGATA CAACAGATGC TTAAAAAGCA AGATATGCCA 
TAATATTTAG ACGCTGTATT TTTTAAAATA AATGTGGAAC 
TGGATAGGTT AAAAGTTAAA GAACGTAGGT AAAATTTGCT 
AACAGCATAA ATTATAAAAG GAGGACTGGG TAAATATTAT 
AACGTTATCA ACGAATCGAG GACGTTGATA CTGAGTATTT 
TTGATCAATC AAAATTTCGT CAACAATTTC ATATACAACC 
ACCCCAACGG ACTTATTTTT TATAAAGGGA AGTATTATGT 
TAGGCGCAGT ACATGGCTTA AAGTATTGGT ATAACTACAC 
TTAAAGCTGA AGGGCCAATT TTAAATCCAG ATACTAAATA 
GCGGTAGCGC TTTTGAATAT AACGGGCATT TATATTATAT 
ATAATCATTG GCAACGACAT GCGAGTACAG ATGATCGCAC 
AGTTGGnAAA A 

(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 930 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



GAAAATGTGG 
TATAAATATT 
GGTGACCCAA 
GAAGCTGGCC 
TATAGCGTCA 
CGATACCATA 
ATAATAGAAT 
GACCGAATGG 
TAAAACATTA 
AGAAACAGGC 
TTCACATCAA 
GAGTGATGAC 
TGACAGCCAT 
GTACACAGGA 
GATTGAAAGA 



AGCAAGTAGA
ATTCTGATAA 
TGACACAATT 
AATGCGCGAT 
CAGTAGATGT 
TAACTATAAA 
ATAAATTGTT 
ACTAGAGAAG 
AAACAACAAG 
TTATTAAATG 
TGGTTCCCAT
TTAATAAACT 
GGTGTATATA 
AATCATCGAG 
AGACGGTTnC 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1391
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:
ATTTATTTTA TOtat m ^ 

TTTCAATTCA AAGACTQGTQ 

TAACATATGT AAACAGTAAT GTTTACGACT CAAAA^AGA CAAAAXAAAG 
CCTTCAAGTT TTATTTATCG CATTTCTTGA AGAGAGCATT ATCATTTTAT 

cota™ tastotcoo tcaaattcct o™™™ ca tocaatt 

A™,, TTTATTmc TTATCTTCAC CAACATAAGG ^aag 
TATCTTTAAA ACTATCAIGA TGCACAATCT AATTTAATGC ATCAAAACCA 



AATTGACATT 
CAACTCCTAG 
AGATATGCCC 
TGTTGCATAA 
TCAAGTTTAT 
ATTTTCGGAA 
ATGTAACCGA 



60 
120 
180 
240 
300 
360 
420 



EP 0 786 519 A2 

GAACAACTTT GATTCTGTCG ACTCCAATGA TTTTATCAAA TTCATTTAAT ACGCCATCAA 
AGTCCTCTTT AACATTATAT CCAGCATCAT GCGTATGACA TGTATCAAAA CATACTGATA
AACGTTCGTT ATTATGAACT CCATCAATAA TACGTGCTAA CTCTTCAAAT GAGCGACCAA 
TCTCTGTACC TTTACCTGCC ATCGTTTCAA GCGCAATACG TACATTATTG TCATTCGTTA 
AAACTTCATT TAATCCTTCA ATAATCTTAT TAATTCCGGC ATCAACACCA GCTCCAACAT 
GCGCACCTGG ATGTAATACa ATATCTTTAG CCCCTATAGC TTGCGTTCTk TCaATTTCTT 
GTTGCAAGAA ATCTACACCA AGATTAAACG TTTCTGGTTT GGTTGTAtCG CAATaTTaAT
GATGTATGGT GGCATGAACA ACAATATTAG 
(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

TGACGCACCA ATTTATAACG CAATTGACAA AACAATTAGA TATACCTGTG AAATTTGTAC 
CTGGAAACCA TGATTTATGG GAAGTTGAAA GTATGACTAC GCAAGACATT TCGAATAATT 
ATAAGAGTAT GTCACAGTGC TTGGTAGGAA AACCATTTAT AGTAAATGAA GAATGGGCAA 
TCATAGGACA TACTGGCTGG TATGATTATA GCTTTGCAGC ACAACGATTT TCATTAGATG 
AGTTACAAAA AGGAAAACAT TATGGTGCGA CTTGGCAAGA TAAAGAACGA ATATCTTGGG 
GCATATCA"GA^CAAAATVrA- TCTAAAATAG ^TCCTCAACA - A'GTGAAiGAAA - GATATATTAG 
AAGTAGGAAA TAGACGAGTG ATTTTAGTCA CACATGTTGT GACGCACCCT GATTTCATTG 
TTCCTATGCC GCATCGTATA TTCGATTTTT ATAATGCATT TATTGGGACA AGTGATTTCA 
ATCCTTTGTA TGCGATGTTC GATATACCAT ATAGTATTAT GGGCCATGTT CATTTTCGTA 
AAAGTGTGAT AGATGATGGC AGATGTTATC TCTGTCCGTG TCTAGGCTAT CCAAGACAAT
GGCGTTCAGA AGATATTTAC CAGGAAATAA ATGAGACGAT ACAAATAATA GAAATTTAAA 
ATGCGCAAAC CTGACCCAGT TTGCGCATTT TATGTTTTAC ACACGCGAGT AATGTGTTTA 
CTTACGTGTG TTTATTTTGT TGCTGATTTT CAATTGTATA TGAATGTGGT TGCACATAAA 
TGCACTTTCT TCCTGGTGAA TTAAAGCTGT ATTCCATTTT CTCTTTACGA ATTTTAATAA 
TTTGTTTGCG ATTTGGAATG ATGGCAGGTA AAACTAGGCC ACGACGAATA TGACTCCAAA 



540 

600 

660 

720 

780 

840 

900 

930 




™»™ t^oiggc r^n^ agcggaaaac acgtao™ aoIrafl „ c 

««««« ™ m AACACAGGTT GACC^ATC GA T AA T A OT 

ctaacga™ mcca Kammr taggatota 

ATGCTTCTGG AAAXA.CGTT raraMl 

" ~™ ™«™» -™ «*»«T TC^ACCAG 

GTTCTOTAT — — «o«™» caccccatga 

CCATAGGGAT AGGAAATACC ^ 
TATAGTCCTC GCATTGATTC AATAAATATT X« TOT «„ ATATAGCGTC AATGCAAAAT 
QTCCTAAACA TA TCmTAC ATGAGTGAAT AAAATTAATG GAGTGATAAA ATGGAATATC 
»™CAACA AC^CGTCG TTAACGTTAG TAGGTATTAA AGAAACOTAT GAAAATGGAC 
~TCA GCAACATATA GCAGGG^ CGCAAAGA TC ^ 
CGGATTTACA G1TAAAAAAT AATGGTGATT TAGCCGGGAT AC^A TGTATACCTG 
AATTAGACGG TAAGATGTCA TATATGA TO CAGTTACCGG AGATAATAGT GCTOATA™ 
AAAAATATGA TGTCATAACA TTAGCAAGTT CAAAGTATAT GO^^AA GCACAGGGCG 
CAGTACCTAA AGCAGTTCAA CAAAAAA^ AAGAGGTTCA TCACTACATA CATCAATATC 

™™ ™»™» TCAGGA^GT GATACTACAA 
GTGGAAAATT AATATTACCA GAAATTTGGG ATnCCTGTTA AAGGGGTGAT TGAAATAnGA 

AnTG
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
GATTCCACGT GTGTTAAAAG AAG^ACAcC TTCAATGATG GTATTTACTA A TOOTT AO 
AGATCAAATC GATOGCTTCG GTGAAATTGA TATTATGGTT AATAACATTG CAGAGACAAT 
TAGTAATAAA GGCATCAAAT TATTGCTAAA TGCTGATGAT CCAI^IGA GTCGTTTGAA 
AATCGCAAGT GATACGATTG ^AOXATGG TATGAAAGCA CATGCCCATG AATTTGAACA 
AAGTACGATG AATGAAAGTA GATATTGTCC AAACTGTGGT CGCTXAX^C AATAOGATTA 
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AAAATATGAA ATATCAAGTT TTGATGTGGC ACCGTTTTTA TATTTAAATA TCAATGATGA 420 

AAAATATGAT ATGAAAATTG CAGGTGACTT TAACGCTTAT AACGCGTTAC AGCATATACT 480

GTTTTAAGAG AGCTAGGGTT AAATGAACAA ACAATTAAAA ATGGCTTTGA AACGTATACA 540

TCAGACAATG GTCGTATGCA GTACTTTAAA AAAGAACGAA AAGAAGCGAT GATCAATTTA 600 

GCTAAAAATC CTGCAGGAAT GAATGCAAGT TTATCAGTTG GTGAACAATT AGAAGGCGAA 660 

AAAGTGTATG TTATTTCGCT AAATGATAAC GCTGCAGATG GTCGAGATAC TTCATGGATT 720 

TATGATGCAG ATTTTGAAAA ATTATCTAAG CAACAAATTG AAGCTATCAT CGTGACAGGT 780 

ACACGAGCAG AAGAACTTCA ATTGCGATTG AAGTTAGCAG AGGTTGAAGT ACCAATTATA 840

GTTGAGCGTG ATATTTATAA AGCAACGGCA AAGACTATGG ATTATAAAGG TTTCACAGTT 900 

GCAATACCAA ACTATACATC ATTAGCGCCT ATGCTTGAAC AATTAAACCG TTCGTTTGAA 960 

GGAGGTCAAT CATAATATGC ATGAATTGAC TATTTATCAT TTTATGTCAG ATAAATTGAA 1020

TTTATACAGT GATATAGGAA ATATTATTGC TTTAAGACAA CGTGCTAAAA AACGAAATAT 1080 

TAAAGTTAAT GTCGTAGAAA TCAATGAAAC AGAAGGTATT ACCTTTGATG AATGTGATAT 1140 

TTTCTTTATC GGTGGTGGAA GTGATAGAGA ACAAGCATTA GCAACAAAAG AATTAAGTAA 1200 

AATTAAGACA CCACTTAAAG AAGCGATTGA AGATGGTATG CCGGGATTAA CGATTTGTGG 1260 

AGGCTATCAA TTTTTAGGGA AAAAATATAT CACGCCTGAT GGTACAGAAT TAGAAGGGTT 1320 

AGGTATTTTA GATTTTTATA CTGAATCAAA GACAAACCGA TTAACAGGAG ATATTGTTAT 1380 

CGAAAGTGAT ACTTTTGGAA CTATTGTAGG TTTTGAAAAT CACGGTGGTA GAACATATCA 1440 

TGATTTCGGT ACACTTGGTC ATGTTACTTT TGGTTATGGT AATAATGATG AAGATAAAAA 1500
AGAAGGCATT CATTATAAAA ATTTATTAGG TACTTATTTA CATGGACCAA TTTTACCTAA 1560

AAATTACGAA ATCACTGATT ATCTGTTAGA AAAAGCTTGT GAACGTAAGG GTATTCCGTT 1620 

TGAGCCTAAA GAAATAGATA ATGAAGCGGA AATACAAGCG AAACAAGTAT TAATAGACAG 1680

AGCAAATAGA CAGAAGAAAT CTCGTTAACT CTGAACATCG CATCAATGGA TTTAATATTG 1740 

ATAAACGATG AAGTTTAGTA ATTAATCATA TATGTATAAA CACACACATT ATTTTGGATG 1800 

GAAACAACCA AATTGATGTG TGTTTTtTTG TTCTAGTGAA TAATTATTAT ACAATGAGTA 1860 

TCTATCCTAG AATTATCAAT AGTAATGGTG ATTATGCAAC ATGAAAAAAT GAATGATGAA 1920 

AGGAATTTGA CGATGAAGCC TACTAAAGTG ATATTAAAAG ATGCATCTTA TTTACATAGC 1980 

AAAACATCGA TAACATTTAT TTTAAAAGAT GTAGTTATCG AAGAAGATAA TAAAATTTAT 2040 

TATTTCGACA CTAGTGCACT TTCGAAGATC AAGAAGTTAA ATTTGAATTT GCACTCTTTG 2100 
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TTATAGAACC TGATTTACAT TTTACAATTA TTGATTTTAA TCAAGAACTG CTTTGTATTT 2220 

ATATTGATTT TGATTCTGGT TTAAGGCATT CAAACATGGC AACAGAATCT GGTATTTCAT 2280 

TAAGGATAAA TGTTGCTAAA TCAGATTTTA CTAAATTTAT TAATGAATTA GCCTCTTTAC 2340

ATTAATGATT TAAATCTGAT ATGTAATTAC AATCAAAAAA GACAGCCACA TCCCTCCGTA 2400

GTTTAGGCGT GTGGCTATAT TTGAGTCTGA ATATTTATGC TTGTAATTTT AAAAAGGGAC 2460 

ATGCTATATA CGATAAAAAG AGGCGGGGAC ATAAATCAAT GTTCTATGCT CTACGAAGTT 2520 

ATATTGGCAG TAGTTGACTG AACGAAAATG CGCTTGTAAC AAGCTTTTTT CAATTCTAGT 2580 

CAGGGGCCCC AACAAAGAGA AATTGGATTC CCAATTTCTA CAGACAATGC AAGTTGGGGT 2640 

GTGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA GGCAAAGCGA GTTGGGGTGG 2700 

GACGACGAAA TAAATTTTAT GAAAATATCA TTTCTGTCCC ACTCCCATGG TGCCAATTAG 2760 

CATAAGGTAC TTAAATTAAG CATATCTGCT GTCTAGCAGT CGATAAATCA TTAGAACTTC 2820

GTATAGTATA TGACTTTTAA TTTGATTTTC ACCACTAATT TCAAGTGCTT TTATAGTCGA 2880 

ACGTAAAGTT TCTACAGAAT CATCTTCTCT CTTAAAAGAA CCATCATAAA ATATATCTTT 2940 

GATGCTACTA CTAATTTTTA GCAATGCCAT TTTTTCGTCA CCTGAAAAGT TAACACGAGT 3000 

ATTTTTAGGC AAGTAAATGA TATTTGATAA ATGAGTGATA AACAAACGAT TCGTATATGC 3060 

ACGTTTAGTT AATTGATTGA GTAATTTCCA ATCACATTCT TTTTTCTTAT GATAGCTTAA 3120 

TTCATCACGT TGATAACTTA TTAACGTTTC AACTTGATTA TTTAAATTGA AAATATTTTT 3180

ATATGCTTTT TCGCTTTTAT CAGATTGCAG TCTTGATAAG ATAAGTTCTT GGCAGCGATT 3240

GTAAAATAAT TTATACATCA AGGCATCTGT CTTACTTAAT TTTTCTTCGA CCTGACCATA 3300 

ATACTTAGGT GGAAACACCA TGAAGTTAAT TAAACCTGAT GTCACGAGTC CAATAA1TGC 3360

TGTCAATGTT CGAGACAAAA AGTTGAATAT GTAGGCATCA TGAATACCTG GAATCATAGC 3420 

TAATGATGTT AGTACAGCGA CATTCGTACC AACTTGCAAT TTGAGTTTTG TACAGAATAA 3480

AATCGTGAAC GTTGCACTCA ATGCATATGT AAAAGGTGAT TGATCGCCGA ATAAATATGT 3540 

AAATAATACT GCAAAGCCTG CACCAATTAC CGTAGCAGGT AATCTACGAT AACCTTTAAT 3600 

AAGTGATGCC TTGGCAGTTG GTTCAATTGT GACTACAGCT GTTAAAATGG CATAGATGGG 3660 

TGTTAAATCT AGTGCCATAC AAAAGACAGC TGTTAAAAAA ATGGCAATAC CAGTTTTAAT 3720 

TGTTCTGGCA CCAATTAAAT GTTTATACCA TTGATCGTTC ATTTTTTAAC CTCTAATCAT 3780 

CGTAAAATCT TAGCGAGCGC TTTATAATAA TAGTATCGTA CATTGGAAAA GTTCATGTAT 3840

GTAAAATATT TGAAATAATC ATACATAAGC ATTACTTTGA TTTTCATATA CATTAATCAA 3900 
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CAAGCATTTT TCAATTATAG TCCGGGGCCC CAACATAGAG AATTTCAAAA AAGAAATTCT 4020 

ACAGGCAATG CAGGTTGGCG GGGCCCCAAC ACAGAAGCTG ACGAAAAGTC AGCTTACgAT 4080 

AATGTGCAGG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCTACAATTT CTACAGGCAA 4140 

TGCAAGTTGG GGTACAACGA TAAAGAAATA TTTTTTCTTT ATCACACTAT GTCTCACTCA 4200 

CTTTCCAAAA TACTAAAGTA ACATCTTTAG TATATCAAAG AATTTTTGCT ATAATAAGTT 4260 

ATAATTATAT AAAAAAGGAA CGGGATAAAA TGATTGTAAA AACAGAAGAA GAATTACAAG 4320 

CGTTAAAAGA AATTGGATAC ATATGCGCTA AAGTGCGCAA TACAATGCAA GCTGCAACCA 4380 

AACCAGGTAT CACTACGAAA GAGCTTGATA ATATTGCGAA AGAGTTATTT GAAGAATACG 4440 

GTGCTATTTC TGCGCCAATT CATGATGAAA ATTTTCCTGG TCAAACGTGT ATTAGTGTCA 4500

ATGAAGAGGT GGCACATGGG ATTCCAAGTA AGCGTGTCAT TCGTGAAGGA GATTTAGTAA 4560

ATATTGATGT ATCGGCTTTG AAGAATGGCT ATTATGCAGA TACAGGCATT TCATTTGTCG 4620 

TTGGAGAATC AGATGATCCA ATGAAACAAA AAGTATGTGA CGTAGCAACG ATGGCATTTG 4680 

AGAATGCAAT TGCAAAAGTA AAACCGGGTA CTAAGTTAAG TAACATTGGT AAAGCGGTGC 4740 

ATAATACAGC TAGACAAAAT GATTTGAAAG TCATTAAAAA CTTAACAGGT CATGGTGTTG 4800

GTTTATCATT ACATGAAGCA CCAGCACATG TACTTAATTA CTTTGATCCA AAAGACAAAA 4860 

CATTATTAAC TGAAGGTATG GTATTAGCTA TTGAACCGTT TATCTCATCA AATGCATCAT 4920 

TTGTTACAGA AGGTAAAAAT GAATGGGCTT TTGAAACGAG CGATAAAAGT TTTGTTGCTC 4980

AAATTGAGCA TACGGTTATC GTGACTAAGG ATGGTCCGAT TTTAACGACA AAGATTGAAG 5040 

AAGAATAGTT CAACATATAG TAAGACTAAA GTATGAACAT CATTTAGTTC CGGAGCCTAT 5100

TCATATTGGT TTCGGAACTG TTTTATAATA ATTAAGAACA CAATCAATGC GTCATTTCAA 5160 

AAATATGTTG TAACAAAGTA GTTTTTAAGC AAACATATCA TCGACATCAA CGAAGATACA 5220 

TAGCGCATTT GGTATTTTAA AACTTATTAT AAAAGGTGAT AGTTATGAAC TATGTTGAAC 5280 

GTTATATTGA ACAGTTTTTG AGAGCAACAG TAAGAAATAA TATCAAGCAC TACCTTTTAA 5340 

TGCTAGATGA AAAAATGAAA AATTTAGATG ATTATATGCG TTATTTAATT ACTAAAAAAG 5400 

AACAACTTAG CAAGTTAATT GACAGTCTAA TGCTAACATT AGAAAATAAA TATATTGATA 5460 

TTGCTGAAGC ATTTCAAATT CAATGTGCAA GAGAAATCAA TAATCAAGAA ATTGAAAATA 5520 

TTAAATCAGA GTTGAATAAA GTTGAAGCAT ATTATGCAGA AATTGAAACT CAAATTCAAC 5580 

AAACTTCAAC TGAAAAAATA GCAACAGAAA AAACATCGTA TCTAATAAAT TATATGAACG 5640 

CTGTGGCATA GAAAGGCGGC GAAACATGAC ACACAAATAT ATATCAACGC AAATGTTGAT 5700
(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4400 base pairs

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

ATAGnGAAAG CGTTTTAfa p w , 

. — ^ JZZ ~ c ~*» 

— ccAA-rcrcrr ZT" 

- *5TAATAATCA CAATTCCACT ZZ ^ ^™ 

— - s -~ — - 

AATCAACCGA AGAAGTACCT GCAA^ ^CAGAT GAGGATTTAC 

45 AACCT GCAAATGAAA ATACTP& a * * ~* 

"MAACMC AAAAGATCAA w * TACTC "« TAATCAACAT GAAATPQATG 

—A ^ ~ """"" — — 

~ a^ i^rr :rr Tam ™ 
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GGGAGATTTT TTAGGCATGA GCAATCAATT 
ACAATTCCAA GCTTTCCAAA ATCAACAAAA 
ATCTAAAAAA GGATGGTTCT GGGGCTGTGG 
CATCGGTATT TCAGCTTGTA CAGCTGGTAT 
AGAAACGAAC AAAACCCATA AAATCGGTGa 
TGTAAATTCA GTGGAAACTA TGAAATCTGT 
AGGTATATTT GTCGTTGCTG ATGTGACGAT 
TGATAGTTCA ATGTTTAAGC TAAAATCCGG 
TTCAATGTCT GCTAATCAAA GTGACAATGG 
TATAAATCCA GATAGCACTG CTCAAGGTAA 
AACGCAAAaG ATAAAAAATT AGAAGTTATT 
TTTGATTTAT CCGATGCTAA AAAAACATCA 
GTAGCTGTTG CGAGTTCAAA TAGCGATAAT 
GcTACAACTT CTAGTGCGGA TACTGATTCT 
GATAAGCAGA ATGCGTCTAA AAgTGATAAA 
GAAACTGCTC CTGTAGAGCC CATGCCCCAT 
AGCCAAAATA TTCACAaTGa AGATAGCmTG 
CAGCTCGGCT ACCCTTCTTT TACGGAAAAA 
AAAAAATGAA ATTCAAAGCT ATCGTTGCAA
GTGGTGCTAA TCAACATAAA GAAAATAGTA 
AACAAACTGA CAACACTACA CAGTCAAATA 
AAGATATAGT TCGAAACGAT TACAAAGCAA 
ATAAAACAAA TCTTGAACGA AGCAATGAAC 
ATGCAGTTGG CACACCATTA AAACGTTGTG 
TTAATATTTT TGATGATATG TCAGAAAAAG 
GAAGCCCTAA ATACAATCCA GGTATGAACA 
ACATTCAACA TCATGACATT GATAATAACA 
AAGTCGACGA TAAAAATGAT AAAAATGCTG 
CAAATAATTC TGAAGAAACT AAAGTTAAAT 



CAAAAGCGAA GAAGAGCGAA GACAATGGGA 840 

CCAACAGAAC CAGCAATACG GACAAAAGAA 900 

TGGTTGTCTA GTATTATTTA TTTTAATTAT 960 

TACAGGTAAC CTTGGCGGAA ATAGTTCTAA 1020 

AACTGTTAAA AATGGCGACC TTGAAGTCAC 1080 

AGGACCATCT CTTGCACCAA CAAACGCTAA 1140

TAAAAACAAA GGTAAAGAAG CGTTAACAAT 1200 

TGATAAAACA TTTGAAGCAG ATAATACAGG 1260 

TAGTATAGAA AATTCATTTT TCTTACAGCG 1320 

AATTGTTTcG ATGTGTCAGA AAACATAGCC 1380 

TCTAGTTTAT TTAGCGTCAA GAAGATTACA 1440 

AAAGCTAAAA AAGACAAGCA AGATACAGAA 1500 

GTAAGTTATG AAGCTTCGGC TACTACACCT 1560 

GAAGATAGCG AAAAGTCTAG TAAAGATGAG 1620 

TCTAGTGTAG AAAAAAGTGA ATCTAATGAG 1680 

AGCAAACCTA CCACTAGTGA aGCACCACCT 1740 

TACGACGCTT CAACAGAATA AAATtnyCAG 1800 

TTAATTATAC ATAATCaAAT CaAGGAGATA 1860 

TGACATTATC aTTGTCACTA TTAACTGCCT 1920 

GTAAATCAAA TGACACTAAT AAAAAGACGC 1980

CAGAAAAGCA AATGACACCA CAAGAAGCCG 2040 

GAGGCGTTAA TGAATATCAA ACATTAAATT 2100 

ATGAATATTA TGTTGAACAT CTAGTCCGCG 2160 

CTATTGTTAA TCGACACAAT GGCACAATTA 2220 

ACAAAGAAGA ATTTGAAGCA TTTAAAAAGA 2280 

ATCATGATGA AACAGATGGT GAGTCAGAAG 2340 

AAGCAATTCA AAATGACATA CCAGATCAAA 2400 

TTAATAAAGA AGAAAAACAT GATAATGGGG 2460 

AATGGCATAC TTTGATTAAT CGTAATTTTT 2520 
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ACTATGCATG GTCTTTTTAA TCAACTTAAA CTCGGCATTA TTTCAATCGA AAACGCAGAG 2640 

CATACGCTTT TTACACCTTA TATGTTGGAA ACGCTCTCTT CCCTAGGCGT GAAAGACAGC 2700 

ATTGTCGATT TAATTCATAA AGGGACTGAA TTAGAAGACT TTGCGGCATT TAATTTATCA 2760 

ATTGAAGACA CAGTTACAGT CTGTTTACAA AGAACTGAAG AACTATTAAA ACAATACAAA 2820 

AATGTGGAAT TCAATGACAA AATATTAATC AATTGGCGTA TTATACAAGA GAAATAGACA 2880 

TATAAAAGTC GAATGTAACt ACGTGAGTAT TGATTTTATT CTTTGTAAAT TACAAGCATT 2940 

TCATATTATA AAGTTTGAAA AGAGGTATAT TGAAATGGAG AAAAATGAAT ATATAGCTAA 3000 

ATATAATGAA TATAGTCAAT TATTAGACGC TACATACTCG CAAGCTGTAG CATmCCTTTT 3060

AAGtAAATaT GGCGCTGTAA CCGATGATTA TTATAAAGaA AAATCATACA CX3CGATTTTT 3120 

AAAtGGAGnA ATCAAAAGTA TTTCAAAAGG AAAATACACT AGAGCTAGCG AaGGATTATA 3180 

TTGCCATCAT ATAAGCGAGG ACAAATTCCA AAATCTATCT GATCTAAGAT TCATtTCCAA 3240

ATTTAAGTAC TCATACGACG TTCAAAAGAA AGAAAACTTA GTGTACTGTG ATCTAATCGA 3300 

GCATTTAATT TTACATGCAA TTATTACAAA AGAATCCCAT GGCCAATTTG GTGTAGCTGG 3360 

ATTATGTCAA ATGATCAAAC CAACAGTCAT TGATTGGTAC ATTGGCGAAT ATAATCCAAA 3420 

ACCAGCATGG ATGCAAGCCA CCAAAGCACG TGCCTATTTG CCTGGAATAT TAGTAGAGAA 3480

ATTACTCATT AAAATTGACG ATATGTTAAA AGGAATAGAA ATATAAGATT TCCTTGAGTC 3540

TAGATAAATG ATTAATGTAG ATTTATTTTT TGCTGTTGAG ATTTTGTTAT AGATGTTTAA 3600 

ACCTGTAATT AAATATATTT TATAAAATAG ACCACGCATA CCTATCTATA AACGGrCAAT 3660 

GTTTATAAAT GAGTTTGCAT GGtCTTGAAT TGTATTAAAT TTCTTTTGGT TTTAATAAAT 3720

CGACTAGATT TTCACAATAT TTATCAAATA TGTATTCCTA AATTATACAG CCTTAATCCA 3780 

GCACJCTACTT TCGAAACTTC CAACTTAGTT GATATAAGGT TCAATAGTTT GTTTCGTTCT 3840

TTTTCAGATA AACCAGAACT TAAATTGATA TTATTGACTT CATAAAAATT ATAGACTAAT 3900

GCCTCTATTT GCTTTTTAGG CATAAGTAAG TCGACTGAAA ACTGATTTAC GTCGCTTTCA 3960 

TAAATCATTT CATGTAAATT CTTTAGACTA TTATCGTTAC TATCTCTCAT TAAGTCTGTA 4020 

TTTTTAAATA AATAACGGCC CAATTCACGA GCTATTGAAA ATCTTGTATT ATTAATCGAG 4080 

TGATTATTAT TAATATAGAT TGTTCTTCCA CTTAAATAAC CCGAAGTATT ACCCTCCATT 4140 

TTAATATATC TAACATTTAA ATTAAGTTGA AATAATAGCT TGTCTATGTC AATAGCAAAG 4200 

TGTTCAGAAG TAATAAAAAG TTGATCCATT TTGTCCTTTA TAAATGCCTG AAATAATCGA 4260 

ACTATTTTTG ATTCTAAAAT ATCTTCATAA TGAACTTTCT CAATAACTTT CAATTGATTC 4320 
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